`w.serving_endpoints_data_plane`: Serving endpoints DataPlane

class databricks.sdk.service.serving.ServingEndpointsDataPlaneAPI

Serving endpoints DataPlane provides a set of operations for interacting with the data plane endpoints of the Serving endpoints service.

query(name: str [, client_request_id: Optional[str], dataframe_records: Optional[List[Any]], dataframe_split: Optional[DataframeSplitInput], extra_params: Optional[Dict[str, str]], input: Optional[Any], inputs: Optional[Any], instances: Optional[List[Any]], max_tokens: Optional[int], messages: Optional[List[ChatMessage]], n: Optional[int], prompt: Optional[Any], stop: Optional[List[str]], stream: Optional[bool], temperature: Optional[float], usage_context: Optional[Dict[str, str]]]) → QueryEndpointResponse

Query a serving endpoint.

Parameters:

name – str The name of the serving endpoint. This field is required and is provided via the path parameter.
client_request_id – str (optional) Optional user-provided request identifier that will be recorded in the inference table and the usage tracking table.
dataframe_records – List[Any] (optional) pandas DataFrame input in the records orientation.
dataframe_split – DataframeSplitInput (optional) pandas DataFrame input in the split orientation.
extra_params – Dict[str,str] (optional) The extra parameters field used ONLY for completions, chat, and embeddings external & foundation model serving endpoints. This is a map of strings and should only be used with other external/foundation model query fields.
input – Any (optional) The input string (or array of strings) field used ONLY for embeddings external & foundation model serving endpoints and is the only field (along with extra_params if needed) used by embeddings queries.
inputs – Any (optional) Tensor-based input in columnar format.
instances – List[Any] (optional) Tensor-based input in row format.
max_tokens – int (optional) The max tokens field used ONLY for completions and chat external & foundation model serving endpoints. This is an integer and should only be used with other chat/completions query fields.
messages – List[ChatMessage] (optional) The messages field used ONLY for chat external & foundation model serving endpoints. This is an array of ChatMessage objects and should only be used with other chat query fields.
n – int (optional) The n (number of candidates) field used ONLY for completions and chat external & foundation model serving endpoints. This is an integer between 1 and 5 with a default of 1 and should only be used with other chat/completions query fields.
prompt – Any (optional) The prompt string (or array of strings) field used ONLY for completions external & foundation model serving endpoints and should only be used with other completions query fields.
stop – List[str] (optional) The stop sequences field used ONLY for completions and chat external & foundation model serving endpoints. This is a list of strings and should only be used with other chat/completions query fields.
stream – bool (optional) The stream field used ONLY for completions and chat external & foundation model serving endpoints. This is a boolean defaulting to false and should only be used with other chat/completions query fields.
temperature – float (optional) The temperature field used ONLY for completions and chat external & foundation model serving endpoints. This is a float between 0.0 and 2.0 with a default of 1.0 and should only be used with other chat/completions query fields.
usage_context – Dict[str,str] (optional) Optional user-provided context that will be recorded in the usage tracking table.
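
The chat-related fields above carry documented ranges (n between 1 and 5, temperature between 0.0 and 2.0) and are meant to be used only together with other chat query fields. A minimal sketch of collecting them before a call might look like the following. The helper name `build_chat_kwargs` is hypothetical and not part of the SDK, and the client-side range checks are an assumption made for illustration; the service may validate these values differently.

```python
from typing import Any, Dict, List, Optional


def build_chat_kwargs(
    messages: List[Any],
    max_tokens: Optional[int] = None,
    n: Optional[int] = None,
    stop: Optional[List[str]] = None,
    stream: Optional[bool] = None,
    temperature: Optional[float] = None,
) -> Dict[str, Any]:
    """Collect chat-only query fields (hypothetical helper, not an SDK function)."""
    # Ranges taken from the parameter descriptions above.
    if n is not None and not 1 <= n <= 5:
        raise ValueError("n must be between 1 and 5")
    if temperature is not None and not 0.0 <= temperature <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")

    candidate = {
        "messages": messages,
        "max_tokens": max_tokens,
        "n": n,
        "stop": stop,
        "stream": stream,
        "temperature": temperature,
    }
    # Leave out anything the caller did not set, so only chat fields
    # that were explicitly chosen reach query().
    return {key: value for key, value in candidate.items() if value is not None}
```

The resulting dictionary could then be passed along as `w.serving_endpoints_data_plane.query(name=..., **kwargs)`, with `messages` holding ChatMessage objects as described above.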

Returns:

QueryEndpointResponse
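
As a rough illustration of the call pattern, the sketch below wraps `query` in a small function that drops unset fields. It runs against a stand-in client so that no workspace is needed; `query_endpoint`, `fake_client`, and the endpoint name are hypothetical, and the `predictions` attribute on the stand-in response is an assumption about the response shape rather than something this page states.

```python
from types import SimpleNamespace
from typing import Any


def query_endpoint(client: Any, name: str, **fields: Any) -> Any:
    """Call query() on the data plane API, skipping fields left as None."""
    supplied = {key: value for key, value in fields.items() if value is not None}
    return client.serving_endpoints_data_plane.query(name=name, **supplied)


def _fake_query(name: str, **kwargs: Any) -> SimpleNamespace:
    # Stand-in for the real endpoint: reports how many records it received.
    return SimpleNamespace(predictions=[len(kwargs.get("dataframe_records", []))])


# Stand-in client so the sketch runs offline; a real caller would pass
# a databricks.sdk.WorkspaceClient instance instead.
fake_client = SimpleNamespace(
    serving_endpoints_data_plane=SimpleNamespace(query=_fake_query)
)

response = query_endpoint(
    fake_client,
    "my-endpoint",  # hypothetical endpoint name
    dataframe_records=[{"x": 1.0}, {"x": 2.0}],  # records orientation
    client_request_id=None,  # dropped before the call
)
print(response.predictions)
```

Filtering out `None` values keeps the request limited to the one family of input fields the caller actually chose.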
