w.serving_endpoints_data_plane: Serving endpoints DataPlane¶
- class databricks.sdk.service.serving.ServingEndpointsDataPlaneAPI¶
Serving endpoints DataPlane provides a set of operations to interact with data plane endpoints for Serving endpoints service.
- query(name: str [, client_request_id: Optional[str], dataframe_records: Optional[List[Any]], dataframe_split: Optional[DataframeSplitInput], extra_params: Optional[Dict[str, str]], input: Optional[Any], inputs: Optional[Any], instances: Optional[List[Any]], max_tokens: Optional[int], messages: Optional[List[ChatMessage]], n: Optional[int], prompt: Optional[Any], stop: Optional[List[str]], stream: Optional[bool], temperature: Optional[float], usage_context: Optional[Dict[str, str]]]) QueryEndpointResponse¶
Query a serving endpoint
- Parameters:
name – str The name of the serving endpoint. This field is required and is provided via the path parameter.
client_request_id – str (optional) Optional user-provided request identifier that will be recorded in the inference table and the usage tracking table.
dataframe_records – List[Any] (optional) Pandas Dataframe input in the records orientation.
dataframe_split –
DataframeSplitInput(optional) Pandas Dataframe input in the split orientation.extra_params – Dict[str,str] (optional) The extra parameters field used ONLY for __completions, chat,__ and __embeddings external & foundation model__ serving endpoints. This is a map of strings and should only be used with other external/foundation model query fields.
input – Any (optional) The input string (or array of strings) field used ONLY for __embeddings external & foundation model__ serving endpoints and is the only field (along with extra_params if needed) used by embeddings queries.
inputs – Any (optional) Tensor-based input in columnar format.
instances – List[Any] (optional) Tensor-based input in row format.
max_tokens – int (optional) The max tokens field used ONLY for __completions__ and __chat external & foundation model__ serving endpoints. This is an integer and should only be used with other chat/completions query fields.
messages – List[
ChatMessage] (optional) The messages field used ONLY for __chat external & foundation model__ serving endpoints. This is an array of ChatMessage objects and should only be used with other chat query fields.n – int (optional) The n (number of candidates) field used ONLY for __completions__ and __chat external & foundation model__ serving endpoints. This is an integer between 1 and 5 with a default of 1 and should only be used with other chat/completions query fields.
prompt – Any (optional) The prompt string (or array of strings) field used ONLY for __completions external & foundation model__ serving endpoints and should only be used with other completions query fields.
stop – List[str] (optional) The stop sequences field used ONLY for __completions__ and __chat external & foundation model__ serving endpoints. This is a list of strings and should only be used with other chat/completions query fields.
stream – bool (optional) The stream field used ONLY for __completions__ and __chat external & foundation model__ serving endpoints. This is a boolean defaulting to false and should only be used with other chat/completions query fields.
temperature – float (optional) The temperature field used ONLY for __completions__ and __chat external & foundation model__ serving endpoints. This is a float between 0.0 and 2.0 with a default of 1.0 and should only be used with other chat/completions query fields.
usage_context – Dict[str,str] (optional) Optional user-provided context that will be recorded in the usage tracking table.
- Returns: