Get Query Metrics - Unify Documentation

Returns aggregated telemetry for previous queries to the /chat/completions endpoint. For each aggregation interval, the response includes the p50 and p95 of generation time and of tokens per second, the total prompt and completion tokens processed, the user id, and the total request count.

Authorizations

Authorization

string

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
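As a minimal sketch of the header format described above, the helper below builds the Authorization header from a token. The function name auth_headers and the placeholder token are illustrative assumptions, not part of the documented API.

```python
def auth_headers(token: str) -> dict:
    """Return headers carrying a bearer token (hypothetical helper)."""
    if not token:
        raise ValueError("an auth token is required")
    # Documented form: "Bearer <token>"
    return {"Authorization": f"Bearer {token}"}


# "YOUR_TOKEN" is a placeholder, not a real credential.
headers = auth_headers("YOUR_TOKEN")
```

The resulting dict can be passed as the headers argument of whichever HTTP client you use.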

Query Parameters

start_time

string

Timestamp of the earliest query to aggregate. Format is YYYY-MM-DD hh:mm:ss.

end_time

string

Timestamp of the latest query to aggregate. Format is YYYY-MM-DD hh:mm:ss.

models

string

Models to fetch metrics from, given as a comma-separated list of strings, e.g. gpt-3.5-turbo,gpt-4o

providers

string

Providers to fetch metrics from, given as a comma-separated list of strings, e.g. openai,together-ai

interval

string

default: "300"

Number of seconds in the aggregation interval.

secondary_user_id

string

Secondary user id. Matches any string previously sent in the user attribute of a /chat/completions request.
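The query parameters above can be assembled as in the following sketch. It formats timestamps as YYYY-MM-DD hh:mm:ss and joins the model and provider lists with commas. The helper name build_metrics_params is hypothetical, hh is assumed to be a 24-hour value, and unset parameters are assumed to be omittable because this page marks only Authorization as required. The endpoint URL is not stated here, so the sketch stops short of sending a request.

```python
from datetime import datetime

# Timestamp layout from the parameter descriptions: YYYY-MM-DD hh:mm:ss.
# Assumption: hh is a 24-hour clock value.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def build_metrics_params(
    start_time=None,
    end_time=None,
    models=None,
    providers=None,
    interval=None,
    secondary_user_id=None,
):
    """Build a query-parameter dict, leaving out anything unset (hypothetical helper)."""
    params = {}
    if start_time is not None:
        params["start_time"] = start_time.strftime(TIME_FORMAT)
    if end_time is not None:
        params["end_time"] = end_time.strftime(TIME_FORMAT)
    if models:
        # Comma-separated, e.g. "gpt-3.5-turbo,gpt-4o"
        params["models"] = ",".join(models)
    if providers:
        # Comma-separated, e.g. "openai,together-ai"
        params["providers"] = ",".join(providers)
    if interval is not None:
        # Interval length in seconds, sent as a string
        params["interval"] = str(int(interval))
    if secondary_user_id is not None:
        params["secondary_user_id"] = secondary_user_id
    return params


params = build_metrics_params(
    start_time=datetime(2024, 1, 1, 0, 0, 0),
    end_time=datetime(2024, 1, 2, 0, 0, 0),
    models=["gpt-3.5-turbo", "gpt-4o"],
    providers=["openai", "together-ai"],
    interval=600,
)
```

The dict can then be URL-encoded with urllib.parse.urlencode or handed to an HTTP client's query-parameter argument, which takes care of escaping the spaces, colons, and commas.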
