GET
/
v0
/
metrics

Returns aggregated telemetry data from previous queries to the /chat/completions endpoint, specifically the p50 and p95 for generation time and tokens per second, and also the total prompt and completion tokens processed within the interval. The user id and total request count within the interval are also returned.

Authorizations

Authorization
string
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Query Parameters

start_time
string

Timestamp of the earliest query to aggregate. Format is YYYY-MM-DD hh:mm:ss.

end_time
string

Timestamp of the latest query to aggregate. Format is YYYY-MM-DD hh:mm:ss.

models
string

Models to fetch metrics from. The list must be a set of comma-separated strings. i.e. gpt-3.5-turbo,gpt-4o

providers
string

Providers to fetch metrics from. The list must be a set of comma-separated strings. i.e. openai,together-ai

interval
string
default: "300"

Number of seconds in the aggregation interval.

secondary_user_id
string

Secondary user id. The secondary user id will match any string previously sent in the user attribute of /chat/completions.