/chat/completions endpoint, specifically the p50 and p95 for generation time and tokens per second, along with the total prompt and completion tokens processed within the interval. The user ID and total request count within the interval are also returned.
Bearer <token>, where <token> is your auth token.
YYYY-MM-DD hh:mm:ss.
YYYY-MM-DD hh:mm:ss.
gpt-3.5-turbo,gpt-4o
openai,together-ai
user attribute of /chat/completions.
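As a rough illustration of how the pieces above fit together, the following sketch builds the auth header and the formatted query values for a metrics request. The parameter names `start_time`, `end_time`, `models`, and `providers`, and the helper `build_metrics_request`, are hypothetical: the parameter names are not shown in this section. The `Authorization` header name and the 24-hour reading of `hh` are also assumptions. Only the `Bearer <token>` scheme, the `YYYY-MM-DD hh:mm:ss` format, and the comma-separated model and provider lists come from the text.

```python
from datetime import datetime

# Assumed to correspond to YYYY-MM-DD hh:mm:ss, with hh read as a 24-hour value.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def build_metrics_request(token, start, end, models=None, providers=None):
    """Return (headers, params) for a metrics query.

    All query parameter names used here are hypothetical placeholders.
    """
    # Bearer scheme as described above; the header name is an assumption.
    headers = {"Authorization": f"Bearer {token}"}

    # Both ends of the interval use the same timestamp format.
    params = {
        "start_time": start.strftime(TIME_FORMAT),
        "end_time": end.strftime(TIME_FORMAT),
    }

    # Model and provider filters are comma-separated, e.g. "gpt-3.5-turbo,gpt-4o".
    if models:
        params["models"] = ",".join(models)
    if providers:
        params["providers"] = ",".join(providers)

    return headers, params


headers, params = build_metrics_request(
    token="my-token",
    start=datetime(2024, 1, 1, 0, 0, 0),
    end=datetime(2024, 1, 31, 23, 59, 59),
    models=["gpt-3.5-turbo", "gpt-4o"],
    providers=["openai", "together-ai"],
)
```

The resulting `headers` and `params` can be passed to any HTTP client. The request URL is left out because it does not appear in this section.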