Logging
Logging enables previously sent requests to the retrieved at a later date for analysis, reporting, debugging and/or testing. For convenience, these prompts can be filtered based on many factors, such as arbitrary string tags, the model, the provider, the endpoint, start time for retrieval window, end time for retrieval window etc.
Queries sent to Unify’s chat completion API can be logged behind the scenes (#via-unify), and queries made to external LLM providers can also be logged explicitly (#Other Clients), both of which are explained below.
Via Unify
If the LLM queries are being handled by Unify, then we could make queries as follows, such that the queries are all sensibly categorized (tagged).
Lets assume we’re building an AI educational assistant, which is serving many students for many different subjects. In this case, it would make sense to tag queries based on both the subject and the student:
In Python, this would look like:
If you want to turn off logging entirely, this can be done via the usage page in your console.
Presuming that logging is turned on in the console, and presuming you have a
Professional Plan or higher, then logging can be controlled
on a prompt-by-prompt basis, by specifying one or both of the arguments log_query_body
and log_response_body
in the
chat completions endpoint.
As a cURL request, this looks as follows:
In Python, the arguments are passed as keywords:
If log_query_body
is False
, then log_response_body
will be ignored.
In other words, a response cannot be logged without a corresponding logged query.
Other Clients
If you are not deploying your LLM via Unify, you can still manually log your prompts to the Unify platform via the CURL request as follows:
Similarly, if you also want to include the LLM response in your logged query,
you can use the response_body
key as follows:
In Python, this is more convenient via the unify.with_logging
decorator,
as follows for the OpenAI client:
The same can be done for any other provider.
As before, the arguments log_query_body
and log_response_body
can be specified,
controlling exactly what is logged to the account.
The function unify.with_logging
by default will assume tags
to be empty.
The log arguments can either be specified when unify.with_logging
is called as a decorator,
or the arguments can be intercepted from the wrapped inner function call.
Arguments passed to the inner wrapped function override arguments passed directly to
unify.with_logging
, except for the tags
argument which will be extended with the
additional tags.
For example, the following will tag the prompts as tagA
and tagB
.
The following will also tag the prompts as tagA
and tagB
.
However, the following will tag the prompts with tagA
, tagB
and tagC
.
If you’ve already processed the input and output, and would like to retrospectively log
the query, you can make use of unify.log_query
, which is one of the utility functions
which thinly wraps the REST API:
Retrieving Queries
Every query made via the API or manually logged can then be retrieved at a later
stage, using the GET
request with the
/queries
endpoint,
as follows:
Again, in Python this would look like:
We could also query only "maths"
and return the maths prompts for all students,
or we could query only "john_smith"
and return the prompts across all subjects for
this student.
If you want to simply retrieve all queries made you can leave the tags
argument
empty, or if you want to retrieve all queries for a student you can omit the subject
tag, and vice versa.
If there is a lot of production traffic, you can also limit the retrieval to a specific
time window, using the argument start_time
(and optionally also end_time
), like so:
Extracting historic prompts in this manner can also be useful for creating prompt datasets from production traffic, as explained in the Production Data section.