usage
parameter with the read unit consumption of each request that is made.
embed
operation or automatically when upserting or querying an index with integrated embedding, return a usage
parameter with the total tokens generated.
For example, the following request to use the multilingual-e5-large
model to generate embeddings for sentences related to the word “apple” might return this request and summary of embedding tokens generated: