usage parameter with the read unit consumption of each request that is made.
embed operation or automatically when upserting or querying an index with integrated embedding, return a usage parameter with the total tokens generated.
For example, the following request to use the multilingual-e5-large model to generate embeddings for sentences related to the word “apple” might return this request and summary of embedding tokens generated: