This page describes the pricing and limits of Pinecone Assistant.

Pricing

The cost of using Pinecone Assistant is determined by the following factors:

Invoice line item | Description
Assistants Context Tokens Processed | Number of tokens processed for context retrieval.
Assistants Evaluation Tokens Out | Number of tokens used to calculate evaluation metrics.
Assistants Evaluation Tokens Processed | Number of tokens used to prompt evaluation metrics.
Assistants Hourly Count | Number of hours the assistant is available.
Assistants Input Tokens | Number of tokens processed by the assistant.
Assistants Output Tokens | Number of tokens output by the assistant.
Assistants Total Storage GB/Hours | Total size of files stored in the assistant per month.

See Pricing for up-to-date pricing information.

Token usage

Pinecone Assistant usage is measured in tokens. Input and output tokens are counted and priced separately.

Pinecone Assistant consumes input tokens for both planning and retrieval. Input token usage depends on the chat history, the document structure and data density (for example, how many words are on a page), and the number of documents that meet the filter criteria. As a rule of thumb, the total number of input tokens for a query is the chat history token count plus on the order of 10,000 tokens for document retrieval. The maximum number of input tokens per query is 64,000.
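The rule of thumb above can be sketched as a small estimator. This is an illustrative approximation only: the function names are hypothetical, the 10,000-token retrieval figure is an order-of-magnitude value taken from the text rather than a guarantee, and this page does not describe what happens when a query exceeds the 64,000-token maximum.

```python
MAX_INPUT_TOKENS_PER_QUERY = 64_000  # per-query maximum stated on this page
RETRIEVAL_OVERHEAD_TOKENS = 10_000   # rough, order-of-magnitude figure from this page


def estimate_input_tokens(chat_history_tokens: int,
                          retrieval_overhead: int = RETRIEVAL_OVERHEAD_TOKENS) -> int:
    """Rough estimate: chat history tokens plus retrieval overhead."""
    if chat_history_tokens < 0 or retrieval_overhead < 0:
        raise ValueError("token counts must be non-negative")
    return chat_history_tokens + retrieval_overhead


def exceeds_query_limit(chat_history_tokens: int) -> bool:
    """True if the estimate is above the per-query input maximum."""
    return estimate_input_tokens(chat_history_tokens) > MAX_INPUT_TOKENS_PER_QUERY


# Example: a 2,500-token chat history is estimated at about 12,500 input tokens.
estimate = estimate_input_tokens(2_500)
```

Actual usage varies with document structure, data density, and how many documents match the filter, so treat the result as a planning figure rather than a billing prediction.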

Output tokens are the tokens generated for the answer. The total depends on the complexity of the question and on the number of retrieved documents that are relevant to it. The output typically ranges from a few dozen to several hundred tokens.

Limits

The following Pinecone Assistant limits apply to each organization and vary based on pricing plan:

Metric | Starter plan | Standard plan | Enterprise plan
Max number of assistants | 3 | Unlimited | Unlimited
Max tokens per minute (TPM) input | 30,000 | 150,000 | 150,000
Max number of total LLM processed tokens | 1,500,000 | Unlimited | Unlimited
Max input tokens per query | 64,000 | 64,000 | 64,000
Max total output tokens | 200,000 | Unlimited | Unlimited
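To get a feel for what the input TPM limit means in practice, the following sketch computes how many queries of a given size fit into one minute. The dictionary and function names are hypothetical, and the sketch assumes that each query counts its full input token total against the per-minute budget, which this page does not spell out.

```python
# Input tokens-per-minute limits from the table above.
INPUT_TPM_BY_PLAN = {
    "starter": 30_000,
    "standard": 150_000,
    "enterprise": 150_000,
}
MAX_INPUT_TOKENS_PER_QUERY = 64_000  # same for every plan


def max_queries_per_minute(plan: str, input_tokens_per_query: int) -> int:
    """Whole number of queries that fit in one minute's input token budget.

    Assumption: a query consumes exactly `input_tokens_per_query` from the budget.
    """
    if not 0 < input_tokens_per_query <= MAX_INPUT_TOKENS_PER_QUERY:
        raise ValueError("input tokens per query must be between 1 and 64,000")
    return INPUT_TPM_BY_PLAN[plan] // input_tokens_per_query


# Example: queries of about 12,500 input tokens each.
starter_rate = max_queries_per_minute("starter", 12_500)
standard_rate = max_queries_per_minute("standard", 12_500)
```

Under these assumptions, a single query near the 64,000-token maximum would not fit within the Starter plan's 30,000 TPM budget at all, so the function returns 0 for that case.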

The following file limits apply to each assistant and vary based on pricing plan:

Metric | Starter plan | Standard plan | Enterprise plan
Max file size | 10 MB | 10 MB | 10 MB
Max PDF file size | 10 MB | 100 MB | 100 MB
Max file storage | 1 GB | 10 GB | 10 GB
Max files uploaded | 10 | 10,000 | 10,000
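A client-side pre-check against these file limits might look like the sketch below. It is not part of any Pinecone SDK: the names are hypothetical, PDFs are detected by file extension only, and the sketch assumes decimal units (1 MB = 1,000,000 bytes), which this page does not specify.

```python
from typing import List

MB = 1_000_000      # assumption: decimal megabytes
GB = 1_000 * MB     # assumption: decimal gigabytes

# Per-assistant file limits from the table above.
FILE_LIMITS = {
    "starter": {"file_mb": 10, "pdf_mb": 10, "storage_gb": 1, "max_files": 10},
    "standard": {"file_mb": 10, "pdf_mb": 100, "storage_gb": 10, "max_files": 10_000},
    "enterprise": {"file_mb": 10, "pdf_mb": 100, "storage_gb": 10, "max_files": 10_000},
}


def check_upload(plan: str, filename: str, size_bytes: int,
                 files_already_uploaded: int, bytes_already_stored: int) -> List[str]:
    """Return a list of limit violations; an empty list means the upload fits."""
    limits = FILE_LIMITS[plan]
    problems = []
    is_pdf = filename.lower().endswith(".pdf")
    max_mb = limits["pdf_mb"] if is_pdf else limits["file_mb"]
    if size_bytes > max_mb * MB:
        problems.append(f"file exceeds the {max_mb} MB size limit")
    if files_already_uploaded + 1 > limits["max_files"]:
        problems.append(f"assistant already holds {limits['max_files']} files")
    if bytes_already_stored + size_bytes > limits["storage_gb"] * GB:
        problems.append(f"upload would exceed {limits['storage_gb']} GB of storage")
    return problems


# Example: a 50 MB PDF is too large on Starter but fits on Standard.
starter_problems = check_upload("starter", "report.pdf", 50 * MB, 0, 0)
standard_problems = check_upload("standard", "report.pdf", 50 * MB, 0, 0)
```

Running the check before uploading avoids sending a file that would be rejected, but the service's own validation remains the source of truth.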