Pinecone Assistant limits vary based on subscription plan.

Object limits

Object limits are restrictions on the number or size of assistant-related objects.
MetricStarter planStandard planEnterprise plan
Assistants per project5UnlimitedUnlimited
File storage per project1 GBUnlimitedUnlimited
Chat input tokens per project1,500,000UnlimitedUnlimited
Chat output tokens per project200,000UnlimitedUnlimited
Context retrieval tokens per project500,000UnlimitedUnlimited
Evaluation input tokens per projectNot available150,000500,000
Files per assistant1010,00010,000
File size (.docx, .json, .md, .txt)10 MB10 MB10 MB
File size (.pdf)10 MB100 MB100 MB
Metadata size per file16 KB16 KB16 KB
Additionally, the following limits apply to multimodal PDFs (currently in public preview):
MetricStarter planStandard planEnterprise plan
Max file size10 MB50 MB50 MB
Page limit100100100
Multimodal PDFs per assistant12020

Rate limits

Rate limits are restrictions on the frequency of requests within a specified period of time. Requests that exceed a rate limit fail and return a 429 - TOO_MANY_REQUESTS status.
To handle rate limits, implement retry logic with exponential backoff.
MetricStarter planStandard planEnterprise plan
Assistant list/get requests per minute40100500
Assistant create/update requests per minute2050100
Assistant delete requests per minute2050100
File list/get requests per minute1003006000
File upload requests per minute520300
File delete requests per minute520300
Chat input tokens per minute100,000300,0001,000,000
Chat history tokens per query64,00064,00064,000