Rate limits
Rate limits are restrictions on the frequency of requests within a specified period of time. Rate limits vary based on pricing plan and apply to serverless indexes only.Metric | Starter plan | Standard plan | Enterprise plan |
---|---|---|---|
Read units per month per project | 1,000,000 | Unlimited | Unlimited |
Write units per month per project | 2,000,000 | Unlimited | Unlimited |
Upsert size per second per namespace | 50 MB | 50 MB | 50 MB |
Query read units per second per index | 2,000 | 2,000 | 2,000 |
Update records per second per namespace | 100 | 100 | 100 |
Fetch requests per second per index | 100 | 100 | 100 |
List requests per second per index | 200 | 200 | 200 |
Describe index stats requests per second per index | 100 | 100 | 100 |
Delete records per second per namespace | 5,000 | 5,000 | 5,000 |
Delete records per second per index | 5,000 | 5,000 | 5,000 |
Embedding tokens per minute per model | Model-specific | Model-specific | Model-specific |
Embedding tokens per month per model | 5,000,000 | Unlimited | Unlimited |
Rerank requests per minute per model | Model-specific | Model-specific | Model-specific |
Rerank requests per month per model | 500 | Model-specific | Model-specific |
Read units per month per project
Starter plan | Standard plan | Enterprise plan |
---|---|---|
1,000,000 | Unlimited | Unlimited |
429 - TOO_MANY_REQUESTS
status with the following error:
- Open the Pinecone console.
- Select the project.
- Select any index in the project.
- Look under Starter Usage.
Write units per month per project
Starter plan | Standard plan | Enterprise plan |
---|---|---|
2,000,000 | Unlimited | Unlimited |
429 - TOO_MANY_REQUESTS
status with the following error:
- Open the Pinecone console.
- Select the project.
- Select any index in the project.
- Look under Starter Usage.
Upsert size per second per namespace
Starter plan | Standard plan | Enterprise plan |
---|---|---|
50 MB | 50 MB | 50 MB |
429 - TOO_MANY_REQUESTS
status with the following error:
Query read units per second per index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
2,000 | 2,000 | 2,000 |
429 - TOO_MANY_REQUESTS
status with the following error:
Update records per second per namespace
Starter plan | Standard plan | Enterprise plan |
---|---|---|
100 | 100 | 100 |
429 - TOO_MANY_REQUESTS
status with the following error:
Fetch requests per second per index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
100 | 100 | 100 |
429 - TOO_MANY_REQUESTS
status with the following error:
List requests per second per index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
200 | 200 | 200 |
429 - TOO_MANY_REQUESTS
status with the following error:
Describe index stats requests per second per index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
100 | 100 | 100 |
429 - TOO_MANY_REQUESTS
status with the following error:
Delete records per second per namespace
Starter plan | Standard plan | Enterprise plan |
---|---|---|
5000 | 5000 | 5000 |
429 - TOO_MANY_REQUESTS
status with the following error:
Delete records per second per index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
5000 | 5000 | 5000 |
429 - TOO_MANY_REQUESTS
status with the following error:
Embedding tokens per minute per model
Embedding model | Input type | Starter plan | Standard plan | Enterprise plan |
---|---|---|---|---|
llama-text-embed-v2 | Passage | 250,000 | 1,000,000 | 1,000,000 |
Query | 50,000 | 250,000 | 250,000 | |
multilingual-e5-large | Passage | 250,000 | 1,000,000 | 1,000,000 |
Query | 50,000 | 250,000 | 250,000 | |
pinecone-sparse-english-v0 | Passage | 250,000 | 3,000,000 | 3,000,000 |
Query | 250,000 | 3,000,000 | 3,000,000 |
429 - TOO_MANY_REQUESTS
status with the following error:
Embedding tokens per month per model
Starter plan | Standard plan | Enterprise plan |
---|---|---|
5,000,000 | Unlimited | Unlimited |
429 - TOO_MANY_REQUESTS
status with the following error:
Rerank requests per minute per model
Reranking model | Starter plan | Standard plan | Enterprise plan |
---|---|---|---|
cohere-rerank-3.5 | Not available | 300 | 300 |
bge-reranker-v2-m3 | 60 | 60 | 60 |
pinecone-rerank-v0 | 60 | 60 | 60 |
429 - TOO_MANY_REQUESTS
status with the following error:
Rerank requests per month per model
Reranking model | Starter plan | Standard plan | Enterprise plan |
---|---|---|---|
cohere-rerank-3.5 | Not available | Unlimited | Unlimited |
bge-reranker-v2-m3 | 500 | Unlimited | Unlimited |
pinecone-rerank-v0 | 500 | Unlimited | Unlimited |
429 - TOO_MANY_REQUESTS
status with the following error:
Object limits
Object limits are restrictions on the number or size of objects in Pinecone. Object limits vary based on pricing plan.Metric | Starter plan | Standard plan | Enterprise plan |
---|---|---|---|
Projects per organization | 1 | 20 | 100 |
Serverless indexes per project 1 | 5 | 20 | 200 |
Serverless index storage per project | 2 GB | N/A | N/A |
Namespaces per serverless index | 100 | 100,000 | 100,000 |
Serverless backups per project | N/A | 500 | 1000 |
Namespaces per serverless backup | N/A | 2000 | 2000 |
Collections per project | 100 | N/A | N/A |
us-east-1
region of AWS.
Projects per organization
Starter plan | Standard plan | Enterprise plan |
---|---|---|
1 | 20 | 100 |
403 - QUOTA_EXCEEDED
status with the following error:
Serverless indexes per project
Starter plan | Standard plan | Enterprise plan |
---|---|---|
5 | 20 | 200 |
403 - QUOTA_EXCEEDED
status with the following error:
Serverless index storage per project
This limit applies to organizations on the Starter plan only.
Starter plan | Standard plan | Enterprise plan |
---|---|---|
2 GB | N/A | N/A |
403 - QUOTA_EXCEEDED
status with the following error:
Namespaces per serverless index
Starter plan | Standard plan | Enterprise plan |
---|---|---|
100 | 100,000 | 100,000 |
403 - QUOTA_EXCEEDED
status with the following error:
These quotas are intended to provide reasonable boundaries and prevent unexpected or unintentional misuse. To increase your quota beyond the standard allotment, contact Support.
Serverless backups per project
Starter plan | Standard plan | Enterprise plan |
---|---|---|
N/A | 500 | 1000 |
403 - QUOTA_EXCEEDED
status with the following error:
Namespaces per serverless backup
Starter plan | Standard plan | Enterprise plan |
---|---|---|
N/A | 2000 | 2000 |
403 - QUOTA_EXCEEDED
status.
Collections per project
Starter plan | Standard plan | Enterprise plan |
---|---|---|
100 | N/A | N/A |
403 - QUOTA_EXCEEDED
status with the following error:
Operation limits
Operation limits are restrictions on the size, number, or other characteristics of operations in Pinecone. Operation limits are fixed and do not vary based on pricing plan.Upsert limits
Metric | Limit |
---|---|
Max batch size | 2 MB or 1000 records with vectors 96 records with text |
Max metadata size per record | 40 KB |
Max length for a record ID | 512 characters |
Max dimensionality for dense vectors | 20,000 |
Max non-zero values for sparse vectors | 2048 |
Max dimensionality for sparse vectors | 4.2 billion |
Import limits
Metric | Limit |
---|---|
Max size per import request | 2 TB or 200,000,000 records |
Max namespaces per import request | 10,000 |
Max files per import request | 100,000 |
Max size per file | 10 GB |
Query limits
Metric | Limit |
---|---|
Max top_k value | 10,000 |
Max result size | 4MB |
If a query fails due to exceeding the 4MB result size limit, choose a lower
top_k
value, or use include_metadata=False
or include_values=False
to exclude metadata or values from the result.Fetch limits
Limit | |
---|---|
Max records per fetch request | 1,000 |
Delete limits
Delete | Limit |
---|---|
Max records per delete request | 1,000 |
Identifier limits
An identifier is a string of characters (up to 255 characters in length) used to identify “named” objects in Pinecone. The following Pinecone objects use strings as identifiers:Object | Field | Max # characters | Allowed characters |
---|---|---|---|
Organization | name | 512 | UTF-8 except \0 |
Project | name | 512 | UTF-8 except \0 |
Index | name | 45 | A-Z , a-z , 0-9 , and - |
Namespace | namespace | 512 | ASCII except \0 |
Record | id | 512 | ASCII except \0 |