This page provides guidance on managing the cost of Pinecone. For the latest pricing details, see our pricing page. For help estimating total cost, see Understanding total cost. To see a calculation of your current usage and costs, see the usage dashboard in the Pinecone console.

Choose the right index

  • Serverless: With serverless indexes, you don’t configure or manage any compute or storage resources. Instead, built on a breakthrough architecture, serverless indexes scale automatically with usage, and you pay only for the amount of data stored and operations performed, with no minimums. This means that there’s no extra cost for having additional indexes.
  • Pod-based: Pod types and sizes are designed for different applications, and some are more expensive than others. Choose the appropriate pod type and size so that you pay only for the resources you need. For example, the s1 pod type provides large storage capacity and lower overall costs with slightly higher query latencies than p1 pods. By switching to a different pod type, you may be able to reduce costs while still getting the performance your application needs.

List by ID prefix

list is supported only for serverless indexes.

By using a hierarchical ID schema, you can retrieve records without performing a query. To do so, use list to retrieve record IDs by prefix, then use fetch to retrieve the records you need. This can reduce costs, because query consumes more read units (RUs) when scanning a larger namespace, while fetch consumes a fixed ratio of RUs to records retrieved. Similarly, you can use list and then delete to remove records by ID prefix. To learn more, see Manage RAG documents.
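The list-then-fetch and list-then-delete patterns described above can be sketched as follows. This is a minimal illustration, not a definitive implementation: it assumes `index` is an index handle from the Pinecone Python client whose `list` call yields pages of IDs, and the ID scheme `doc1#chunk1` and the helper names `fetch_by_prefix` and `delete_by_prefix` are hypothetical.

```python
from typing import Any, Dict


def fetch_by_prefix(index: Any, prefix: str, namespace: str) -> Dict[str, Any]:
    """List record IDs that start with `prefix`, then fetch those records."""
    records: Dict[str, Any] = {}
    # Assumption: index.list() yields pages (lists) of matching IDs.
    for id_page in index.list(prefix=prefix, namespace=namespace):
        ids = list(id_page)
        if not ids:
            continue
        # fetch retrieves records by ID, so no query is performed here.
        response = index.fetch(ids=ids, namespace=namespace)
        records.update(response.vectors)
    return records


def delete_by_prefix(index: Any, prefix: str, namespace: str) -> int:
    """Delete every record whose ID starts with `prefix`; return the count."""
    deleted = 0
    for id_page in index.list(prefix=prefix, namespace=namespace):
        ids = list(id_page)
        if ids:
            index.delete(ids=ids, namespace=namespace)
            deleted += len(ids)
    return deleted


# Hypothetical usage, with IDs shaped like "<document>#<chunk>":
#   records = fetch_by_prefix(index, "doc1#", namespace="example-namespace")
#   removed = delete_by_prefix(index, "doc1#", namespace="example-namespace")
```

With IDs structured this way, a single prefix such as `doc1#` selects every chunk of one document, which is what makes the ID schema useful for retrieval and cleanup.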

Back up inactive indexes

Serverless indexes do not support collections at this time.

When a specific index is not in use, back it up using collections and delete the inactive index. When you’re ready to use these vectors again, you can create a new index from the collection. This new index can also use a different index type or size. Because it’s relatively cheap to store collections, you can reduce costs by only running an index when it’s in use.
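The backup-and-restore workflow for a pod-based index might look like the sketch below. It assumes `pc` is a client object from the Pinecone Python client, that `describe_collection` reports a status of `"Ready"` once the collection is usable, and that the caller builds the pod `spec` with its `source_collection` set. The helper names are hypothetical.

```python
from typing import Any


def back_up_index(pc: Any, index_name: str, collection_name: str) -> None:
    """Start creating a collection (a static copy) from an existing index."""
    pc.create_collection(name=collection_name, source=index_name)


def collection_is_ready(pc: Any, collection_name: str) -> bool:
    """Check whether the collection has finished being created."""
    info = pc.describe_collection(collection_name)
    # Assumption: the description exposes `status` as a key or an attribute.
    status = info["status"] if isinstance(info, dict) else getattr(info, "status", None)
    return status == "Ready"


def delete_index_if_backed_up(pc: Any, index_name: str, collection_name: str) -> bool:
    """Delete the index only once its backup is ready; return True if deleted."""
    if not collection_is_ready(pc, collection_name):
        return False
    pc.delete_index(index_name)
    return True


def restore_index(pc: Any, new_index_name: str, dimension: int, metric: str, spec: Any) -> None:
    """Create a new index from a collection.

    `spec` is expected to name the collection, for example (assumed values):
    PodSpec(environment="us-east1-gcp", pod_type="s1.x1",
            source_collection="example-collection")
    """
    pc.create_index(name=new_index_name, dimension=dimension, metric=metric, spec=spec)
```

Checking the collection status before deleting the index guards against removing the only copy of the data while the backup is still being written.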

Use namespaces for multitenancy

If your application requires you to isolate the data of each customer or user, consider implementing multitenancy with serverless indexes and namespaces. With serverless indexes, you pay only for the amount of data stored and operations performed. For queries in particular, the cost is partly based on the total number of records that must be scanned. Because a query targets a single namespace, it scans only that tenant's records rather than the whole index, so using namespaces can significantly reduce query costs.
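As a rough sketch of one-namespace-per-tenant, the helpers below write and query each tenant's records in that tenant's own namespace. They assume `index` is an index handle from the Pinecone Python client. The helper names and the choice to use the tenant ID directly as the namespace name are illustrative assumptions.

```python
from typing import Any, List, Sequence


def upsert_for_tenant(index: Any, tenant_id: str, vectors: List[Any]) -> Any:
    """Write records into the namespace that belongs to one tenant."""
    # Assumption: the tenant ID doubles as the namespace name.
    return index.upsert(vectors=vectors, namespace=tenant_id)


def query_for_tenant(index: Any, tenant_id: str, vector: Sequence[float], top_k: int = 5) -> Any:
    """Query only the given tenant's namespace."""
    return index.query(vector=list(vector), top_k=top_k, namespace=tenant_id)


# Hypothetical usage:
#   upsert_for_tenant(index, "tenant-a", [("rec1", [0.1, 0.2, 0.3])])
#   matches = query_for_tenant(index, "tenant-a", [0.1, 0.2, 0.3], top_k=3)
```

Routing every read and write through helpers like these keeps the tenant-to-namespace mapping in one place, which makes it harder to query across tenants by accident.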

Commit to annual spend

Users who commit to an annual contract may qualify for discounted rates. To learn more, contact Pinecone sales.

Talk to support

Users on the Standard and Enterprise plans can contact support for help in optimizing costs.

See also