2024 releases
October 2024
Features
-
Added the
chat_assistant
endpoint to the Assistant API. It can be used to chat with your assistant, and get responses and citations back in a structured form. -
You can now add instructions when creating or updating an assistant. Instructions are a short description or directive for the assistant to apply to all of its responses. For example, you can update the instructions to reflect the assistant’s role or purpose.
-
You can now update an existing assistant with new instructions or metadata.
-
You can now import data into an index from Amazon S3. This feature is in public preview.
September 2024
Features
-
When using Pinecone Assistant, you can now choose an LLM for the assistant to use and filter the assistant’s responses by metadata.
-
If you are upgrading from the Starter plan, you can now connect your Pinecone organization to the AWS Marketplace, GCP Marketplace, or Azure Marketplace for billing purposes.
-
Refreshed the navigation and overall visual interface of the Pinecone console.
-
Prometheus monitoring for serverless indexes is now in public preview.
APIs
-
Added the Evaluation API, which provides a way to evaluate the correctness and completeness of a response from a RAG system. This feature is in public preview.
-
When using Pinecone Assistant API, you can now choose an LLM for the assistant to use and filter the assistant’s responses by metadata.
SDKs
-
Released
v3.0.3
of the Pinecone Node.js SDK. This version removes extra logging and makes general internal enhancements. -
Released
v5.2.0
,v5.3.0
, andv5.3.1
of the Pinecone Python SDK.v5.2.0
adds support for reranking documents with Pinecone Inference; it is no longer necessary to install thepinecone-plugin-inference
package seperately. This feature is in public preview.v5.3.0
adds support for bulk import operations. This feature is in public preview.v5.3.1
adds a missingpython-dateutil
dependency.
-
Released
v1.1.1
of the Pinecone Go SDK. This version adds support for non-secure client connections. -
Released
v2.1.0
of the Pinecone Java SDK. This version adds support for non-secure client connections.
Documentation
-
Added the following integration pages:
-
Added Go examples for batch upserts, parallel upserts, and deleting all records for a parent document.
-
Added guidance on using the Node.js SDK with proxies.
August 2024
Features
-
Serverless indexes are now in general availability on GCP and Azure for Standard and Enterprise plans.
-
You can now deploy serverless indexes in the
europe-west1
(Netherlands) region of GCP. -
You can now back up and restore serverless indexes. This feature is in public preview.
-
Pinecone Assistant is now in public preview.
-
The Pinecone Inference API now supports reranking.
APIs
- The Pinecone Assistant API is now in public preview.
- The Inference API Rerank endpoint is now in public preview.
SDKs
-
Released
v1.0.0
of the Pinecone .NET SDK. For usage examples, see our guides or the GitHub README. -
Released
v1.1.0
of the Pinecone Go SDK. This version adds support for generating embeddings via Pinecone Inference. -
Release
v3.0.2
of the Pinecone Node.js SDK. This version removes a native Node utility function that was causing issues for users running inEdge
. There are no downstream affects of its removal; existing code should not be impacted. -
Released
v5.1.0
of the Pinecone Python SDK. With this version, the SDK can now be installed withpip install pinecone
/pip install "pinecone[grpc]"
. This version also includes ahas_index()
helper function to check if an index exists. -
Released
v0.1.0
andv0.1.1
of the Pinecone Rust SDK. The Rust SDK is in “alpha” and is under active development. The SDK should be considered unstable and should not be used in production. Before a 1.0 release, there are no guarantees of backward compatibility between minor versions. See the Rust SDK README for full installation instructions and usage examples.
Documentation
July 2024
Features
-
Serverless indexes are now in public preview on GCP and Azure for Standard and Enterprise plans.
-
Released version 1.1.0 of the official Spark connector for Pinecone. In this release, you can now set a source tag. Additionally, you can now upsert records with 40KB of metadata, increased from 5KB.
-
To ensure that your applications continue to work as expected as the platform evolves, the Database and Inference APIs are now versioned, and Pinecone SDK versions are now pinned to specific API versions. For more details, see API versioning.
-
Added the ability to prevent accidental index deletion.
APIs
-
Released version
2024-07
of the Database API and Inference API. This version includes the following highlights:-
The
create_index
andconfigure_index
endpoints now support thedeletion_protection
parameter. Setting this parameter to"enabled"
prevents an index from accidental deletion. For more details, see Prevent index deletion. -
The
describe_index
andlist_index
responses now include thedeletion_protection
field. This field indicates whether deletion protection is enabled for an index. -
The
spec.serverless.cloud
andspec.serverless.region
parameters ofcreate_index
now supportgcp
/us-central
andazure
/eastus2
as part of the serverless public preview on GCP and Azure.
-
SDKs
-
Released
v5.0.0
of the Pinecone Python SDK. This version depends on Pinecone API version2024-07
and includes the ability to prevent accidental index deletion. Additionally, thepinecone-plugin-inference
package required to generate embeddings with Pinecone Inference is now included by default; it is no longer necessary to install the plugin seperately. -
Released
v3.0.0
of the Pinecone Node.js SDK. This version depends on Pinecone API version2024-07
and includes the ability to prevent accidental index deletion. Additionally, this version supports generating embeddings via Pinecone Inference. -
Released
v2.0.0
of the Pinecone Java SDK. This version depends on Pinecone API version2024-07
and includes the ability to prevent accidental index deletion. Additionally, this version includes the following breaking changes:createServerlessIndex()
now requires a new argument:DeletionProtection.ENABLED
orDeletionProtection.DISABLED
.configureIndex()
has been renamedconfigurePodsIndex()
.
For more details, see the Java SDK v2.0.0 migration guide.
-
Released
v1.0.0
of the Pinecone Go SDK. This version depends on Pinecone API version2024-07
and includes the ability to prevent accidental index deletion. With this version, the Go SDK is officially supported by Pinecone.
Documentation
-
Updated the Build a RAG chatbot guide to use Pinecone Inference for generating embeddings.
-
Added an introduction to key concepts in Pinecone and how they relate to each other.
-
Added the following integration pages:
-
Added a list of possible HTTP status codes that the Pinecone API can return in responses.
June 2024
Features
- Added a model gallery with details and guidance on popular embedding and reranking models, including models hosted on Pinecone’s infrastructure.
- You can now load a sample dataset into a new project.
- Simplified the process for migrating paid pod indexes to serverless.
- The Assistant API is now in beta release.
- The Inference API is now in public preview.
- The Enterprise quota for namespaces per serverless index has increased from 50,000 to 100,000.
SDKs
-
Released version 1.2.2 of the Pinecone Java SDK. This release simplifies the proxy configuration process. It also fixes an issue where the user agent string was not correctly setup for gRPC calls. Now, if the source tag is set by the user, it is appended to the custom user agent string.
-
Released version 4.1.1 of the Pinecone Python SDK. In this release, you can now use colons inside soure tags. Additionally, the gRPC version of the Python SDK now allows retries of up to
MAX_MSG_SIZE
.
Documentation
-
Added the following integration pages:
-
Updated Python code samples to use the gRPC version of the Python SDK, which is more performant than the Python SDK that interacts with Pinecone via HTTP requests.
-
Added a new Legal semantic search sample app that demonstrates low-latency natural language search over a knowledge base of legal documents.
May 2024
Features
- You can now use the
ConnectPopup
function to bypass the Connect widget and open the “Connect to Pinecone” flow in a popup. This can be used in an app or website for a seamless Pinecone signup and login process. - Pinecone now supports AWS PrivateLink. Create and use Private Endpoints to connect AWS PrivateLink to Pinecone while keeping your VPC private from the public internet.
SDKs
- Released versions 1.1.0 and 1.2.0 of the Pinecone Java SDK.
- v1.1.0 added the ability to list record IDs with a common prefix.
- v1.2.0 added the ability to list all record IDs in a namespace.
- 1.2.1 fixed the error
Could Not Find NameResolverProvider
using uber jar.
- Released version 1.0.0 of the official Spark connector for Pinecone. In this release, you can now upsert records into serverless indexes.
- Released version 4.0.0 of the Pinecone Python SDK. In this release, we are upgrading the
protobuf
dependency in our optionalgrpc
extras from3.20.3
to4.25.3
. Significant performance improvements have been made with this update. This is a breaking change for users of the optional GRPC addon (installed withpinecone[grpc]
).
Documentation
- Added the following integration pages:
April 2024
Features
- The free Starter plan now includes 1 project, 5 serverless indexes in the
us-east-1
region of AWS, and up to 2 GB of storage. Although the Starter plan has stricter limits than other plans, you can upgrade whenever you’re ready. - You can now deploy serverless indexes in the
eu-west-1
region of AWS. - Pinecone now supports an official Terraform integration to let you manage your Pinecone database using your Terraform configuration.
- Pinecone now provides a Connect widget that can be embedded into an app, website, or Colab notebook for a seamless signup and login process.
SDKs
-
Released version 1.0.0 of the Pinecone Java SDK. With this version, the Java SDK is officially supported by Pinecone. For full details on the release, see the v1.0.0 release notes in GitHub. For usage examples, see our guides or the GitHub README. To migrate to v1.0.0 from version 0.8.x or below, see the Java v1.0.0 migration guide.
-
Released version 0.9.0 of the Canopy SDK. This version adds support for OctoAI LLM and embeddings, and Qdrant as a supported knowledge base. See the v0.9.0 release notes in GitHub for more details.
-
As announced in January 2024, control plane operations like
create_index
,describe_index
, andlist_indexes
now use a single global URL,https://api.pinecone.io
, regardless of the cloud environment where an index is hosted. This is now in general availability. As a result, the legacy version of the API, which required regional URLs for control plane operations, is deprecated as of April 15, 2024 and will be removed in a future, to be announced, release.
Documentation
- The docs now have a new AI chatbot. Use the search bar at the top of our docs to find related content across all of our resources.
- We’ve updated the look and feel of our example notebooks and sample apps. A new sample app, Namespace Notes, a simple multi-tenant RAG app that uploads documents, has also been added.
- Added the lifecycle policy of the Pinecone API, which describes the availability phases applicable to APIs, features, and SDK versions.
- Added the following integration page:
March 2024
Features
-
You can now configure single single-on to manage your teams’ access to Pinecone through any identity management solution with SAML 2.0 support, such as Okta.
This feature is available on the Enterprise plan only.
-
You can now deploy serverless indexes in the
us-east-1
region of AWS.
Improvements
- Fixed a bug that caused inaccurate index fullness reporting for some pod-based indexes on GCP.
Console
-
The Pinecone console has a new look and feel, with a brighter, minimalist design; reorganized menu items for quicker, more intuitive navigation; and easy access to recently viewed indexes in the sidebar.
-
When viewing the list of indexes in a project, you can now search indexes by index name; sort indexes alphabetically, by how recently they were viewed or created, or by status; and filter indexes by index type (serverless, pod-based, or starter).
SDKs
-
Released versions 3.2.0, 3.2.1, and 3.2.2 of the Pinecone Python SDK.
-
v3.2.0 adds four optional configuration properties that enable the use of Pinecone via proxy.
-
v3.2.1 adds an optional
source_tag
that you can set when constructing a Pinecone client to help Pinecone associate API activity to the specified source. See the v3.2.1 release notes in GitHub for more details. -
v3.2.2 fixes a minor issue introduced in v3.2.0 that resulted in a
DeprecationWarning
being incorrectly shown to users who are not passing in the deprecatedopenapi_config
property. This warning can safely be ignored by anyone who is not preparing to upgrade.
-
-
Released versions 2.1.0 and 2.2.0 of the Pinecone Node.js SDK.
-
v2.1.0 adds support for listing the IDs of records in a serverless index. You can list all records or just those with a common ID prefix. Listing by common ID prefix is especially useful as part of managing RAG documents.
-
v2.2.0 adds an optional
sourceTag
that you can set when constructing a Pinecone client to help Pinecone associate API activity to the specified source.
See the v2.1.0 and v2.2.0 release notes in GitHub for more details.
-
-
Released versions 0.4.0 and 0.4.1 of the Pinecone Go SDK.
-
v0.4.0 is a comprehensive re-write and adds support for all current Pinecone API operations.
-
v0.4.1 adds an optional
SourceTag
that you can set when constructing a Pinecone client to help Pinecone associate API activity to the specified source.
See the v0.4.0 and v0.4.1 release notes in GitHub for more details.
-
-
Released version 0.8.1 of the Canopy SDK. This version includes bug fixes, the removal of an unused field for Cohere chat calls, and added guidance on creating a knowledge base with a specified record encoder when using the core libary. See the v0.8.1 release notes in GitHub for more details.
Documentation
- Added an explanation of the Pinecone serverless architecture, including descriptions of the high-level components and explanations of the distinct paths for writes and reads.
- Added considerations for querying serverless indexes with metadata filters.
- Added a Troubleshooting section, which includes content on best practices, troubleshooting, and how to address common errors.
February 2024
Features
- It is now possible to convert a pod-based starter index to a serverless index. For organizations on the Starter plan, this requires upgrading to Standard or Enterprise; however, upgrading comes with $100 in serverless credits, which will cover the cost of a converted index for some time.
SDKs
- The latest version of Pinecone’s Python SDK (v3.1.0) adds support for listing the IDs of records in a serverless index. You can list all records or just those with a common ID prefix. Listing by common ID prefix is especially useful as part of managing RAG documents. See the Python SDK release notes in GitHub for more details.
- The latest version of the Canopy SDK (v0.8.0) adds support for Pydantic v2. For applications depending on Pydantic v1, this is a breaking change; review the Pydantic v1 to v2 migration guide and make the necessary changes before upgrading. See the Canopy SDK release notes in GitHub for more details.
Documentation
- Improved the docs for setting up billing through the AWS marketplace and GCP marketplace.
- Added a Llamaindex integration guide on building a RAG pipeline with LlamaIndex and Pinecone.
- Updated the Langchain integration guide to avoid a namespace collision issue.
January 2024
Features
The new Pinecone API gives you the same great vector database but with a drastically improved developer experience. The most significant improvements include:
-
Serverless indexes: With serverless indexes, you don’t configure or manage compute and storage resources. You just load your data and your indexes scale automatically based on usage. Likewise, you don’t pay for dedicated resources that may sometimes lay idle. Instead, the pricing model for serverless indexes is consumption-based: You pay only for the amount of data stored and operations performed, with no minimums.
-
Multi-region projects: Instead of choosing a cloud region for an entire project, you now choose a region for each index in a project. This makes it possible to consolidate related indexes in the same project, even when they are hosted in different regions.
-
Global URL for control plane operations: Control plane operations like
create_index
,describe_index
, andlist_indexes
now use a single global URL,https://api.pinecone.io
, regardless of the cloud environment where an index is hosted. This simplifies the experience compared to the legacy API, where each environment has a unique URL.
SDKs
- The latest versions of Pinecone’s Python SDK (v3.0.0) and Node.js SDK (v2.0.0) support the new API mentioned above. To use the new API, existing users must upgrade to the new client versions and adapt some code. For guidance, see the Python SDK v3 migration guide and Node.js SDK v2 migration guide.
- The latest version of the Canopy SDK (v0.6.0) adds support for the new API mentioned above as well as namespaces, LLMs that do not have function calling functionality for query generation, and more. See the release notes in GitHub for more details.
Documentation
- The Pinecone documentation is now versioned. The default “latest” version reflects the new Pinecone API mentioned above. The “legacy” version reflects the previous API, which requires regional URLs for control plane operations and does not support serverless indexes.
Was this page helpful?