You can retrieve the context snippets that Pinecone Assistant uses to generate its responses. This data includes relevant chunks, relevancy scores, and references.

Use cases

Retrieving context snippets is useful for performing tasks like the following:

  • Understanding what relevant data snippets Pinecone Assistant is providing to the LLM for chat generation.
  • Using the retrieved snippets with your own LLM.
  • Using the retrieved snippets with your own RAG application or agentic workflow.

SDK support

The Pinecone Python SDK and Node.js SDK provide convenient programmatic access to retrieve context snippets.

To interact with Pinecone Assistant using the Python SDK, upgrade the client and install the pinecone-plugin-assistant package as follows:

HTTP
pip install --upgrade pinecone pinecone-plugin-assistant

Pricing

Context retrieval usage is measured in tokens, similar to Pinecone Assistant. See Pricing for up-to-date pricing information.

Pricing updates specific to context retrieval will be made as the feature becomes generally available.