Use the Inference API to generate vector embeddings and rerank results using embedding models and reranking models hosted on Pinecone’s infrastructure.You can use the Inference API as a standalone service, via the embed and rerank endpoints, or integrated with Pinecone’s database operations, via the create_for_model, upsert_records, and search_records endpoints.The following Pinecone SDKs support using the Inference API: