For general-purpose text retrieval, we recommend full-text search: it runs over your raw
text fields with BM25 ranking and Lucene query syntax.pinecone-sparse-english-v0 is the right choice for workflows that need a learned sparse-vector representation — for example, when your application already produces sparse vectors upstream of Pinecone, or when you’re pairing it with a dense encoder in a single-index hybrid workflow on the vector API.You can call the embed operation through Pinecone Inference to turn text into vectors without writing to an index. That differs from upsert_records on an index with integrated embedding, where each request embeds and stores records in one step. To see how embedding consumption appears in billing and usage reports, see Embedding tokens.input_type as either query or passage. When creating an index with integrated embedding, input_type defaults to query for reads and passage for writes. Optionally, you can:- Return the string tokens using
"return_tokens": true. - Raise the max input tokens limit from the default of
512to the maximum of2048using"max_tokens_per_sequence": 2048. - Return an error when the input exceeds
max_tokens_per_sequenceusing"truncate": "NONE".