search
operation with the rerank
parameter, including the hosted reranking model you want to use, the number of reranked results to return, and the fields to use for reranking, if different than the main query.
For example, the following code searches for the 3 records most semantically related to a query text and uses the hosted bge-reranker-v2-m3
model to rerank the results and return only the 2 most relevant documents:
_score
represents the relevance of a document to the query, normalized between 0 and 1, with scores closer to 1 indicating higher relevance.
rerank
operation with the hosted reranking model you want to use, the query results and the query, the number of ranked results to return, the field to use for reranking, and any other model-specific parameters.
For example, the following code uses the hosted bge-reranker-v2-m3
model to rerank the values of the documents.chunk_text
fields based on their relevance to the query and return only the 2 most relevant documents, along with their score:
cohere-rerank-3.5
cohere-rerank-3.5
is Cohere’s leading reranking model, balancing performance and latency for a wide range of enterprise search applications.Detailscohere-rerank-3.5
model supports the following parameters:Parameter | Type | Required/Optional | Description | |
---|---|---|---|---|
max_chunks_per_doc | integer | Optional | Long documents will be automatically truncated to the specified number of chunks. Accepted range: 1 - 3072 . | |
rank_fields | array of strings | Optional | The fields to use for reranking. The model reranks based on the order of the fields specified (e.g., ["field1", "field2", "field3"] ). | ["text"] |
bge-reranker-v2-m3
bge-reranker-v2-m3
is a high-performance, multilingual reranking model that works well on messy data and short queries expected to return medium-length passages of text (1-2 paragraphs).Detailsbge-reranker-v2-m3
model supports the following parameters:Parameter | Type | Required/Optional | Description | Default |
---|---|---|---|---|
truncate | string | Optional | How to handle inputs longer than those supported by the model. Accepted values: END or NONE .END truncates the input sequence at the input token limit. NONE returns an error when the input exceeds the input token limit. | NONE |
rank_fields | array of strings | Optional | The field to use for reranking. The model supports only a single rerank field. | ["text"] |
pinecone-rerank-v0
pinecone-rerank-v0
is a state of the art reranking model that out-performs competitors on widely accepted benchmarks. It can handle chunks up to 512 tokens (1-2 paragraphs).Detailspinecone-rerank-v0
model supports the following parameters:Parameter | Type | Required/Optional | Description | Default |
---|---|---|---|---|
truncate | string | Optional | How to handle inputs longer than those supported by the model. Accepted values: END or NONE .END truncates the input sequence at the input token limit. NONE returns an error when the input exceeds the input token limit. | END |
rank_fields | array of strings | Optional | The field to use for reranking. The model supports only a single rerank field. | ["text"] |