Inference
Build end-to-end faster with models hosted by Pinecone.
Request a modelllama-text-embed-v2
NVIDIATask | Embedding |
Modality | Text |
Max Input Tokens | 2048 |
Price | $0.16 / million tokens |
multilingual-e5-large
MICROSOFTTask | Embedding |
Modality | Text |
Max Input Tokens | 507 |
Price | $0.08 / million tokens |
cohere-rerank-3.5
COHERETask | Rerank |
Modality | Text |
Max Input Tokens | 4096 |
Price | $2.00 / 1k requests |

pinecone-sparse-english-v0
PINECONETask | Embedding |
Modality | Text |
Max Input Tokens | 512 |
Price | $0.08 / million tokens |

bge-reranker-v2-m3
BAAITask | Rerank |
Modality | Text |
Max Input Tokens | 1024 |
Price | $2.00 / 1k requests |