Integrate with Amazon S3
This feature is in public preview and available only on Standard and Enterprise plans.
To connect your Pinecone project to an Amazon S3 bucket, you need to create a storage integration. This integration allows Pinecone to access the data in your S3 bucket.
Once your integration is set up, you can use it to import data from your S3 bucket into a Pinecone index.
Before you begin
Ensure you have the following:
- A Pinecone account.
- An Amazon S3 bucket.
Create an IAM policy
In the AWS IAM console:
- In the navigation pane, click Policies.
- Click Create policy.
- In Select a service section, select S3.
- Select the following actions to allow:
ListBucket
GetObject
- In the Resources section, select Specific.
- For the bucket, specify the ARN of the bucket you created. For example:
arn:aws:s3:::example-bucket-name
- For the object, specify an object ARN as the target resource. For example:
arn:aws:s3:::example-bucket-name/*
- Click Next.
- Specify the name of your policy. For example: “Pinecone-S3-Access”.
- Click Create policy.
Set up access using an IAM role
In the AWS IAM console:
-
In the navigation pane, click Roles.
-
Click Create role.
-
In the Trusted entity type section, select AWS account.
-
Select Another AWS account.
-
Enter the Pinecone AWS VPC account ID:
713131977538
-
Click Next.
-
Select the policy you created.
-
Click Next.
-
Specify the role name. For example: “Pinecone”.
-
Click Create role.
-
Click the role you created.
-
On the Summary page for the role, find the ARN.
For example:
arn:aws:iam::123456789012:role/PineconeAccess
-
Copy the ARN.
You need the ARN to add the integration in Pinecone.
Add the integration in Pinecone
To add a storage integration through the Pinecone console, take the following steps:
- Select your project.
- Go to Storage integrations.
- Click Add integration.
- Enter a unique integration name.
- Select Amazon S3.
- Enter the ARN of the IAM role you created.
- Click Add integration.
Use the integration
Use the storage integration to import data from your S3 bucket into a Pinecone index. To use the integration, specify the integration ID, found on the Storage integrations page, in the import request.
Was this page helpful?