Vector Database for AI - Serverless infrastructure for model development and production workloads.
Data infrastructure optimized for LLM and deep-learning applications
Vector Storage and Search
Store and search vectors and their metadata on serverless and scalable infrastructure. Use our hosted solution or deploy on your infrastructure.Learn more
Data Querying
Query your datasets to generate insights and uncover opportunities for improvement. All queries are saved and versioned, and your journey from raw data to trainable datasets is blazing.Query docs
Data Streaming
Whether you like PyTorch or TensorFlow, we got you covered. Connect your datasets to common ML frameworks with minimal code, and stream data while training without sacraficing GPU utilization.Learn more
Data Version Control
Modify dataset elements across different versions and seamlessly switch between them. Our intuitive Python API works with datasets of any size and overcomes the limitations of file-based version control.Learn more
Data Visualization
Maximize dataset quality by visualizing datasets of any size from the browser, without having to download data locally.Coco example
Store Data Anywhere
Store data locally, on Google Cloud, MinIO, AWS S3, Azure as well as Activeloop storage (no servers required). Directly stream datasets from long-term storage to ML workflows. It's that fast.Learn more
Team Collaboration
Keep your datasets private, share them with your organization, or with anyone on the web. We handle all the user-access management.
Transformations at Scale
Rapidly modify or process your data in order to find the optimal dataset for your models. Scale to hundreds of machines with one line of code.Learn more
- 16
coco-train
Created on: 9/3/2022
- 15
imagenet-train
Created on: 6/22/2022
- 15
cifar100-test
Created on: 12/11/2021
- 6
nih-chest-xray-train
Created on: 2/13/2022
- 6
mnist-train
Created on: 1/17/2022
- 5
wiki-art
Created on: 1/14/2022
- 5
ffhq
Created on: 6/12/2022
- 4
lsp-train
Created on: 3/17/2022
- 3
timit-train
Created on: 1/8/2022
- 3
nsynth-train
Created on: 2/6/2022
- 3
mura-train
Created on: 1/2/2022
- 3
lowlight-train
Created on: 2/8/2022
- 3
imagenet-test
Created on: 6/22/2022
- 3
gtzan-genre
Created on: 12/14/2021
- 2
timit-test
Created on: 1/8/2022
Public datasets1462
- 16
coco-train
Created on: 9/3/2022
- 15
imagenet-train
Created on: 6/22/2022
- 15
cifar100-test
Created on: 12/11/2021
- 6
SunLake
Created on: 7/8/2023
- 6
nih-chest-xray-train
Created on: 2/13/2022
- 6
mnist-train
Created on: 1/17/2022
- 5
wiki-art
Created on: 1/14/2022
- 5
ffhq
Created on: 6/12/2022
- 4
lsp-train
Created on: 3/17/2022
- 3
timit-train
Created on: 1/8/2022
- 3
nsynth-train
Created on: 2/6/2022
- 3
mura-train
Created on: 1/2/2022
- 3
lowlight-train
Created on: 2/8/2022
- 3
imagenet-test
Created on: 6/22/2022
- 3
gtzan-genre
Created on: 12/14/2021