Multimodal

DataComp-1B

1.28B rigorously filtered image-text pairs. A quality-filtered subset that outperforms raw LAION at equivalent scale.

Size
1.28B pairs
Format
parquet
License
CC-BY-4.0
Maintainer
University of Washington / LAION / HuggingFace

What it’s for

DataComp-1B provides 1.28B rigorously filtered image-text pairs. This quality-filtered subset outperforms raw LAION at equivalent scale.