←All datasets
Directory · Datasets · Multimodal
MultimodalDataComp-1B
Rigorously-filtered 1.28B image-text pairs. Quality-filtered subset that outperforms raw LAION at equivalent scale.
Size
1.28B pairs
Format
parquet
License
CC-BY-4.0
Maintainer
University of Washington / LAION / HuggingFace
What it\u2019s for
Rigorously-filtered 1.28B image-text pairs. Quality-filtered subset that outperforms raw LAION at equivalent scale.