PUBLIC DATA CATALOGUE

Curated datasets for every domain.

Hand-organised list of publicly available datasets across research domains. Browse by domain hierarchy or search by name / keyword.

Public

AERONET AOD

Climate & Environment › Renewable Energy › Solar PV › Soiling Loss

Ground-based aerosol optical depth measurements globally.

NASA AERONET· —CSV
Public on registration

DHS Nepal 2022

Health Sciences › Public Health

Demographic and Health Surveys data for Nepal 2022 round.

USAID/DHS Program· 1.2 GBSPSSStataCSV
CC-BY-NC

TON_IoT Telemetry + Attacks

Computer Science › IoT

Telemetry data from IoT/IIoT sensors with concurrent network-layer attack labels.

UNSW Canberra· 20 GBCSVPCAP
CC-BY

ERA5 Reanalysis

Climate & Environment › Hydrology

Global reanalysis dataset (1940–present) with hourly atmospheric variables.

ECMWF Copernicus· 100+ TBNetCDFGRIB
CC-BY

WHO Global Health Observatory

Health Sciences › Public Health

Indicator data for global, regional and country health statistics.

WHO· —CSVJSON
Public

OpenWebText2 (Nepali subset)

Computer Science › AI / ML › NLP for Low-Resource Languages › Nepali NLP

Curated Nepali web text corpus suitable for pre-training and instruction tuning.

EleutherAI· 12 GBJSONL

Need a dataset we haven't listed?

Tell us what you're looking for — we'll add curated entries weekly.

Request a dataset

Made with Emergent