juanmoisesd/latamdata-py


latamdata-py: Python Package for Latin American Research Datasets


Part of the Open Research Collection — 1,273+ datasets · Author: Juan Moisés de la Serna Tuya · UNIR


Overview

latamdata-py is a Python package that provides one-line access to 38+ open research datasets from Latin America, covering neuroscience, mental health, epidemiology, and social determinants of health.

All datasets are free, open, and available under CC0 or CC-BY-4.0 licenses.


Installation

pip install latamdata-py

Requirements: Python 3.8+ · pandas · requests


Quick Start

import latamdata as ld

# List all available datasets (returns a pandas DataFrame)
print(ld.list_datasets())

# Load a dataset as a pandas DataFrame
df = ld.load("alzheimer-latam")
print(df.head())

# Get metadata for a specific dataset (returns a dict)
print(ld.info("mental-health-specialists"))

Available Datasets

| Dataset ID | Title | Topic | DOI |
|---|---|---|---|
| alzheimer-latam | Alzheimer's Disease in Latin America (2000–2050) | Neuroscience | 10.7910/DVN/UVHABW |
| mental-health-specialists | Mental Health Specialists in Ibero-America (2000–2026) | Mental Health | 10.5281/zenodo.18984813 |
| generative-ai-research | Generative AI in Academic Research (2022–2025) | AI & Education | |
| psychological-wellbeing-latam | Psychological Wellbeing of Researchers in Latin America | Mental Health | |
| global-mental-health | Global Mental Health Statistics | Epidemiology | |
| ai-educational-impact-latam | AI Educational Impact in Latin America | AI & Education | |
| mental-health-lockdown | Mental Health and Lockdown Dataset | Mental Health | |
| adhd-latam | ADHD in Latin America — GBD 1990–2026 | Neuroscience | |

Use ld.list_datasets() to see the full list of 38+ datasets at runtime.


API Reference

latamdata.load(dataset_name, **kwargs)

Load a dataset as a pandas DataFrame.

df = ld.load("alzheimer-latam")

| Parameter | Type | Description |
|---|---|---|
| dataset_name | str | Dataset identifier (see table above) |
| **kwargs | | Extra arguments passed to pandas.read_csv() |
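
Because extra keyword arguments are passed through to pandas.read_csv(), standard read_csv options such as nrows should work for previewing a large dataset. The sketch below is only an illustration: load_preview is a hypothetical helper, not part of latamdata, and it takes the loader as a parameter so it can be tried without the package installed.

```python
from typing import Any, Callable


def load_preview(dataset_name: str, loader: Callable[..., Any], nrows: int = 100) -> Any:
    """Load only the first `nrows` rows of a dataset.

    `loader` is assumed to behave like latamdata.load: it takes the dataset
    identifier and forwards extra keyword arguments to pandas.read_csv().
    """
    # nrows is a standard pandas.read_csv() argument that limits the rows parsed.
    return loader(dataset_name, nrows=nrows)
```

With the package installed, load_preview("alzheimer-latam", ld.load, nrows=50) would be equivalent to ld.load("alzheimer-latam", nrows=50), assuming the dataset is served as CSV.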

latamdata.list_datasets()

Returns a DataFrame with all available datasets (name, title, DOI, keywords).
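
Since the catalog comes back as a DataFrame, it can be searched like any other table. The following is a minimal sketch, assuming the columns are literally named title and keywords (the README lists the fields but not their exact column names). filter_records is a hypothetical helper that works on plain row dictionaries.

```python
from typing import Any, Dict, Iterable, List


def filter_records(records: Iterable[Dict[str, Any]], term: str) -> List[Dict[str, Any]]:
    """Return the catalog rows whose title or keywords mention `term`."""
    needle = term.lower()
    matches = []
    for row in records:
        # Keywords may be stored as a string or a list; formatting as text handles both.
        haystack = f"{row.get('title', '')} {row.get('keywords', '')}".lower()
        if needle in haystack:
            matches.append(row)
    return matches
```

A possible call is filter_records(ld.list_datasets().to_dict("records"), "mental health"), where to_dict("records") is the standard pandas way to turn a DataFrame into a list of row dictionaries.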

latamdata.info(dataset_name)

Returns a dict with metadata for a single dataset (title, URL, DOI, description, keywords).
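
The metadata dict can be turned into a short reference line for notebooks or reports. This sketch assumes the keys are spelled title, doi, and url in lowercase, which the README does not confirm. format_reference is a hypothetical helper and falls back gracefully when a key is missing.

```python
from typing import Any, Mapping


def format_reference(meta: Mapping[str, Any]) -> str:
    """Build a one-line reference from a dataset metadata mapping."""
    title = meta.get("title") or "Untitled dataset"
    doi = meta.get("doi")
    url = meta.get("url")
    if doi:
        # Prefer the DOI resolver link when a DOI is present.
        return f"{title}. https://doi.org/{doi}"
    if url:
        return f"{title}. {url}"
    return f"{title}."
```

With the package installed, print(format_reference(ld.info("alzheimer-latam"))) would print the title followed by a DOI link, provided the key names match the assumption above.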


Citation

If you use latamdata-py in your research, please cite:

@software{serna_tuya_2026_latamdata,
  author    = {Serna Tuya, Juan Moisés},
  title     = {latamdata-py: One-line access to 38 open research datasets from Latin America},
  year      = {2026},
  version   = {1.0.0},
  doi       = {10.5281/zenodo.19145316},
  url       = {https://github.com/juanmoisesd/latamdata-py}
}

License

Code: MIT · Data: CC0 1.0 Universal


Juan Moisés de la Serna Tuya · UNIR · Harvard Dataverse
