WordCamp Asia 2026 — Mahangu Weerasinghe
SLIDES

Quick Start
Prerequisites: uv, git, make. Windows users: use WSL. See the GitHub repo for more details.
git clone https://github.com/mahangu/meltano-wordpress-org-data-starter-project.git
cd meltano-wordpress-org-data-starter-project
make quickstart
The Problem
The WordPress.org API exposes useful theme and plugin data, but using it directly means writing a lot of code.
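To give a sense of what "a lot of code" means, here is a minimal sketch of paging through the plugin directory by hand. It assumes the public `query_plugins` action of the WordPress.org plugins API (version 1.2) and a response with `info.pages` and `plugins` keys. The helper names `build_url`, `fetch_page`, and `fetch_all_plugins` are hypothetical and not part of the starter project.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint for the WordPress.org plugins API.
API_URL = "https://api.wordpress.org/plugins/info/1.2/"


def build_url(page, per_page=100):
    """Build a query_plugins URL for one page of results."""
    params = {
        "action": "query_plugins",
        "request[page]": page,
        "request[per_page]": per_page,
    }
    return API_URL + "?" + urlencode(params)


def fetch_page(page):
    """Download and decode one page (performs a real HTTP request)."""
    with urlopen(build_url(page), timeout=30) as response:
        return json.load(response)


def fetch_all_plugins(fetch=fetch_page, max_pages=None):
    """Walk the pages in order and collect plugin records into one list."""
    plugins = []
    page = 1
    while True:
        data = fetch(page)
        plugins.extend(data.get("plugins", []))
        # Assumed response shape: {"info": {"pages": N}, "plugins": [...]}
        total_pages = data.get("info", {}).get("pages", page)
        if page >= total_pages:
            break
        if max_pages is not None and page >= max_pages:
            break
        page += 1
    return plugins
```

Even this leaves out retries, rate limiting, schema handling, and storage, and themes and events would each need similar code.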
The Solution: The Open Source Data Stack
Four open-source tools, all running locally:
- Meltano — Extract, Transform, and Load (ETL) data. Uses our custom extractor tap-wordpress-org and the target-duckdb loader.
- DuckDB — Store and query large datasets fast, locally.
- Jupyter — Run everything in a local notebook.
- Python — Powers it all.
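Once Meltano has loaded data into DuckDB, querying it from Python takes only a few lines. The sketch below assumes the third-party `duckdb` package is installed. The helpers `open_database` and `count_rows` are hypothetical, and any database path or table name you pass in is your own; `make check-data` shows what the project actually created.

```python
def open_database(path):
    """Open a DuckDB file read-only (needs the third-party duckdb package)."""
    import duckdb  # imported lazily so count_rows works without it

    return duckdb.connect(path, read_only=True)


def count_rows(con, table):
    """Return the row count of `table` on any DB-API style connection."""
    # Table names cannot be bound as query parameters, so validate the
    # identifier before interpolating it into the SQL string.
    if not table.replace("_", "").isalnum():
        raise ValueError(f"unexpected table name: {table!r}")
    return con.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
```

In a notebook cell this would be used as `count_rows(open_database(path), table)`, with `path` and `table` set to the values from your local setup.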
Available Commands
make help             Show available targets
make quickstart       Install, create sample data, extract events, launch notebook
make extract-plugins  Extract WordPress plugins data
make extract-events   Extract WordPress events data
make extract-themes   Extract WordPress themes data
make extract-all      Extract all available data streams
make sample-data      Create sample data from WordPress.org API
make notebook         Start Jupyter notebook
make check-data       Check what data is in the database
Extra Credit
LLMs can help with all of these:
- Extract more data: make extract-all
- Save to CSV or other formats
- Create charts with matplotlib, seaborn, or plotly
- Build interactive dashboards with Streamlit
- Keep data fresh with Meltano schedules and Apache Airflow
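As one example, the "Save to CSV" idea above can be sketched with the standard library alone. This is not the project's own export path: `export_query_to_csv` is a hypothetical helper, and it assumes only a DB-API style connection whose `execute()` result exposes `description` and `fetchall()`, which DuckDB and SQLite connections both do.

```python
import csv


def export_query_to_csv(con, sql, path):
    """Run `sql` on `con` and write the result, with a header row, to `path`."""
    cursor = con.execute(sql)
    # The first element of each description entry is the column name.
    header = [column[0] for column in cursor.description]
    rows = cursor.fetchall()
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)
    return len(rows)
```

DuckDB can also write CSV natively with a `COPY (SELECT ...) TO 'file.csv' (HEADER)` statement, which avoids pulling rows into Python for large tables.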