whoami
Yiming /i' miŋ/ Peng
I build data platforms that power decisions at scale.
Senior Data Engineer with a PhD in Machine Learning and 8+ years building scalable DataOps/MLOps platforms. Currently at Wētā FX, contributing to the operational data infrastructure behind world-class visual effects. Apache Airflow contributor.
$ ls ./featured-projects
DataOps Platform @ Wētā FX
Production on-premises data platform powering BI and ML workflows across Wētā FX. Built and maintains 25+ ETL pipelines ingesting from disparate sources — solving the persistent problem of unreliable, undocumented data hand-offs between departments.
Impact: Eliminated recurring pipeline failures; stakeholder teams now rely on it as production-critical infrastructure.
Apache Airflow — OSS Contributions
Contributor to Apache Airflow — the de facto standard for data pipeline orchestration used by thousands of organizations worldwide. Contributions focus on stability, usability, and operator improvements drawn from real production experience.
Links: GitHub → [PR links — add when available]
Data Quality & Observability System
Designed and built a platform-wide data quality framework from the ground up using Great Expectations — moving the team from reactive fire-fighting to proactive anomaly detection. Integrated into the CI/CD pipeline so quality checks run automatically on every deploy.
Impact: Dramatically reduced data incidents; stakeholders gained confidence to act on data without manual verification.
Kubernetes MLOps Platform
Co-designed and built a Kubernetes-based MLOps platform at Chorus NZ to close the gap between data science experimentation and production deployment. Enabled model training, versioning, and serving pipelines within a unified, reproducible infrastructure.
Impact: First production ML infrastructure at the organisation — made model deployment a routine operation rather than a heroic effort.
$ cat experience.log
Certifications
$ ls ./writing
$ Articles on data engineering, platform design, and lessons from production systems. Coming soon — will be published from Obsidian.