Platform Engineer

Secure cloud platforms for production-grade agentic AI.

I build the secure, scalable platforms that production agentic AI systems run on. With 20+ years across telecommunications and banking, I currently build petabyte-scale cloud platforms and write LangChain agents, MCP integrations, and AI-powered automation on top of them.

Melbourne, Australia · 20+ years in tech · GitHub · LinkedIn

Recent posts

Platform Engineering · Infrastructure

Supply Chain Security: The Seven-Day Delay That Protects Your Production Systems

How I protect 15+ projects across Python, JavaScript, and Rust from supply chain attacks using a three-layer defence: registry-level delays, automated PR scheduling, and lockfile discipline.

1 Apr 2026 · 14 min read
Platform Engineering · Agentic AI

Why agentic SDLC needs a schema-first approach

Convention-driven PRD-to-HLD-to-LLD pipelines and why schema-gen should be your first commit.

28 Mar 2026 · 8 min read
Agentic AI · Platform Engineering

Agentic Ops: Working Backwards from the Metric That Matters

Start from a single business SLA — data freshness under 60 seconds — and trace backwards through dependency trees, metadata layers, known-error memory, and automated fixes to build an AI-operated production platform.

15 Mar 2026 · 14 min read
Platform Engineering · Infrastructure

Why Gatus Is My Preferred Health Check Tool (And Why Uptime Monitoring Isn't Enough)

Uptime tools tell you a service is running. Gatus tells you the data pipeline is actually working. How I use 73 custom health checks to monitor infrastructure, data freshness, and pipeline completeness.

1 Mar 2026 · 8 min read
Databricks · Python

Benchmarking PySpark shuffle: what the metrics actually tell you

Building a benchmarking utility for shuffle and network transfer metrics in Databricks clusters.

20 Feb 2026 · 7 min read
Data Engineering · Big Data

Upgrading a Large Hadoop Cluster

A detailed account of upgrading a large telco Hadoop cluster from HDP 2.6.4 to 3.1.5, covering practice runs, planning strategies, and lessons from executing the upgrade during COVID remote work.

18 May 2020 · 6 min read
View all posts →

Archive

Writing since 2011 — 37 posts across 9 years.