00/Washington, DC · building the cloud at home

Hi, I'm Kian.
A systems architect, building the cloud at home.
Slightly caffeinated.

When I'm not pushing buttons, I'm trying to not set off my smoke alarm, burying my nose in a fantasy novel, and performing my duties as a cat dad. What's left of my time goes toward endlessly tinkering with my homelab when I get that "is Plex down?" text message.

Currently working on
  • Running a multi-AZ homelab on bare-metal Talos
  • Scaling a design system across 50+ apps on legacy and modern stacks
  • AI tooling foundations for a 900-engineer org
  • Wiring local LLMs into the homelab's monitoring
8+ years shipping production-grade software
7 years in fintech
$12M revenue running through systems I've shipped
16 books read in 2025
1 cat
0 days since the last 'is Plex down?' text
01/Selected work

Three systems that had to hold.

Seven years at Mastercard
Data · Streaming — 01

A pipeline that doesn't flinch under burst load.

An event-driven pipeline that handles 20M+ transactions an hour on finite hardware. Autoscaling kicks in before the bursts land. Pre-warmed caches keep aggregations inside the 1-hour SLA.

20M+transactions per hour
1 hrhard SLA under burst load
KafkaSparkHadoopSpring BootPostgreSQL
Design System · Platform — 02

Breaking silos 20 years in the making.

Rolled out a shared design system to 50+ apps across 30+ teams, on a legacy financial platform that's been around longer than most of us. 7 apps migrated in 6 months, 28 more in flight. Developer velocity up 25%, DX satisfaction up 50%.

900+engineers on one system
+50%DX satisfaction lift
AngularTypeScriptStorybookFigmaGitOps
Architecture · Rewrite — 03

Rewrote the auth nobody understood.

A SAML-heavy auth backbone no one on the team fully understood. We rebuilt it piece by piece, without downtime. RAG surfaced what the old code actually did, and agent skills wrote the docs as we went. Context coverage went from under 20% to 85%. Onboarding dropped from 4 weeks to 1.

85%context coverage
1 weeknew-hire onboarding
RAGAgent skillsJava 17Spring Boot 6GitHub wiki
02/The workshop

What enterprise instincts look like after hours.

Self-hosted, self-financed
Infrastructure · Kubernetes

Running a production-grade cloud in the basement.

Multi-AZ Talos Kubernetes on Proxmox, declaratively managed with Ansible + ArgoCD. Same GitOps I use at work. The difference: when the power goes out, I'm the one with the flashlight.

TalosKubernetesProxmoxArgoCDAnsible
Security · Networking

A home network that trusts nothing by default.

Deny-by-default, allowlist-first. Six VLANs, VPN-only remote access, DMZ for anything that touches the public internet.

VLANsVPNDMZ
Storage · Durability

Multi-AZ NAS that follows the 3-2-1 rule.

Two TrueNAS nodes running ZFS, with Cloud Sync + RSync replicating across availability zones. The cat videos are not allowed to disappear.

TrueNASZFSCloud SyncRSync
Agentic Ops

Teaching a local LLM to run the ops rotation.

A local LLM watches my monitoring stack. It triages new alerts and tunes thresholds when the signal drifts. Ansible still runs Prometheus and Grafana. The LLM gets the 3 AM pages now.

Local LLMAnsiblePrometheusGrafana
03/The lab

The lab everything else on this page actually runs on.

Declarative from the metal up

Three Talos clusters, all managed from Git. ArgoCD does the syncing, LGTM does the watching. This site runs on it, so if the page loaded, the lab's alive.

TalosProxmoxKubernetesArgoCDAnsibleEnvoy GatewayCloudNativePGMetalLBGrafanaGitHub Actions