Independent consultant helping engineering organizations modernize deployment infrastructure, eliminate operational toil, and build reliability practices that hold up at scale.
8 years of experience, including 5 years operating large-scale Kubernetes infrastructure on AWS. Work spans the full platform lifecycle — from greenfield GitOps buildouts and observability deployments to on-call process design and IaC library development. Engagements typically serve organizations of 50–200+ engineers operating Kubernetes-based microservice environments on AWS.
Engagements across the full platform lifecycle — technical architecture and the operational processes built around it.
Greenfield or modernization. Argo, Helm, Kustomize. Gated repo architectures, environment state management, automated validation, and developer self-service workflows.
Prometheus, Grafana, Elastic. Standardized metrics, dashboards, and logging patterns across multi-team environments — replacing fragmented monitoring with maintainable, org-wide standards.
Module design with least-privilege patterns, automated testing, and CI/CD-driven deployments. Enterprise-grade module libraries consumed across large multi-team portfolios.
Versioned, modular pipeline libraries (GitHub Actions, Bash, Python, Go) covering the full delivery lifecycle: build, security scanning, drift detection, change management, release notes.
On-call structure, incident response workflows, severity definitions, postmortem processes, and cross-timezone communication standards — designed to be durable and adopted, not just documented.
Tooling, intake processes, and workflows that help engineering ideas move from prototype to org-wide adoption, with developer-facing diagnostics and self-serve deployment visibility.
EKS cluster management, Istio service mesh, Kong ingress, and AWS Controllers for Kubernetes (ACK), across multi-cluster, multi-account environments.
Replacing brittle, opaque deployment systems with auditable, maintainable platforms — with migration planning that preserves continuity for existing workloads.
Embedded analysis of application service flows to identify critical execution paths and instrumentation gaps, followed by Go instrumentation code that adds the missing Prometheus metrics and enables alerting on previously opaque paths.
A sample of recent work. Full engagement history available on request or in the resume.
Multi-year embedded SRE engagement supporting Kubernetes-based microservice application suites in Java and Go. Direct ownership of one application platform spanning 3 AWS accounts and 6 EKS clusters; broader portfolio contribution across ~10 applications, 30 AWS accounts, and 50 clusters, in an engineering organization of 100+ developers.
Systems engineering engagement supporting GDIT's Global Shared Services Program for the U.S. Department of State.
Available for consulting engagements focused on platform reliability, Kubernetes operations, and developer-facing infrastructure. Reach out to discuss scope.