My Resume
Senior freelance SRE with 10 years of experience across observability, Kubernetes, Proxmox/Ceph, automation, and AI-assisted SRE tooling.
Summary
Senior freelance SRE with 10 years of experience operating infrastructure that can’t quietly fail. I work across high-volume observability, Kubernetes platform engineering, on-prem Proxmox/Ceph clusters, and AI-assisted SRE tooling.
Open Source Contributions
🌟 SSHplex
Built and maintained an open source terminal UI for SSH connection multiplexing, designed for infrastructure teams that need fast host discovery, bulk operations, and persistent sessions.
- GitHub Repository: SSHplex
- Blog Post: Building SSHplex
- Combines NetBox, Ansible, Consul, and static lists as sources of truth for hosts and devices
- Supports three mux backends: tmux standalone, tmux + iTerm2, and native iTerm2 on macOS
- Provides broadcast commands and persistent sessions to replace expensive legacy tooling
Experience
Kindred France | Site Reliability Engineer | 2021 - Present
- Progressed from System Engineer to Site Reliability Engineer, shifting focus from infrastructure automation toward platform reliability, observability, diagnostics, and performance.
- Operate observability workflows around Kubernetes with Thanos, Loki, Grafana, and Vector as core technologies.
- Built a HouseKeeping tool to diagnose stale and broken Grafana resources, reducing dashboard/config drift and improving platform hygiene.
- Built a Search Query Exporter to diagnose query slowness and establish SLOs across Thanos and Loki.
- Designed an SLO Dashboard Framework to standardize service-level visibility and make reliability reporting easier to adopt across teams.
- Building Graphia, a domain-specific SRE agent for Grafana diagnosis, with RBAC-aware behavior, MCP-based diagnosis flows, and safeguards for enterprise operations.
- Work hands-on daily with Helm charts, Argo CD, container image lifecycle, Jenkins, GitLab, AWS CloudWatch, and CUR2 cost analysis.
Current stack and ownership
| Area | Components/Tools |
|---|---|
| Observability | Grafana, Loki, Thanos, Vector, AWS CloudWatch |
| Platform Engineering | Kubernetes, Helm, Argo CD, Container Images |
| CI/CD & Automation | Jenkins, GitLab CI, Terraform, Ansible |
| Data & Storage | Kafka, Redis, PostgreSQL, Microsoft SQL Server, Couchbase |
| Programming & AI | Go, Python, Bash, AI, MCP |
Previous impact within the same company
- Led the automated deployment of VMs and applications through CI/CD, enabling multiple deployments per day.
- Used Terraform to deploy across 10 datacenters and 4 providers (OpenStack, Proxmox, vSphere, NetBox) from shared templates.
- Used Ansible for VM initialization and application deployment, with Consul feeding service pools for HAProxy and Prometheus.
- Operated multi-cluster observability at multi-TB/day ingestion across logs, metrics, and traces, with Kafka pipelines feeding SIEM, logging, EDR, APM, and uptime monitoring.
- Integrated a highly available Proxmox cluster across 4 racks and 2 datacenters with Ceph, including PXE-based automation and 25 Gb networking per host.
- Accountable for the French security scope, driving remediation work for vulnerabilities and production hardening.
VINC | System Engineer | 2019 - 2021
- Architected the new platform with new BGP routers and firewalls.
- Managed a Proxmox cluster across 2 datacenters.
- Responsible for SLA and client communication during production incidents.
- Built websites tailored to client needs.
- Implemented a new DNS stack with high availability in mind.
Multi-Visp / Azuria | System Administrator | 2017 - 2019
- Installed complete new racks in Telehouse2.
- Handled cable management between two rooms.
- Installed managed Wi-Fi equipment.
- Implemented high-availability multi-datacenter VPN services.
Contact Information
- Email: [email protected]