SRE Consulting
I help engineering leaders stop firefighting and start designing for reliability. Through reliability targets, operational ownership, and measurable outcomes — not more tools.
Start with an Explorer →Sound Familiar?
New leadership. Years of accumulated issues nobody owns. Monitoring that exists but doesn't drive decisions. Your teams are aware something needs to change — but fixing everything at once isn't an option. You need a structured way to prioritise.
Most organisations overestimate their monitoring and underestimate their gaps. I map what's real — what's measured, what's missing, and who owns what — so you stop guessing.
Leadership wants “better reliability.” But what does that mean for each team? I help you define measurable targets that connect engineering work to what your customers actually experience.
Reliability that depends on heroics doesn't scale. Error budgets, ownership, and reviews become part of your operating rhythm — not an extra burden on top of delivery.
Everything I build is designed to be owned by your team. Documented processes, trained engineers, proven playbooks. My goal is to make myself unnecessary.
The Framework
Most organisations start at Level 1-2. The biggest ROI is in the jump to Level 3. Not everyone needs Level 5.
Self-Assessment
Most teams score Level 1-2 and don't realise it until a customer finds the problem first. Leave your email — I'll send you the self-assessment so you can see where you stand and what closing the gap looks like.
Done — check your inbox shortly.
How We Work Together
Every engagement starts with Explorer. If the findings don't warrant going further, you keep the assessment. No lock-in, no assumptions.
Find out where you stand. A structured reliability audit across your product domains — giving you the data to decide what to prioritise and where to invest.
From “we know the gaps” to “we have a plan and a working pilot.” I build the reliability strategy, present it to your leadership for alignment, then run workshops where your teams define their own targets — and implement monitoring for the pilot domain.
Hands-on support for teams rolling out reliability improvements across domains. I work alongside your engineers — building, coaching, and reviewing until the capability is theirs.
Proof of Work
“8 production alarms in ALARM state. Zero subscribers. The team learned about outages from customers.”
Trading platform, AWS EKS — Level 1 to Level 3 in one quarter. First issue caught by monitoring before a customer reported it.
About
Reliability problems are rarely technical problems. They're organizational problems wearing technical costumes. Teams don't need another monitoring tool. They need clear ownership, measurable targets, and the operational discipline to act on what the data tells them.
12+ years in DevOps and SRE — from enterprise DevOps frameworks to global Kubernetes platforms and observability architectures. I work alongside teams until the practices stick and the capability is theirs to own. Currently implementing reliability frameworks for trading platforms and data platforms on AWS.
Get Started
No pitch — just a useful discussion about your reliability challenges and whether working together makes sense.
Start a conversation →