Loading...
Back to Home
resources · playbooks · free

Engineering playbooks

Battle-tested checklists, runbooks, and templates we use on every project. Free, no fluff, no consultant-speak — drop your email and download.

01
downloads

What you can grab today

Cost OptimizationInteractive · 5 min

Cloud Cost TCO Calculator

Project your 3-year cloud total cost of ownership with RI coverage and idle waste assumptions. See where the real money leaks.

Open calculator →
KubernetesInteractive · 5 min

Kubernetes Rightsizing Calculator

Estimate monthly and annual savings from rightsizing your cluster toward a 65% utilization sweet spot. Bring numbers to the next infra review.

Open calculator →
Engineering LeadershipInteractive · 5 min

DevOps ROI Calculator

Quantify engineer hours lost to deploys and incidents, then compare against an automation investment. Payback in months, not marketing slides.

Open calculator →
SustainabilityInteractive · 5 min

Cloud Carbon Footprint Estimator

Estimate annual tonnes of CO2e from your compute spend by region. See what moving workloads to the greenest region would save.

Open calculator →
AI ComplianceInteractive · 20 min

VibeCheck AI — Compliance Scorecard

20-question scorecard covering ISO 42001, EU AI Act, and NIST AI RMF. Scores your AI governance across 6 domains and shows the top gaps to fix first.

Start scorecard →
DevOps MaturityInteractive · 15 min

DevOps Maturity Self-Assessment

Answer 15 questions across CI/CD, IaC, observability, security, and reliability — get a scored report with concrete next steps.

Take assessment →
ISO 42001PDF · 26 pages

ISO 42001 Implementation Runbook

Gap analysis, controls mapping, and technical implementation guide for ISO/IEC 42001 (AI Management System). Includes Terraform skeleton and CI/CD templates for model governance.

EU AI ActPDF · 18 pages

EU AI Act Technical Readiness Checklist

Engineering-focused readiness for EU AI Act (enforcement Aug 2026): risk classification, data governance, logging, human oversight, and post-market monitoring — with code patterns, not legalese.

AI SecurityPDF · 20 pages

LLM Security & Red-Team Checklist

OWASP LLM Top 10 translated into concrete tests: prompt injection, training data poisoning, insecure plugins, excessive agency. Includes fuzzing scripts and red-team scenarios.

AI GovernanceNotion Template

AI Model Cards & Data Provenance

Notion template for documenting models per ISO 42001 and EU AI Act transparency requirements. Intended use, training data, bias evaluation, human oversight — all in one place.

AI GovernanceNotion Template

AI Coding Assistant Governance Template

Enterprise policy for Cursor, Copilot, and Claude Code. SBOM, attestation, secrets handling, and IP exposure covered.

AI InfrastructurePDF · 24 pages

MCP Server Implementation Guide

Model Context Protocol server architecture, auth, observability, and common pitfalls. Build production-grade MCP integrations.

AI InfrastructurePDF · 22 pages

LLM Gateway Architecture Blueprint

Portkey, Langfuse, and Helicone patterns for rate limiting, routing, cost control, and fallback chains across multiple providers.

AI InfrastructurePDF · 28 pages

Agentic Workflow Orchestration Playbook

Temporal plus LangGraph reliability patterns. Retries, compensation, and durable state for long-running agent workflows.

Supply ChainPDF · 20 pages

Supply Chain Security Starter (SLSA Level 3)

Container signing with Cosign, SBOM generation with Syft, in-toto attestation, and Sigstore integration. Reach SLSA Level 3 in weeks.

Container SecurityPDF · 14 pages

Container Image Hardening Checklist

CIS benchmark applied in practice. Distroless base images, non-root users, capability dropping, and read-only filesystems.

Network SecurityPDF · 24 pages

Zero Trust Network Blueprint

Practical implementation, not a slide deck. Service mesh, mTLS, and identity-aware proxy wired together for real zero trust.

SecurityPDF · 18 pages

Secrets Rotation Playbook

Vault, AWS Secrets Manager, and GCP Secret Manager rotation schedules with automation scripts. No more 3am rotations.

SOC 2PDF · 12 pages

SOC 2 Type II Readiness Audit

Self-assessment for engineering teams preparing for their first SOC 2 audit. Maps controls to evidence and concrete tooling.

Platform EngineeringGitHub Template

Golden Path Template Repo

A complete monorepo with CI/CD, Terraform, Helm, and observability preconfigured. Fork it, rename it, ship in five minutes.

Platform EngineeringPDF · 26 pages

Backstage Implementation Guide

Internal developer portal setup, plugin ecosystem tour, and the anti-patterns that sink most Backstage rollouts.

Developer ExperienceGitHub Template

DevContainers Starter Kit

Reproducible local dev environments with VS Code devcontainers and Docker Compose. Onboard new engineers in minutes, not days.

TerraformZIP · GitHub repo

Production Terraform Module Library

Battle-tested Terraform modules for VPC, EKS, RDS, IAM, and VPN. Opinionated defaults, multi-env structure, and tfvars examples — drop-in for new AWS projects.

CI/CDPDF · 16 pages

GitOps Migration Guide

Step-by-step migration from imperative pipelines (Jenkins) to declarative GitOps with ArgoCD or Flux. Includes Terraform skeletons.

KubernetesPDF · 18 pages

Production Kubernetes Readiness Checklist

60 checks across security, observability, networking, autoscaling, and disaster recovery — the same list we run before signing off on a launch.

KubernetesPDF · 22 pages

Kubernetes Upgrade Runbook

Upgrade production Kubernetes clusters without downtime on EKS, AKS, and GKE. Pre-flight checks, rollback, and validation.

ObservabilityBlueprint · 24 pages

Observability Stack Blueprint

Reference architecture for Prometheus, Grafana, Loki, and Tempo. Includes dashboards, alert rules, SLO templates, and a cost sheet.

SREYAML Pack

SLO & Error Budget Template Library

Twenty ready-to-use SLO definitions for APIs, databases, queues, and ML services. Drop them into Datadog, Grafana, or Sloth.

SREPDF · 20 pages

On-Call Engineer Handbook

A no-BS handbook for running on-call rotations: escalation matrices, shadow shifts, postmortem template, and a burnout prevention checklist.

SREPDF · 14 pages

Incident Response Runbook Template

Battle-tested incident response runbook with severity definitions, escalation matrix, blameless postmortem template, and oncall rotation rules.

DatabasesPDF · 26 pages

Zero-Downtime Database Migration Playbook

Postgres logical replication, Aurora Blue/Green, pg_repack, and RDS blue/green walkthroughs. Cutover scripts included.

Cost OptimizationPDF · 22 pages

AWS Cost Reduction Playbook

Twelve patterns we use to cut cloud bills 30–60% without touching the application — Reserved Instances, Savings Plans, Spot, rightsizing.

Cloud MigrationPDF · 22 pages

Migrating off Heroku Playbook

Move from Heroku to Render, Railway, or EKS without downtime. DNS cutover, database migration, and CI/CD rewiring — end to end.

Cloud MigrationPDF + Excel

Cloud Migration Readiness Kit

Readiness checklist, TCO calculator in Excel, and decision tree for when NOT to migrate. Built for pre-migration assessments.

ScalingPDF · 24 pages

Startup to Series B Infrastructure Roadmap

What infra work matters at each funding stage. Pre-seed to Series B, what to build, what to defer, when to hire.

Team BuildingPDF · 18 pages

First DevOps Hire Handbook

30/60/90-day onboarding plan, hiring scorecard, and interview bank for your first SRE/DevOps engineer at Series A. Copy, fork, ship.

Engineering LeadershipPDF · 20 pages

VP Engineering DevOps Metrics Playbook

DORA plus SPACE, explained for leaders. What to measure, dashboards to build, and how to avoid vanity metrics.

ArchitectureNotion Template

Architecture Decision Records (ADR) System

Notion template for tracking architectural decisions, trade-offs, and reversibility — used by 100+ engineering teams.

need something specific

Looking for a playbook we don't have? Ask — we'll build it.

talk to an engineerFree 30-min discovery callBook
close