Hi, my name is

Endang.

I'm a DevOps Engineer|Site Reliability Engineer (SRE)|Cloud Engineer|Platform Engineer|Infrastructure Engineer |

A DevOps Engineer passionate about cloud infrastructure, baremetal servers, automation, containerization, Kubernetes, and CI/CD pipelines. I love building scalable systems and streamlining deployment processes.

in Infrastructure Engineering
50+
Servers Managed
20+
K8s Clusters
99.9%
Uptime Design
Endang Suwarna profile image

About Me

I break things on purpose so production doesn’t have to. With about 8 years in infrastructure engineering, I’ve learned that the best systems are the ones you don’t have to think about — because they just work.

My day-to-day lives in the space where Kubernetes meets bare metal, where CI/CD pipelines are living organisms, and where automation isn’t a luxury — it’s survival. I run a homelab that’s more production than most production environments, because if it breaks at 2 AM, I want it to be my problem, not a customer’s.

I’m equally comfortable debugging a kernel panic on a Proxmox host, tuning AWS EKS for cost efficiency, or explaining why “it works on my machine” isn’t a deployment strategy. Cloud-native, automation-first, and perpetually curious — that’s how I build.

What I Do

DevOps

Automating infrastructure, streamlining deployments, and bridging the gap between development and operations with modern CI/CD pipelines.

SRE

Ensuring system reliability through SLIs/SLOs, incident response, chaos engineering, and building resilient distributed systems.

Cloud Engineer

Architecting scalable cloud-native solutions on AWS and GCP, with a focus on cost optimization, security, and performance.

The Stack

Technologies I Work With
Container Orchestration

Building and managing containerized applications using Docker and container orchestration platforms. I create efficient, reproducible environments that streamline development and deployment.

  • Docker containerization and multi-stage builds
  • Container optimization and security best practices
  • Container registry management
  • Docker Compose for local development
Kubernetes & Orchestration

Designing and maintaining scalable Kubernetes clusters for production workloads. From bare-metal setups to managed services in cloud providers.

  • Kubernetes cluster setup on bare metal (k3s)
  • Cloud platforms (EKS and GKE)
  • Helm charts for application packaging
  • Custom Resource Definitions (CRDs)
  • Kubernetes Operators for automation
  • Multi-cluster management strategies
CI/CD Pipelines

Building automated pipelines that enable rapid, reliable deployments. From code commit to production, every step is automated and monitored.

  • GitHub Actions and GitLab CI workflows
  • Automated testing and linting
  • Zero-downtime deployment strategies
  • Infrastructure as Code integration
GitOps & Continuous Delivery

Implementing GitOps workflows for continuous deployment using Git as the single source of truth for infrastructure and applications.

  • ArgoCD and FluxCD for GitOps deployments
  • Managing Kubernetes applications through Git repositories
  • Implementing progressive delivery strategies
  • Automating synchronization between Git and cluster state
Cloud Infrastructure

Architecting and managing cloud infrastructure across AWS and GCP.

  • AWS services: EC2, EKS, S3, RDS, VPC
  • GCP services: GKE, Cloud SQL, Compute Engine
  • Best practices for cost optimization
  • Security and scalability implementation
Infrastructure as Code

Automating infrastructure provisioning and configuration management using modern IaC tools.

  • Terraform for provisioning infrastructure resources
  • VMs and Kubernetes clusters in AWS and GCP
  • Ansible for configuration management
  • Best practices for repeatability and scalability
Monitoring & Observability

Setting up comprehensive monitoring solutions to ensure system reliability and quick incident response.

  • Prometheus metrics collection
  • Grafana dashboards and alerts
  • Distributed tracing using Jaeger
  • Log aggregation (EFK, Loki)
  • Uptime monitoring
Infrastructure & Virtualization

Building and managing virtualized environments, Kubernetes clusters, and network security infrastructure.

  • Proxmox VE and OpenStack for virtualization
  • VM management
  • Kubernetes clusters on bare-metal infrastructure
  • Network security using pfSense firewall
Security & Compliance

Implementing security best practices throughout infrastructure lifecycle.

  • Container image scanning (Trivy)
  • Secrets management (HashiCorp Vault)
  • RBAC implementation
  • Compliance frameworks (ISO 27001, PCI DSS)
Fixed-Price · Self-Hosted · Open Source

Let's Fix Your Infrastructure

Fixed-price infrastructure services for startups — from a 48-hour health check to ongoing platform engineering.

Experience

Lead DevOps Engineer

@ Lyrid
Nov 2025 - Present
Full-time · Hybrid · Indonesia
  • Led a team of 3 DevOps engineers and 3 interns — established engineering standards, documentation frameworks, postmortem culture, and ticketing workflows (actor-action-reason methodology) from scratch
  • Drove product development direction for the Kubernetes platform — translating 8+ years of hands-on DevOps experience into platform roadmap decisions and feature prioritization
  • Managed 20 Kubernetes clusters across multi-region infrastructure spanning bare-metal (K3s + pfSense), OpenStack, Apache CloudStack (via Cluster API), and Proxmox — handling provisioning, migrations, and production support for 10+ tenants across ID, US, and Africa
  • Supported a European partner’s self-hosted CDN POC — assisted with Kubernetes cluster and ArgoCD setup across Indonesia, Europe, and US for edge content delivery
  • Conducted stakeholder and client meetings across Indonesia, Europe, and America — aligning platform development with business needs and providing production support
  • Collaborated with partners on hybrid infrastructure deployments, integrating bare-metal, virtualized, and cloud-native environments into a unified platform
Senior DevOps Engineer
May 2024 - Sept 2024
Full-time · Hybrid · Indonesia
  • Managed full lifecycle of client Kubernetes clusters — provisioning, upgrades, incident management, and application deployment support
  • Led migrations of client infrastructure from non-Kubernetes setups onto Kubernetes with minimal disruption
  • Built documentation and runbooks for DevOps team — standardized incident response procedures reducing resolution time

Part-time DevOps Engineer & Co-Lead Engineer

@ Duitin (PT Tjatra Yasa Indonesia)
Dec 2024 - Present
Part-time · Remote · Indonesia
  • Reduced monthly infrastructure cost by ~80% — migrated from Google Cloud App Engine to standard VPS after analysis showed average RPS was only 1-3 across 13 backend services and 1 frontend. Cost dropped from IDR 11-14M/month to ~IDR 2M/month, eliminating over-provisioned resources for a bootstrapping startup
  • Implemented open-source alternatives to cut operational costs — deployed Metabase for data visualization (giving management visibility into business data for decision-making) and Nextcloud as internal Google Drive alternative
  • Built Arus data pipeline project — synced production databases to data warehouse, enabling management to access data-driven insights. Mentored 1 data engineer intern on the project
  • Developed Anjungan platform — built a custom server management tool handling 9+ servers, replacing manual SSH key workflows with simple access, activity audit logs, and centralized control
  • Served as Co-Lead Engineer — contributed to engineering decisions beyond infrastructure, helping guide technical direction during the bootstrapping phase

Senior DevOps Engineer

@ Pintar Ventura Group
Oct 2024 - Oct 2025
Full-time · Hybrid · Jakarta, Indonesia
  • Managed AWS infrastructure (EKS, EC2, RDS, S3, Route53) supporting 7 Kubernetes clusters across dev, staging, and production — provisioned with Terraform/OpenTofu + Terragrunt
  • Reduced AWS costs by 30% through Reserved Instances and Spot Instance adoption for non-production node groups
  • Standardized GitLab CI/CD across teams with shared pipeline templates, reducing duplication and maintenance overhead
  • Led compliance efforts for ISO 27001:2022 and PJP1 (OJK) certification — ensuring infrastructure met regulatory security requirements
  • Set up full observability stack (Grafana, Prometheus, VictoriaMetrics, Uptime Kuma, Jaeger, OpenSearch) for centralized monitoring, tracing, and logging
  • Secured $9,000 in AWS credits through strategic vendor partnership engagement

Senior DevOps Engineer

@ Freighthub
Sep 2023 - Jan 2024
Contract · Remote · Indonesia
  • Reduced AWS costs by identifying and cleaning up orphaned/legacy resources accumulated by previous teams
  • Migrated CI/CD from Jenkins to GitHub Actions and set up self-hosted runners on AWS EC2
  • Mentored junior DevOps engineers and built comprehensive infrastructure documentation from scratch

Senior DevOps Engineer

@ Rukita
Feb 2021 - Jun 2023
Full-time · Hybrid · Jakarta, Indonesia
  • Migrated from single-server architecture to Docker Swarm — chose Swarm over Kubernetes as Django monolith + 3 frontends didn’t warrant K8s complexity
  • Migrated CI/CD from Bitbucket Pipelines to self-hosted Drone CI for greater flexibility and control
  • Introduced DevOps culture alongside the Principal Engineer — created documentation templates, standardized processes, and reformed meeting culture (silent meeting format reduced meetings from 1-2 hours to 15-30 minutes)
  • Managed 15+ servers with Ansible and assisted QA with Cypress test automation

DevOps Engineer

@ PINTU
May 2020 - Jan 2021
Full-time · Remote · Jakarta, Indonesia
  • Managed multi-cloud infrastructure (AWS production + GCP data team) — introduced Docker to replace manual EC2 deployments, enabling consistent environments across dev/staging/production
  • Reduced AWS costs by cleaning up orphaned resources and establishing tagging/lifecycle policies
  • Built monitoring with Grafana + Prometheus replacing default CloudWatch — improving incident response time and production visibility
  • Deployed Teleport for secure SSH/internal app access with full audit logging
  • Automated server management with Ansible and set up Consul for service discovery and distributed KV store
  • Researched and rolled out container orchestration — started with Nomad for developer onboarding, then migrated to Kubernetes as team maturity grew

DevOps Engineer

@ Kitabisa
Aug 2019 - Apr 2020
Full-time · Hybrid · Jakarta, Indonesia
  • Managed multi-cloud infrastructure (AWS + GCP) and migrated from AWS ECS to Google Kubernetes Engine (GKE)
  • Built internal CLI tool with Cookiecutter that auto-generated K8s manifests, CI/CD config, and app boilerplate — backend engineers ran a CLI command, filled a form (service name, domain, ingress), and deployed without waiting for infra team. Accelerated monolith-to-microservices migration
  • Supported data team with Tableau, Metabase, and Apache Airflow deployments
Jul 2018 - Nov 2018
Contract · On-site · Indonesia
  • Developed IoT applications for coal mining (GPS tracking, environmental monitoring) using LoRa radio, Node-RED, and MQTT

IoT & Infrastructure Engineer

@ Gravicode Multinovative Plexindo
Nov 2017 - Jun 2018
Part-time · Hybrid · Indonesia
  • Managed IoT infrastructure on Azure; developed embedded apps (STM32, Raspberry Pi) including bus gate system and Telkomsel data visualization (Windows Server, SQL Server, Power BI)

Projects

Anjungan
Go SvelteKit PostgreSQL 2FA
Anjungan
Internal Developer Platform (IDP) built with Go + SvelteKit. Features 2FA authentication, real-time audit logging, activity bookmarks, and service catalog management — providing centralized access to internal services and infrastructure.
OpsTerm
Python SSH AI Terminal
OpsTerm
A local AI terminal assistant that lives in your terminal. SSH into any server without losing AI access — because the AI runs on your local terminal, not on the remote server, giving you AI-powered command suggestions, analysis, and automation wherever you connect.
Container Cost
Docker Go SvelteKit Cost Analytics
Container Cost
Multi-VPS Docker container cost optimization dashboard. Agent/central server architecture that tracks container resource usage across servers, allocates costs by container, and provides actionable insights to reduce infrastructure spend.

Achievements

Top 10 InnOvate Telkomsel
Jun 2019
Smartband prototype for heart rate and fatigue detection in mining environments — built for operator safety monitoring in high-risk areas. Selected as Top 10 national finalist from Telkomsel incubation program.
2nd Runner Up DBS Indonesia Hackathon
Dec 2016
Goal-based savings feature for DBS digital banking app — designed auto-debit savings plans for Hajj, wedding, and other life goals with projected monthly contribution estimates. Placed 2nd Runner Up at DBS Indonesia Hackathon.

Get in Touch

My inbox is always open. Whether you have a question or just want to say hi, I’ll try my best to get back to you!