AI Infrastructure & SRE Engineer focused on GPU compute infrastructure, and Kubernetes-based AI model serving.
I write about what I learn at ezgitastan.systems
GPU & AI Infrastructure NVIDIA GPU Operator · MIG slicing · NFD · DCGM · NCCL · vLLM · H200 / H100 / B200 / A100
Orchestration & Platform OpenShift (ROSA) · Kubernetes · Helm · Kustomize · ArgoCD
Cloud & IaC AWS · GCP · Terraform · Terragrunt · Packer · Vagrant
Observability & Reliability Datadog · Sentry · Prometheus · Langfuse · K6 (load testing) · PagerDuty
Languages Go, Bash



