Ho Chi Minh City, Vietnam

Nguyen Ngoc Tu

aka Chrish

DevOps / Infrastructure Engineer

Cloud Infrastructure · Kubernetes Platforms · Terraform IaC · CI/CD · GitOps · Observability · Secure Cloud Operations

About Me

DevOps / Cloud Engineer with 5+ years of hands-on experience designing, automating, and operating cloud-native infrastructure and Kubernetes-based platforms across AWS and Azure. Strong background in Terraform-based Infrastructure as Code, CI/CD automation, GitOps delivery, Linux operations, observability, alerting, incident response, and secure cloud operations. Experienced working with infrastructure, security, product engineering, and customer-facing stakeholders to improve reliability, delivery speed, and operational maturity.

Get In Touch

Cloud Platforms

AWS & Azure

Experience

5+ Years

Focus

DevSecOps

Discipline

SRE / Reliability

Core Skills

Technologies and practices I work with across cloud infrastructure, platform engineering, and DevSecOps.

Cloud

AWS EC2S3LambdaCloudWatchAzure AKSAzure MonitorApplication InsightsKey VaultAzure Networking

Infrastructure as Code & Automation

TerraformAtlantisAnsibleBash / ShellPython

CI/CD

GitHub ActionsAzure PipelinesGitLab CI/CDRelease AutomationDeployment Gates

Kubernetes & GitOps

KubernetesDockerAzure AKSEKS-aligned OperationsArgo CDExternal Secrets

Linux & Security

Linux AdministrationIAM / RBACKubernetes SecuritySecrets ManagementTLS / VPNVulnerability Scanning

Observability

PrometheusGrafanaELK / OpenSearchLokiSplunkDynatraceAzure MonitorLog Analytics

Reliability

SLI / SLOAlerting StrategyIncident ResponseOn-call EscalationRCA / PostmortemRecurring-issue Prevention

Data & Messaging

KafkaMQTTNATSPulsarMongoDBPostgreSQLOpenSearchMilvus

Experience

Professional background in cloud infrastructure, DevSecOps, and platform engineering.

DevSecOps / Platform Engineer

Technology Services / Cloud Platform Engineering
Sep 2021 – Present
  • Designed and managed cloud-native infrastructure across AWS and Azure using Terraform, Atlantis, and Ansible.
  • Built and maintained CI/CD pipelines using GitHub Actions, GitLab CI/CD, and Azure Pipelines.
  • Implemented GitOps delivery workflows with Argo CD to improve release consistency and rollback readiness.
  • Operated Kubernetes workloads across Azure AKS and AWS-aligned environments.
  • Built Bash/Python automation to reduce manual operational work and improve environment validation.
  • Operated monitoring and logging stacks with Prometheus, Grafana, ELK/OpenSearch, Loki, Splunk, Dynatrace, Azure Monitor, and Application Insights.
  • Improved platform reliability through alerting, SLI/SLO practices, incident troubleshooting, RCA, and preventive actions.
  • Strengthened platform security using IAM/RBAC, Kubernetes security controls, Key Vault, External Secrets, TLS/VPN, vulnerability scanning, and CI/CD security checks.

Research Contributor, Cloud / DevOps / AI Systems

Independent / Part-time Research
Part-time
  • Designed CI/CD pipelines and Kubernetes deployment workflows for AI/ML inference systems.
  • Built automated DevSecOps and AI validation environments using Docker, Kubernetes, Python, and Bash.

Selected Projects

A sample of cloud-native platforms and infrastructure systems I've designed, built, and operated.

Project details are intentionally anonymized to respect confidentiality, security policies, and previous employer/client agreements.

Azure AKS / GenAI / Platform Engineering

AI-Powered Financial Document Extraction Platform

Production-grade microservices platform for financial document extraction using OCR/LLM models, AI agents, APIs, event-driven services, vector search, and supporting cloud services.

Key Contributions

  • Provisioned scalable Azure infrastructure with Terraform and GitHub Actions.
  • Built CI/CD and GitOps workflows for application and infrastructure delivery.
  • Operated AKS workloads with reliability, scalability, and resource efficiency in mind.
  • Implemented monitoring with Prometheus, Grafana, Azure Monitor, Application Insights, and Log Analytics.
  • Supported SLI/SLO and alerting strategy for business-critical services.
AWS / Kubernetes / Real-Time Telemetry

Network Device Management & Monitoring Platform

Distributed cloud-native platform for managing and monitoring network devices with real-time data processing.

Key Contributions

  • Designed and operated Kubernetes-based microservices on AWS.
  • Automated infrastructure with Terraform and Atlantis.
  • Delivered GitOps-based continuous deployment using GitLab CI/CD and Argo CD.
  • Implemented secure operations with External Secrets and TLS/VPN connectivity.
  • Improved issue detection through Prometheus/Grafana observability and incident response practices.
Azure AKS / OpenSearch / Observability

Cloud Analytics & Anomaly Detection Platform

Cloud-native analytics platform supporting large-scale data processing and AI-based anomaly detection.

Key Contributions

  • Developed and operated Azure-based workloads focused on availability, performance, and secure operations.
  • Standardized deployment workflows with Azure Pipelines.
  • Improved observability through Splunk and Dynatrace.
  • Integrated AI-based anomaly detection to support proactive monitoring and earlier incident identification.

Certifications & Education

Certifications

Cisco DevNet Associate

Cisco

Microsoft Certified: Azure Fundamentals

Microsoft

Education

Bachelor of Engineering in Computer Networks and Communications

University of Information Technology

Ho Chi Minh City, Vietnam

Get In Touch

Let's discuss DevOps, Cloud Infrastructure, Platform Engineering, or SRE opportunities. I'm open to new roles and collaborations.

Ho Chi Minh City, Vietnam