Sergei Chukh

SRE / DevOps engineer

DevOps / SRE engineer based in Thailand; open to stable, long-term opportunities. Time zone GMT+7. Hands-on experience with Kubernetes, public clouds, CI/CD, and infrastructure as code.


Experience


Freelancer

DevOps Engineer / SRE / Developer

Working as a freelance engineer in a startup environment on a trading bot platform that connects to centralized
(Bybit, MEXC) and decentralized (Uniswap, Mira) exchanges via REST and WebSocket APIs.

  • Built Kubernetes infrastructure on GKE and Hetzner (Talos) for reproducible, reliable deployments
  • Automated infrastructure with Pulumi and Helm to reduce configuration errors and simplify updates
  • Created and maintain the open-source project pulumi-talos-cluster, streamlining installation and operations of Talos clusters
  • Stack: GCP (GKE), Talos, web3.js, VictoriaMetrics, TypeScript, Node.js, Pulumi, Hetzner
2024-12

AMarkets

DevOps Engineer / SRE

Online broker (Forex/CFD). SRE/DevOps responsible for platform reliability and operations, primarily around databases.

  • Diagnosed and resolved complex database and application performance issues (PostgreSQL, MySQL, Redis)
  • Introduced PostgreSQL auditing tools to identify missing indexes and query anti-patterns
  • Partnered with Go engineers on MetaTrader 5 services, improving reliability and reducing costs
  • Designed service interaction patterns and rollout strategies for the monolith-to-Kubernetes migration (Deckhouse)
  • Improved reliability and observability with Prometheus stack dashboards/alerts and Sentry error tracking
  • Stack: AWS, Ansible, Kubernetes (Deckhouse), Helm, werf, Golang, MetaTrader 5, Pulumi, PostgreSQL, Sentry, GitLab, MySQL, Redis, Prometheus stack
2023-06 - 2024-12

Gaijin Entertainment

DevOps Engineer / SRE

Maintained and developed infrastructure for games such as War Thunder and Enlisted.

  • Ensured uptime of high-load game backends, handled troubleshooting, and took part in on-call rotations
  • Built custom service discovery using native etcd mechanisms
  • Documented the entire server-side distribution stack for game clients as a docker-compose setup, containerizing numerous C++ and Go services
  • Stack: Linux (CentOS 7), Redis, AWS (RDS, EC2, VPC, Route53), Graphite + Telegraf, Kubernetes on bare metal, Docker as build environment, Golang, Puppet, Ansible, Terraform, Jenkins, Gerrit
2022-04 - 2023-04

Yandex

Support L2 Specialist

Served on the second-line support team for Yandex Cloud.

  • Handled customer support tickets
  • Troubleshot cloud-side services (MDB, Compute, Kubernetes)
2021-12 - 2022-03

Rostelecom IT

DevOps Engineer / SRE

Supported and developed the WINK project (formerly Rostelecom Interactive TV).

  • Updated, configured, and troubleshot product software (Wink API services)
  • Set up monitoring for servers and services (Zabbix, Prometheus, VictoriaMetrics), participated in on-call duty, and responded to incidents
  • Administered PostgreSQL databases (backups, replication, performance assessments)
  • Automated tasks and CI/CD workflows (GitLab CI)
  • Supported the product team by building and providing internal tools as a service
  • Handled non-technical work: training the duty shift on the new platform, working with business analysts, planning and ordering new hardware for the platform
  • Stack: Linux (CentOS 7), PostgreSQL (+PgBouncer), NSQ, ClickHouse, Nginx, Reindexer (https://github.com/Restream/reindexer), Prometheus Stack (+VictoriaMetrics), Opscode Chef, GitLab, Golang
2019-05 - 2021-10

Equilibrium

DevOps Engineer

Helped build a crypto fintech product in a small team of enthusiasts.

  • Maintained public blockchain nodes (Parity for ETH, nodeos for EOS)
  • Introduced a feature-branch development approach using Docker Swarm on the staging environment to meet team needs
  • Stack: Linux (Ubuntu), AWS (RDS, EC2, VPC, Route53), Docker Swarm, Prometheus Stack, HashiCorp Vault, TeamCity
2018-10 - 2019-05

Restream

Technical Support Engineer

Rostelecom's "Interactive TV" project. Worked in the monitoring department.

  • Processed requests in the ticketing system (Jira)
  • Performed initial OS setup (CentOS) on servers and handled inventory via IaC (Puppet)
2016-02 - 2018-10

Education

UGTU

2009-09-01 - 2014-05-01
Completed
Development and Operation of Oil and Gas Fields

Skills

  • Kubernetes: K8S, Helm
  • IaC: Pulumi, Terraform, Ansible, Chef
  • Linux: Debian, Talos, RHEL
  • Cloud: AWS, GCP, Hetzner Cloud
  • Databases: PostgreSQL, MySQL, MongoDB
  • Languages: Golang
  • Workload Orchestrator: Kubernetes, Docker Swarm
  • CI/CD: GitLab CI, TeamCity, GitHub Actions
  • Monitoring: Prometheus, Grafana, Zabbix, Telegraf
  • Logging: ELK, Loki, Datadog
  • Web Servers: Nginx, Traefik

Certifications