We are looking for a Principal DevOps / Platform Engineer to support and evolve the production infrastructure of an international, high-load fintech project.
The role focuses on developing the existing infrastructure, supporting platform scalability, and continuously improving engineering and security practices.
Key Responsibilities
• Design and evolve cloud infrastructure in Azure and GCP
• Architect and support production AKS clusters
• Manage Azure PostgreSQL Flexible Server and Azure Cache for Redis
• Configure and monitor BigQuery (GCP)
• Develop and maintain Terraform-based Infrastructure as Code
• Design secure network architecture (VNet, Private Endpoints, VPN, segmentation)
• Build and improve CI/CD pipelines (Azure Pipelines)
• Implement monitoring, logging, and alerting
Required Qualifications
• 5+ years of experience in DevOps / Platform Engineering
• Experience designing production infrastructure for high-load systems
• Strong Terraform expertise (modular architecture, environment isolation, safe infrastructure changes)
• Production experience with:
    • Azure (AKS, VNet, Private Endpoints, Key Vault, RBAC, Managed Identities)
    • Azure PostgreSQL Flexible Server
    • Azure Cache for Redis
• Practical knowledge of GCP, particularly BigQuery
• Strong Kubernetes and Helm experience
• Experience configuring VPN and secure connectivity
• Understanding of network security, segmentation, and zero-trust principles
• Experience building and maintaining CI/CD processes
• Experience with monitoring stacks (Prometheus, Grafana, Azure Monitor)
• Automation skills in Bash / Python / PowerShell
Security & Processes
• Experience implementing security best practices in production environments
• Solid understanding of DevSecOps principles
• Experience with IAM, RBAC, and secrets management
• Experience preparing infrastructure for internal and external audits
• Understanding of compliance requirements in fintech or regulated environments
• Experience conducting root cause analysis (RCA) and implementing corrective and preventive measures
• Experience establishing incident response and postmortem processes
• Understanding of SLO / SLA / error budget concepts
Nice to Have
• Cloudflare
• Azure Service Bus or other messaging systems
• Experience working in high-load or fintech environments
Employment is in accordance with the Labour Law of Belarus.