Senior Site Reliability Engineer

Maya Philippines

  • Mandaluyong City, Metro Manila
  • Permanent
  • Full-time
  • 4 hours ago
NATURE OF WORK
  • Lead architectural design and implementation of fault-tolerant, self-healing infrastructure across cloud and hybrid environments
  • Drive organization-wide automation initiatives, eliminating manual operations through advanced IaC and CI/CD frameworks
  • Own technical program leadership for reliability initiatives spanning multiple teams and services
  • Manage OPEX and CAPEX budgets strategically, with accountability for cost optimization
  • Apply deep expertise in compliance frameworks (CIS, PCI-DSS, BSP) to architect compliant solutions
  • Establish and enforce cloud governance policies, account structures, and organizational standards across AWS/Azure/GCP environments
REQUIRED QUALIFICATIONS
  • Expert-level proficiency in Kubernetes (CRDs, Operators, multi-tenancy, advanced scheduling)
  • Advanced Terraform expertise (custom providers, module design, automated testing)
  • Deep Service Mesh knowledge (Istio traffic management, circuit breaking, rate limiting, mTLS)
  • Proven experience building Internal Developer Platforms (IDP) with self-service workflows
  • Advanced GitLab CI/CD and GitOps implementation (ArgoCD/FluxCD, multi-project pipelines)
  • Expert-level WAF, API Gateway (Kong, Apigee, AWS APIGW), and network security implementation
  • Strong software development skills in Go, Python, or Java with ability to review code for reliability impact
  • Experience leading technical programs and cross-functional reliability initiatives
  • Deep understanding of observability platforms (Dynatrace, Prometheus, OpenTelemetry) with custom integration experience
  • Proven track record architecting microservices with high-availability and resiliency patterns
  • Experience implementing AWS Organizations, Control Tower, Service Control Policies, and multi-account governance frameworks
  • Proficiency in cloud policy-as-code tools (AWS Config, OPA, Sentinel) and compliance automation
  • Knowledge of cloud security standards (CIS Benchmarks, AWS Well-Architected Framework, Azure/GCP best practices)
  • Advanced expertise in Dynatrace, Datadog, or Grafana for building enterprise observability solutions
  • Experience implementing SLO-based alerting, error budgets, and burn rate monitoring using Prometheus, Grafana, or commercial APM tools
  • Proficiency in distributed tracing (Jaeger, Zipkin, OpenTelemetry) and log aggregation (ELK, Loki)
  • Ability to design custom metrics, synthetic monitoring, and real user monitoring (RUM) strategies
ABOUT US
Maya is the all-in-one money platform that is bringing Filipinos bolder ways to master their money. It is powered by a unique integrated financial services ecosystem that addresses the ever-evolving needs of today’s generation of money makers through cutting-edge technology.

We lead millions of Filipinos (consumers, businesses, communities, and government agencies alike) into a version of the current digital economy that’s more inclusive, transparent, and empowering than ever.

We are powered by the country's only end-to-end digital payments company, Maya Philippines, Inc., and by Maya Bank, Inc. for digital banking services.

Maya Bank, Inc. and Maya Philippines, Inc. are regulated by the Bangko Sentral ng Pilipinas.
