
Senior Platform Reliability Engineer
- Quezon City, Metro Manila
- Permanent
- Full-time
- Collaborating with other teams to provide application monitoring strategy.
- Building and setting up monitoring (Synthetics, APM, Infrastructure) within our Monitoring Platform.
- Designing and building new application monitors, dashboards, and creative solutions.
- Creating automation solutions using scripting languages and APIs.
- Using problem-solving skills in concert with tooling to help uncover application issues and perform root cause analysis.
- Creating visibility into our wide range of data and helping our partners understand it.
- Designing, supporting, and enhancing our current toolset.
- Promoting observability and reliability engineering within our team and beyond.
- Handling communications with our allies in the business to ensure information is comprehensible to both technical and non-technical members.
- Maintaining positive relationships with many internal teams.
- Adapting to evolving priorities.
- Bachelor's degree in any IT-related course.
- 8 to 10+ years working in a senior technical role in IT.
- Designs, builds, and maintains the monitoring technology platforms.
- Coding experience with PowerShell, Node.js, or JavaScript.
- Scripting experience with languages such as PowerShell or Python.
- Experience with monitoring tools: New Relic, Dynatrace, CA APM, PRTG.
- Amenable to working at UP Ayala Technohub (Quezon City).
- Amenable to working in a hybrid setup (3x a week onsite).
- Amenable to working day and night shift schedules (the team currently supports global services).
- Data-driven and passionate about using data to advance our effectiveness and enable superb customer experiences.
- Dedicated and able to work independently with a continuous learning approach.
- A focus on creating excellent customer experiences.
- Innovation, adaptability and curiosity.
- Clear, concise, tactful communication skills.
- A sense of humor because our team likes to have fun!
- Vendor management skills.
- Ability to adapt to change and overcome barriers.
- Knowledgeable in the following tools and concepts:
- Terraform, Jenkins, Git, automation tools.
- Devo, Azure, Docker, Kubernetes.
- Experience mentoring others, with leadership skills.
- Familiarity with AI operations (AIOps) tooling such as Moogsoft, Copilot, and agentic AI.
- Understanding of observability and reliability engineering and their benefits.
- An understanding of the financial services or insurance industry, and/or knowledge of large enterprises.
- Any ITIL™ Foundations or Practitioner-level certifications (especially Support Operations).
- We’ll empower you to learn and grow the career you want.
- We’ll recognize and support you in a flexible environment where well-being and inclusion are more than just words.
- As part of our global team, we’ll support you in shaping the future you want to see.