See all the jobs at MoEngage Inc here:
Site Reliability Engineer - II , FinOps Specialist
| Full-time
, ,About MoEngage:
MoEngage is an insights-led customer engagement platform trusted by 1,350+ global consumer brands, including McAfee, Flipkart, Domino’s, Nestle, Deutsche Telekom, and OYO. MoEngage combines data from multiple sources to help brands gain a 360-degree view of their customers.
MoEngage is an insights-led customer engagement platform trusted by 1,350+ global consumer brands, including McAfee, Flipkart, Domino’s, Nestle, Deutsche Telekom, and OYO. MoEngage combines data from multiple sources to help brands gain a 360-degree view of their customers.
MoEngage Analytics arms marketers and product owners with insights into customer behavior. Brands can leverage MoEngage Personalize to orchestrate journeys and build 1:1 conversations across the website, mobile, email, social, and messaging channels. MoEngage Inform, the transactional messaging infrastructure, helps unify promotional and transactional communication to a single platform for better insights and lower costs. MoEngage’s AI Suite helps marketers develop winning copies and creatives, optimize campaigns and channels that boost engagement, and help with faster execution.
For over a decade, consumer brands in 60+ countries have been using MoEngage to power digital experiences for over a billion monthly customers. With offices in 15 countries, MoEngage is backed by Goldman Sachs Asset Management, B Capital, Steadview Capital, Multiples Private Equity, Eight Roads, F-Prime Capital, Matrix Partners, Ventureast, and Helion Ventures.
MoEngage was named a Contender in The Forrester Wave™: Real-Time Interaction Management, Q1 2024 report, and Strong Performer in The Forrester Wave™ 2023 report. MoEngage was also featured as a Leader in the IDC MarketScape: Worldwide Omni-Channel Marketing Platforms for B2C Enterprises 2023.
About Role:
The Opportunity: Site Reliability Engineer (SRE-2)
Are you an SRE with a few years under your belt, itching to take on more significant challenges and drive impactful reliability initiatives? Do you have a solid grasp of cloud platforms and container orchestration, and a burning desire to automate everything in sight? As an SRE-2 at MoEngage, you'll be a critical member of our SRE team, responsible for the health and performance of key services and contributing directly to the evolution of our infrastructure at a scale that few engineers get to experience. This is your chance to deepen your technical expertise, take on more ownership, and mentor emerging talent while working on a platform that operates at the cutting edge.
The Opportunity: Site Reliability Engineer (SRE-2)
Are you an SRE with a few years under your belt, itching to take on more significant challenges and drive impactful reliability initiatives? Do you have a solid grasp of cloud platforms and container orchestration, and a burning desire to automate everything in sight? As an SRE-2 at MoEngage, you'll be a critical member of our SRE team, responsible for the health and performance of key services and contributing directly to the evolution of our infrastructure at a scale that few engineers get to experience. This is your chance to deepen your technical expertise, take on more ownership, and mentor emerging talent while working on a platform that operates at the cutting edge.
Roles and Responsibilities:
- Be a Reliability Champion: Take ownership of the reliability, performance, and efficiency of critical services.
- Automate, Automate, Automate: Design, develop, and implement robust automation solutions to eliminate toil, streamline operations, and improve system resilience.
- Battle Incidents (and Win): Lead troubleshooting efforts for complex production incidents, perform in-depth root cause analysis, and implement sustainable preventative measures.
- Sculpt Our Infrastructure: Actively contribute to the design, implementation, and optimization of our cloud infrastructure on AWS and GCP, leveraging your expertise in technologies like Kubernetes.
- Enhance Observability: Implement and refine advanced monitoring, alerting, and logging solutions to gain deep insights into system behavior and predict potential issues.
- Collaborate for Success: Partner closely with development teams to influence architectural decisions, ensuring reliability, scalability, and security are built in from the start.
- Strengthen Our Security Posture: Implement and advocate for advanced security practices within our infrastructure and operational workflows.
- Drive Efficiency: Analyze and optimize cloud infrastructure spend, identifying and implementing cost-saving opportunities.
- Guide the Next Wave: Mentor and guide SRE-1 engineers, contributing to the growth and knowledge sharing within the team.
- Be Ready for Action: Participate in our on-call rotation, acting as a key point of escalation and resolution for critical issues.
Requirements:
- 3-5 years of hands-on experience in Site Reliability Engineering, DevOps, or a similar role with a strong focus on production systems.
- Demonstrated expertise in Python or Go – you have a proven track record of automating complex tasks.
- Strong command of AWS and/or GCP cloud platforms.
- In-depth experience with containerization and orchestration using Kubernetes (K8s, ArgoCD, Helm/Kustomize).
- Experience with infrastructure as code tools like Terraform or Ansible is highly valued.
- Solid understanding and experience with monitoring and observability stacks (VictoriaMetrics, Prometheus, Grafana, ELK stack, etc.).
- Deep knowledge of Linux/Unix systems internals and advanced networking concepts.
- Proven ability to diagnose and resolve complex issues in large-scale distributed systems.
- A strong understanding of Cloud Security and Information Security principles and best practices.
- Experience with cloud cost analysis and optimization techniques.
- Familiarity with CI/CD pipelines and GitOps methodologies.
- Experience with messaging queues and distributed systems (Celery, Kafka) is a plus.
- Excellent communication, collaboration, and problem-solving skills.
- A desire to mentor and lead by example.
At MoEngage, we respect and value differences. We believe that when people from diverse backgrounds and perspectives collaborate, we create the most value – for our clients, our employees, and society. We embrace diversity and uphold a strong set of values. We are committed to inclusivity and take pride in providing equal opportunities for success and growth.
Employment at MoEngage is based solely on professional competence, skills, and experience. We stand firmly against all forms of discrimination and support equal rights and opportunities regardless of gender, ethnicity, abilities, age, identity, orientation or expression, marital status (including pregnancy), religion and beliefs, or any other status protected by law.
It is our policy to comply with all applicable national, state, and local laws related to non-discrimination and equal opportunity. MoEngage is truly a place where everyone can bring their passions, authentic selves, and talents to work, collaborating to drive progress and solve meaningful challenges.
Why Join Us!
At MoEngage, we are passionate about our team and technology - see below to know more about us.
Life@MoEngage
Tech@MoEngage
Scale @MoEngage
We handle more than a billion messages every day. Rest assured, you will be surrounded by really smart and passionate people as we scale much more to build a world-class technology team.