About the role
We're hiring a Senior SRE based in Latin America to work alongside our US-based engineering team, building out observability, on-call coverage, and deployment automation for a client with strict compliance requirements. We're specifically looking for someone with deep Azure expertise, see below.
This is a full-time role with real production ownership, not a support or ticket-queue position.
What You'll Do
- Support observability tooling implementation (Datadog and/or Azure Monitor/App Insights) and help build SLO definitions, alert rules, and synthetic checks
- Participate in a PagerDuty on-call rotation, including escalation handling and incident documentation
- Build and maintain operational runbooks for incident response, rollback, and recovery scenarios
- Contribute to deployment automation work (blue/green or canary patterns) and Infrastructure as Code
- Work across Azure SQL and Cosmos DB environments, supporting performance and cost optimization initiatives
- Collaborate closely with US-based engineers during overlapping working hours
Requirements
- 5+ years in SRE, DevOps, or cloud infrastructure roles
- Strong hands-on experience with Microsoft Azure (Azure SQL, Cosmos DB, Container Apps, App Service)
- Experience with observability tooling (Datadog, Azure Monitor, or similar) and on- call/incident response
- Familiarity with Infrastructure as Code (Terraform preferred)
- Strong written and spoken English; you'll be in daily communication with US-based team members and, at times, client stakeholders
- Availability with meaningful overlap with US Eastern or Mountain time zones
- Experience working in HIPAA-regulated environments, including handling PHI under a Business Associate Agreement (BAA) and working within least-privilege, audited access controls
- Willingness to complete a healthcare-industry-standard background check prior to production access
On-Call Expectations
- This role includes participation in a pager-based on-call rotation via PagerDuty, covering SEV- 1/SEV-2 incidents on a shared schedule with the SRE team. This is a core, required part of the role, not an occasional ask.
Benefits
- Work remotely
- Vacation: 10 business days a year
- Holidays: 5 National Holidays a year
- Company Holidays: 5 Company Holidays a year (Christmas Eve, Christmas Day, New Year's Eve, New Year's Day, Zipdev Day)
- Parental Leave
- Health Care Reimbursement
- Active Lifestyle Reimbursement
- Quarterly Home Office Reimbursement
- Payroll Deduction Purchase Plans
- Longevity Bonus
- Continuous Learning Bonus
- Access to Training and Professional Development Platforms
- Did we mention it's REMOTE?!!
One of our core values at Zipdev is "Be authentic." that's why we encourage you to answer the application form in your own words; we are interested in getting to know you, not a digital assistant.
Wondering how our remote environment or our payment method work? We've put together some helpful answers in our FAQs at the bottom our our career site. Take a look and let us know if you have any other questions!