Site Reliability Engineer
Site Reliability Engineers primarily work closely with Medic’s software development and product teams to ensure high-quality deployments of mobile health software and hardware.
What you’ll do in your day-to-day:
Proactive Monitoring and Team Support: 60%
Monitor all notification channels (Slack, GH, SRE Task board, CHT forums, partner outreach), ensure timely response and escalate as necessary. Be available (subject to time zones of team members) to respond to, troubleshoot, remediate, and document expected or unexpected outages, incidents, or problems in production.
Proactively monitor performance and reliability of production Medic systems.
Produce status pages consumable by non-technical users.
Consult on technical needs for larger-scale deployments, including local hosting, scalability, etc.
Provide remote troubleshooting support to active deployments as needed.
Prioritize urgent troubleshooting problems in live instances.
Identify possible production problems by checking through or reviewing the issues that have been reported.
Follow up and investigate questions asked on Slack channels and the CHT forum.
Keeping in contact with Core Devs and QA teams.
Provide technical information, explain processes, clarify interactions when requested and ensure proper documentation.
System image engineering and deployment support: 40%
Manage upgrades and upgrade processes on production instances.
Automate deployments to increase testability and reliability.
Automate deployment monitoring and alerting
Here’s what we use to make all that happen:
Good understanding of DevOps concepts and best practices
Detective Skills: Terrific at troubleshooting and debugging.
Checklists and documentation for processes and systems.
Cloud infrastructure (AWS but experience with others such as Azure and GCP is also valid).
Docker – Kubernetes.
Linux system administration, monitoring, security best practices, networking, and logging.
HTTP, SSL/TLS, REST, HTML, DNS.
Purpose & Impact
Ability to create and see real impact in your work.
Freedom to take initiative and innovate, bonus of an agile, small team.
Work for a globally awarded social enterprise recognized for developing a solution that can create global systems change in the health sector.
Work with a value and mission-driven team that is consistently described as warm, incredibly kind, and supportive.
Exposure to a diverse team: over 15 different nationalities.
Opportunities for global travel: all team meet-up + functional team meet-up + field visits.
Work/Life & Growth
Generous leave time: vacation, maternity/paternity, bereavement, & sick days.
Professional development funds & opportunities + 5 days off for professional development.
Home Office set up stipend
Flexible, remote schedules.
MODE OF APPLICATION
Click “APPLY FOR JOB” button above to apply for this job.
Interested and qualified candidates should forward their CV to: using the position as subject of email.
About Medic Mobile
Medic Mobile is a nonprofit organization on a mission to improve health in the hardest-to-reach communities. We build world-class, open-source software that supports health workers delivering equitable care that reaches everyone. Medic Mobile serves as the technical lead and a core contributor to the Community Health Toolkit, which helps health workers ensure safe deliveries, track outbreaks faster, treat illnesses door-to-door, keep stock of essential medicines, communicate about emergencies, and more.