Senior Site Reliability Engineer
โจ AI Summary
About Deimos: Deimos is a pioneering Cloud-native Developer and Security Operations technology firm dedicated to assisting businesses in transitioning to the Cloud for enhanced client service. Our remote team, known as "Martians," embraces engineering excellence and cutting-edge technologies to create competitive solutions.
Role Overview: We are seeking a seasoned Senior Site Reliability Engineer to join our Professional Services team, focusing on Software and DevSecOps projects. You will work under a Site Reliability Engineering Manager with a talented team that innovates solutions for diverse clients across multiple public cloud platforms such as AWS, Azure, and GCP.
Main Responsibilities:
- Implement observability practices and create dashboards for tracking performance metrics.
- Develop frameworks and tools to empower product teams in making informed deployment decisions.
- Enhance disaster recovery strategies through automation.
- Drive a culture of continuous improvement in alerting and on-call practices.
- Lead the management of reliability tools using Infrastructure as Code (IaC) techniques.
Requirements:
- Bachelor's degree in Computer Science or related field.
- 5+ years of experience in SRE, DevOps, or Platform Engineering.
- Proficient in Python or similar languages for automation.
- Hands-on experience with AWS and Infrastructure as Code tools.
- Strong understanding of observability and reliability principles.
How to Apply: If you are a proactive individual eager to join a dynamic, remote team, please submit your application to be considered for this exciting opportunity.




