Site Reliability Engineer
Job Summary
The Site Reliability Engineer (SRE) will ensure the high availability, performance, monitoring, and incident response for OVHcloud Bare Metal products and Services. This role involves supporting the reliability, configuration, and deployment of existing and new products and services. The SRE will investigate and debug errors, contribute to software development for service improvement, and automate tasks using scripts and tooling.
Essential Duties & Responsibilities
- Manage and maintain essential OVHcloud infrastructures, products, and services.
- Diagnose errors with a data-driven approach, analyzing the data for resolution.
- Create knowledge-based articles and instructional guides as needed.
- Develop scripts and tooling to automate tasks.
- Participate in building, deploying, and/or troubleshooting microservices software applications and other underlying APIs.
- Monitor alerting systems and submit configuration changes on a regular basis to ensure availability of systems and services.
- Install, deploy, and configure OVHcloud infrastructure as new capabilities are developed.
- Analyze data and develop meaningful automated reports to be used by technical and business leaders.
- Write well-documented root cause analyses with recommend official documentation to prevent future critical issues.
- Participate in User Acceptance Testing (UAT) for new product launches
- Collaborate in on-call rotations, including weekends.
Minimum Requirements
- 2+ years of relevant experience in an SRE, DevOps, programming, or similar position is required.
- 1+ years of experience performing system administration of Linux/Unix and Windows operating systems is required.
- Experience performing day-to-day Operational (SRE/DevOps) tasks and working with microservices and multiple APIs.
- Experience with languages such as Perl, Python, Bash, Go, etc.
- Experience managing a distributed, highly available, high-traffic infrastructure based on Linux is preferred.
- Experience with maintenance/configuration of monitoring, metrics, and logging infrastructures like Nagios, Grafana, Graylog.
- Experience with open-source configuration management tools, such as Puppet, Ansible, etc. preferred.
- Well-versed in cloud technologies and terminology.
- Experience with virtualization and container technology.
- Ability to prioritize, organize, and execute on competing priorities; ability to reprioritize based on company need, is critical.
- Bachelor’s degree in computer science or a related field preferred; or equivalent experience in lieu of degree.
Working Conditions
Standard office environment
Company Description – About OVHcloud
OVHcloud US is a subsidiary of OVHcloud, a global cloud provider that specializes in delivering industry-leading performance and cost-effective solutions to better manage, secure, and scale data. OVHcloud US delivers bare metal servers, hosted private cloud, hybrid and public cloud solutions. OVHcloud manages 40 data centers across 12 sites on four continents, manufacturing its own servers, building its own data centers and deploying its own fiber-optic global network to achieve maximum efficiency. Through the OVHcloud spirit of challenging the status quo, the company brings freedom, security and innovation to solve data challenges – today and tomorrow. With a 21-year heritage, OVHcloud is committed to developing responsible technology and strives to be the driving force behind the next cloud evolution. https://us.ovhcloud.com.
EEO Statement
OVHcloud is committed to providing equal employment opportunities to all employees and applicants without regard to race, ethnicity, religion, color, sex (including childbirth, breast feeding, and related medical conditions), gender identity or expression, sexual orientation, national origin, ancestry, citizenship status, uniform service member and veteran status, marital status, pregnancy, age, protected medical condition, genetic information, disability, or any other protected status in accordance with all applicable federal, state and local laws.