Site Reliability Engineer

Dallas, TX

Full Time

Mid Level

Job Summary
The Site Reliability Engineer (SRE) will ensure the high availability and performance of OVHcloud Bare Metal products and services, and will handle their monitoring and incident response. This role involves supporting the reliability, configuration, and deployment of existing and new products and services. The SRE will investigate and debug errors, contribute to software development for service improvement, and automate tasks using scripts and tooling.

Essential Duties & Responsibilities

Manage and maintain essential OVHcloud infrastructures, products, and services.
Diagnose errors with a data-driven approach, analyzing the data for resolution.
Create knowledge base articles and instructional guides as needed.
Develop scripts and tooling to automate tasks.
Participate in building, deploying, and/or troubleshooting microservices software applications and other underlying APIs.
Monitor alerting systems and submit configuration changes on a regular basis to ensure availability of systems and services.
Install, deploy, and configure OVHcloud infrastructure as new capabilities are developed.
Analyze data and develop meaningful automated reports to be used by technical and business leaders.
Write well-documented root cause analyses with recommended official documentation to prevent future critical issues.
Participate in User Acceptance Testing (UAT) for new product launches.
Collaborate in on-call rotations, including weekends.

Minimum Requirements

2+ years of relevant experience in an SRE, DevOps, programming, or similar position is required.
1+ years of experience performing system administration of Linux/Unix and Windows operating systems is required.
Experience performing day-to-day Operational (SRE/DevOps) tasks and working with microservices and multiple APIs.
Experience with languages such as Perl, Python, Bash, or Go.
Experience managing a distributed, highly available, high-traffic infrastructure based on Linux is preferred.
Experience with maintenance/configuration of monitoring, metrics, and logging infrastructures like Nagios, Grafana, Graylog.
Experience with open-source configuration management tools, such as Puppet or Ansible, is preferred.
Well-versed in cloud technologies and terminology.
Experience with virtualization and container technology.
Ability to prioritize, organize, and execute on competing priorities; the ability to reprioritize based on company need is critical.
Bachelor’s degree in computer science or a related field preferred, or equivalent experience in lieu of a degree.

Working Conditions

Standard office environment

Company Description – About OVHcloud

OVHcloud US is a subsidiary of OVHcloud, a global cloud provider that specializes in delivering industry-leading performance and cost-effective solutions to better manage, secure, and scale data. OVHcloud US delivers bare metal servers, hosted private cloud, hybrid and public cloud solutions. OVHcloud manages 40 data centers across 12 sites on four continents, manufacturing its own servers, building its own data centers and deploying its own fiber-optic global network to achieve maximum efficiency. Through the OVHcloud spirit of challenging the status quo, the company brings freedom, security and innovation to solve data challenges – today and tomorrow. With a 21-year heritage, OVHcloud is committed to developing responsible technology and strives to be the driving force behind the next cloud evolution. https://us.ovhcloud.com.

EEO Statement

OVHcloud is committed to providing equal employment opportunities to all employees and applicants without regard to race, ethnicity, religion, color, sex (including childbirth, breast feeding, and related medical conditions), gender identity or expression, sexual orientation, national origin, ancestry, citizenship status, uniform service member and veteran status, marital status, pregnancy, age, protected medical condition, genetic information, disability, or any other protected status in accordance with all applicable federal, state and local laws.
