Principal Site Reliability Engineer

Company Info
Myriad Genetics, Inc
United States

Web Site:

Company Profile


Principal Site Reliability Engineer


Salt Lake City, UT 

Job ID:


Job Description:

Location: United States
Job Identification: 2344
Job Schedule: Full time

Principal Site Reliability Engineer (Remote)

Myriad Genetics is looking for a principal Site Reliability Engineer to use their skills and experience in software and systems to build and maintain scalable, reliable, and secure services for internal and external users. We will need you to approach the problem of running production systems from a software and engineering perspective with a focus on testing, automation, and managed change. You will provide tools and expertise to other Myriad technology groups to support their engineering efforts.

A Site Reliability Engineer at Myriad Genetics incorporates aspects of software engineering and applies them to infrastructure and operations problems alongside our software developers and IT operations staff. The main goals are to create ultra-scalable and highly reliable software systems.

Our SRE team builds tooling and enables teams to deploy, monitor, and maintain their own production environments.

  • Participate in the architecture of cloud/containerization infrastructure for the enterprise
  • Facilitate the migration of applications to cloud/containers and on boarding of development teams onto those technology platforms
  • Mentor a team of DevOps Engineers to improve their skill set and introduce new technologies
  • Design and develop software build automation based on defined process and procedure
  • Work with Development Teams and Architects to provide technical support on systems architecture, performance, capacity planning, deployments, environment configuration and monitoring
  • Aid in maintaining highly available and stable production systems by implementing monitoring and standardization of configurations
  • Address production related issues and work with developers to correct systematic issues
  • Spearhead the testing and evaluation of new technologies to increase the DevOps team's performance and application reliability
  • Be on call for critical outages in a scheduled rotation.
  • Manage overall health of software in production measured by uptime, performance metrics, and quality of service delivery
  • Operate and maintain container orchestration infrastructure in AWS
  • Establish and maintain proactive monitoring and alerting for container orchestration infrastructure and containerized applications
  • Establish effective working relationships between IT Operations and Development teams


Required Skills and Experience
  • Experience in a technical field including SRE, DevOps, software development, or systems administration: 8-10 years
  • Experience with algorithms, data structures, complexity analysis and software design
  • Experience with Python (5+ years)
  • Experience with containerization (Docker / Kubernetes / Openshift / etc) (3 years)
  • Experience with Amazon AWS (5 years)
  • Experience with at least on CI/CD technology, e.g. Tekton, ArgoCD, Jenkins, GoCD, TeamCity, etc
  • Experience with monitoring and metrics tools: DataDog, New Relic, Prometheus/Grafana, etc
  • Strong communication skills
  • A commitment to self-directed learning
  • Excellent time management, scheduling, and organizational skills
  • Ability to manage multiple tasks in a fast-paced environment
  • Ability to work effectively under tight timelines and schedules
  • Ability to work independently and as a contributing team member
  • Ability to sense the importance or impact of issues and situations and take appropriate actions
  • Must be flexible, innovative, and self-motivated
  • Must have the flexibility to work extra hours to meet corporate and departmental goals
  • Strong communication, interpersonal and organizational skills

Preferred Qualifications and Experience
  • BS degree in Computer Science or related technical field
  • Experience with other programming languages, including .NET, Java, Javascript, etc (5+ years)
  • Scripting languages: Groovy, Bash/Golang/Python/Perl etc.
  • Continuous integration/continuous delivery tools: GoCD, Jenkins
  • Experience with designing, analyzing and troubleshooting distributed systems
  • Database management: SQL, Oracle/Postgres, Liquibase/Flyway, etc.
  • Atlassian Suite administration
  • Logging: Splunk/SumoLogic
  • Asychronous message brokers: Rabbit MQ/Kafka
  • Linux/Unix/Windows OS: Ubuntu, RHEL, CentOS, Alpine, etc
  • Configuration management tools: SaltStack/Ansible
  • Cloud virtualization/infrastructure provisioning: Terraform, CloudFormation
  • Familiarity with architectural patterns like microservices, REST, DDD, Enterprise Integration Patterns a plus

  • Myriad Neuroscience:
  • Myriad Oncology:
  • Myriad Urology:
  • Myriad Women's Health:
  • For more information on how Myriad is making a difference, please visit the Company's website:
  • For other great information, you can visit us on Linkedin at:
  • Myriad Genetics: Health Illuminated



Same Posting Description for Internal and External Candidates


Apply Here