Principal Site Reliability Engineer

Company Info

Myriad Genetics, Inc
United States

Phone:
Web Site:

Company Profile

col-narrow

Title:

Location:

Salt Lake City, UT

Job ID:

72730

col-wide

Job Description:

Location: United States
Job Identification: 2344
Job Schedule: Full time

Principal Site Reliability Engineer (Remote)

Myriad Genetics is looking for a principal Site Reliability Engineer to use their skills and experience in software and systems to build and maintain scalable, reliable, and secure services for internal and external users. We will need you to approach the problem of running production systems from a software and engineering perspective with a focus on testing, automation, and managed change. You will provide tools and expertise to other Myriad technology groups to support their engineering efforts.

A Site Reliability Engineer at Myriad Genetics incorporates aspects of software engineering and applies them to infrastructure and operations problems alongside our software developers and IT operations staff. The main goals are to create ultra-scalable and highly reliable software systems.

Our SRE team builds tooling and enables teams to deploy, monitor, and maintain their own production environments.

WHAT YOU WILL DO

Participate in the architecture of cloud/containerization infrastructure for the enterprise
Facilitate the migration of applications to cloud/containers and on boarding of development teams onto those technology platforms
Mentor a team of DevOps Engineers to improve their skill set and introduce new technologies
Design and develop software build automation based on defined process and procedure
Work with Development Teams and Architects to provide technical support on systems architecture, performance, capacity planning, deployments, environment configuration and monitoring
Aid in maintaining highly available and stable production systems by implementing monitoring and standardization of configurations
Address production related issues and work with developers to correct systematic issues
Spearhead the testing and evaluation of new technologies to increase the DevOps team's performance and application reliability
Be on call for critical outages in a scheduled rotation.
Manage overall health of software in production measured by uptime, performance metrics, and quality of service delivery
Operate and maintain container orchestration infrastructure in AWS
Establish and maintain proactive monitoring and alerting for container orchestration infrastructure and containerized applications
Establish effective working relationships between IT Operations and Development teams

ABOUT YOU

Required Skills and Experience

Experience in a technical field including SRE, DevOps, software development, or systems administration: 8-10 years
Experience with algorithms, data structures, complexity analysis and software design
Experience with Python (5+ years)
Experience with containerization (Docker / Kubernetes / Openshift / etc) (3 years)
Experience with Amazon AWS (5 years)
Experience with at least on CI/CD technology, e.g. Tekton, ArgoCD, Jenkins, GoCD, TeamCity, etc
Experience with monitoring and metrics tools: DataDog, New Relic, Prometheus/Grafana, etc
Strong communication skills
A commitment to self-directed learning
Excellent time management, scheduling, and organizational skills
Ability to manage multiple tasks in a fast-paced environment
Ability to work effectively under tight timelines and schedules
Ability to work independently and as a contributing team member
Ability to sense the importance or impact of issues and situations and take appropriate actions
Must be flexible, innovative, and self-motivated
Must have the flexibility to work extra hours to meet corporate and departmental goals
Strong communication, interpersonal and organizational skills

Preferred Qualifications and Experience

BS degree in Computer Science or related technical field
Experience with other programming languages, including .NET, Java, Javascript, etc (5+ years)
Scripting languages: Groovy, Bash/Golang/Python/Perl etc.
Continuous integration/continuous delivery tools: GoCD, Jenkins
Experience with designing, analyzing and troubleshooting distributed systems
Database management: SQL, Oracle/Postgres, Liquibase/Flyway, etc.
Atlassian Suite administration
Logging: Splunk/SumoLogic
Asychronous message brokers: Rabbit MQ/Kafka
Linux/Unix/Windows OS: Ubuntu, RHEL, CentOS, Alpine, etc
Configuration management tools: SaltStack/Ansible
Cloud virtualization/infrastructure provisioning: Terraform, CloudFormation
Familiarity with architectural patterns like microservices, REST, DDD, Enterprise Integration Patterns a plus

TO DISCOVER ALL THE REMARKABLE ADVANCEMENTS THROUGHOUT MYRIAD GENETICS, INC PLEASE VISIT:

Myriad Neuroscience: https://genesight.com/about-myriad-neuroscience/
Myriad Oncology: https://myriad-oncology.com/
Myriad Urology: https://myriadmyrisk.com/urology/
Myriad Women's Health: https://myriadwomenshealth.com/

For more information on how Myriad is making a difference, please visit the Company's website: www.myriad.com
For other great information, you can visit us on Linkedin at: https://www.linkedin.com/company/myriad-genetics/
Myriad Genetics: Health Illuminated

#LI-remote

#LI-MG2

Same Posting Description for Internal and External Candidates

PI180667291

Apply Here