Site Reliability Engineer
Austin, Texas
Employer: Compunnel
Industry:
Salary: Competitive
Job type: Part-Time
Description:
The Client Site Reliability team is responsible for the operations and infrastructure of all consumer-facing production systems and developer-facing systems at Client Games, including NBA Client game services, customer-facing account services, and websites. This team handles systems and services spanning multiple datacenters both terrestrial and cloud-based.
What We Need:
We are looking for an expert engineer who is passionate about building multi-datacenter infrastructure and services. Robust systems and problem-solving skills are required as we develop solutions for game studios and support data centers around the world alongside a group of outstanding engineers. In this role, you will collaborate with network engineers, systems architects, and development staff to support our gamers and the needs of the business.
What you will do
What We Do
Build and operate highly resilient systems in a multi-datacenter and cloud global environment serving game and consumer services
Develop tools for the management and automation of the systems and service infrastructure
Define and implement standards that will impact systems, services, and multiple software environments
Diagnose and resolve technical issues from both internal and external customers and drive improvements to prevent them from recurring
Participate in Site Reliability Engineeringâs on-call rotation
Who We Believe Will Be an Outstanding Fit
You are eager to work in a fast-paced environment with other highly skilled engineers who are passionate about service availability and health!
If the idea of building data center infrastructure services from greenfield to implementation moves you!
Required Qualifications
6+ years of demonstrated influence across one or more teams for large scale projects that drive impact and improvement across the organization
6+ years of experience in an SRE role for online services in a multi-region, multi-cloud environment with specific experience in reliability and resiliency
6+ years of developing tools for automation of processes or augmenting off the shelf tool functionality
6+ years of AWS and/or GCP cloud experience running highly elastic mission critical workloads
6+ years of coding experience in at least one or more of Python, Ruby, Java, or Go and a good understanding of code management
6+ years of experience using Infrastructure as Code tools like Terraform, Pulumi, or others
Extensive knowledge of software build, test, and deploy processes using Git, Jenkins, Puppet, Ansible, Docker/containers, and Kubernetes
Experience with system analysis and troubleshooting
Serve as a mentor to junior engineers and provide technical leadership to the organization.
Bonus Points
Prior hands-on experience running large scale multiplayer video games at scale
Experience designing and crafting software for systems and network automation
Debugging, code optimization, and routine task automation skills
Demonstrated ability to decompose sophisticated problems. Ability to engage in lateral investigations.
Must Haves:
3 to 5 years exp. Kubernetes, Data Dog, cloud services, large scale systems, AWS&GCP, minor Azure
GKE, home strung clusters on prem, and AKS (Very Small), EKS
Consistent upgrades across all the clusters and clouds
Education: Bachelors Degree
Additional client information:
The Client Site Reliability team is responsible for the operations and infrastructure of all consumer-facing production systems and developer-facing systems at Client Games, including NBA Client game services, customer-facing account services, and websites. This team handles systems and services spanning multiple datacenters both terrestrial and cloud-based.
What We Need:
We are looking for an expert engineer who is passionate about building multi-datacenter infrastructure and services. Robust systems and problem-solving skills are required as we develop solutions for game studios and support data centers around the world alongside a group of outstanding engineers. In this role, you will collaborate with network engineers, systems architects, and development staff to support our gamers and the needs of the business.
What you will do
What We Do
Build and operate highly resilient systems in a multi-datacenter and cloud global environment serving game and consumer services
Develop tools for the management and automation of the systems and service infrastructure
Define and implement standards that will impact systems, services, and multiple software environments
Diagnose and resolve technical issues from both internal and external customers and drive improvements to prevent them from recurring
Participate in Site Reliability Engineeringâs on-call rotation
Who We Believe Will Be an Outstanding Fit
You are eager to work in a fast-paced environment with other highly skilled engineers who are passionate about service availability and health!
If the idea of building data center infrastructure services from greenfield to implementation moves you!
Required Qualifications
6+ years of demonstrated influence across one or more teams for large scale projects that drive impact and improvement across the organization
6+ years of experience in an SRE role for online services in a multi-region, multi-cloud environment with specific experience in reliability and resiliency
6+ years of developing tools for automation of processes or augmenting off the shelf tool functionality
6+ years of AWS and/or GCP cloud experience running highly elastic mission critical workloads
6+ years of coding experience in at least one or more of Python, Ruby, Java, or Go and a good understanding of code management
6+ years of experience using Infrastructure as Code tools like Terraform, Pulumi, or others
Extensive knowledge of software build, test, and deploy processes using Git, Jenkins, Puppet, Ansible, Docker/containers, and Kubernetes
Experience with system analysis and troubleshooting
Serve as a mentor to junior engineers and provide technical leadership to the organization.
Bonus Points
Prior hands-on experience running large scale multiplayer video games at scale
Experience designing and crafting software for systems and network automation
Debugging, code optimization, and routine task automation skills
Demonstrated ability to decompose sophisticated problems. Ability to engage in lateral investigations.
Must Haves:
3 to 5 years exp. Kubernetes, Data Dog, cloud services, large scale systems, AWS&GCP, minor Azure
GKE, home strung clusters on prem, and AKS (Very Small), EKS
Consistent upgrades across all the clusters and clouds
Education: Bachelors Degree
Additional client information:
Created: 2024-04-23
Reference: PATDC4869938
Country: United States
State: Texas
City: Austin
ZIP: 78749
Similar jobs:
-
Lead Reliability Engineer
WestRock in Evadale, Texas -
Reliability & Integrity Engineer (Electrical)
Hunt Guillot & Associates in Corpus Christi, Texas -
Analyzer Reliability Engineer
Koch Industries in Victoria, Texas -
Sr. Site Reliability Engineer (SRE)
Cognizant Technology Solutions in Dallas, Texas -
Site Reliability Engineer III
LexisNexis Risk Solutions in HOME-BASED, Texas -
Lead - Site Reliability Engineer
Innova solutions in Dallas, Texas -
Mechanical Reliability Engineer III
EDG Inc in Houston, Texas -
Staff Database Reliability Engineer
Procore in Austin, Texas -
Site Reliability Engineer, Enterprise Justice
Tyler Technologies in Plano, Texas -
Site Reliability Engineer, Enterprise Justice
Tyler Technologies in Plano, Texas -
Senior Site Reliability Engineer - Ad Platforms
Apple in Austin, Texas -
Site Reliability Engineer - Ad Platforms
Apple in Austin, Texas -
Data Platform Site Reliability Engineer (SRE) - Apple Services Engineering
Apple in Austin, Texas -
Site Reliability Engineer II
Procore in Austin, Texas💸 $112320 - $154440. per year -
Site Reliability Engineer
Saxon Global in [dallas, Tx, 75201], Texas -
SRE / Reliability Engineer (Senior) with skills ITSM Principles, AWS - EKS, AWS - CloudFormation, SRE Architecture, AWS-Apps, GCP-Apps, AWS-Infra, SRE Engineering, AWS DBA for location Dallas, Texas, US
Infogain Europe in Dallas, Texas -
Reliability Engineer
Saint Gobain in BRYAN, Texas -
Site Reliability Engineer
Esolvit, Inc. in Austin, Texas -
DevOps Site Reliability Engineer
The Reynolds and Reynolds Company in Houston, Texas -
Quality \u0026 Reliability Engineer, Annapurna Labs USA
Amazon in Austin, Texas