Site Reliability Engineer

San Francisco, California

Employer: AEG

Industry: Technical Services

Salary: Competitive

Job type: Full-Time

In order to be considered for this role, after clicking "Apply Now" above and being redirected, you must fully complete the application process on the follow-up screen.

Swish Analytics is a sports analytics, betting and fantasy startup building the next generation of predictive sports analytics data products. We believe that oddsmaking is a challenge rooted in engineering, mathematics, and sports betting expertise; not intuition. We're looking for team-oriented individuals with an authentic passion for accurate and predictive real-time data who can execute in a fast-paced, creative, and continually-evolving environment without sacrificing technical excellence. Our challenges are unique, so we hope you are comfortable in uncharted territory and passionate about building systems to support products across a variety of industries and enterprise clients. About the team The Swish Analytics DevSecOps and Infrastructure team is looking for an experienced Site Reliability Engineer based in Europe who will support our enterprise infrastructure during non-US hours. In addition to supporting you will assist in optimizing incident response, observability, and working with technical teams to improve overall workload resiliency. Responsibilities

Support production systems and help triage issues during live sporting events
Monitor the system and respond to incidents to maintain system SLO/SLA, review and follow up production incidents
Write and review code, develop documentation, and debug problems, live, on complex distributed systems
Optimize and facilitate incident response, conduct root cause analysis and blameless retrospectives
Work closely with technical teams to implement, optimize, maintain, scale and debug workloads on Kubernetes using CI/CD, automation tools and scripting languages to deliver tools/software to improve the reliability and scalability of services

Qualifications

3+ years of experience working in an SRE leaning DevOps or full SRE roles
3+ years building CICD pipelines with Github Actions, Gitlab CICD, or similar
Extensive experience with Kubernetes
Experience in managing customer-facing systems in a 24/7 environment including escalations
Experience triaging and escalation policies/protocols
Strong communication and documentation skills
Comfortable with scripting languages like Bash, Python, or similar

Preferred

Networking and routing experience
Terraform in AWS to support global-scale services
Improving observability in an engineering organization
Past experience with PagerDuty or similar tools

Swish Analytics is an Equal Opportunity Employer. All candidates who meet the qualifications will be considered without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, pregnancy status, genetic, military, veteran status, marital status, or any other characteristic protected by law. The position responsibilities are not limited to the responsibilities outlined above and are subject to change. At the employer's discretion, this position may require successful completion of background and reference checks.

Created: 2024-04-23

Reference: 2069834

Country: United States

State: California

City: San Francisco

ZIP: 94130

About AEG

Founded in: 1994

Number of Employees: 28000

Website: https://www.aegworldwide.com/

Career site: https://www.aegworldwide.com/careers

Wikipedia: https://en.wikipedia.org/wiki/Anschutz_Entertainment_Group

LinkedIn: https://www.linkedin.com/company/aeg

Facebook: https://www.facebook.com/AEGWorldwide/

Similar jobs:

Software Development Engineer in Test -II, WWGST Quality Reliability Engineering

Amazon in Irvine, California

💸 $115000 per year
Site Reliability Engineer, Cloud Infrastructure- USDS

TikTok in Los Angeles, California
Senior Site Reliability Engineer (SRE) - ASE / iCloud

Apple in Cupertino, California
Hardware Reliability Engineer, Product Integrity

Google in Mountain View, California
Senior Backend Software Engineer (Reliability Assurance) - Server Architecture

TikTok in San Jose, California
Senior Site Reliability Engineer, TikTok Server Architecture

TikTok in San Jose, California
Site Reliability Engineer, Data Engineering - USDS

TikTok in Los Angeles, California
Sr Reliability Engineer

Valero Energy in Benicia, California

💸 $123520 - $169840 per year
Sr. Site Reliability Engineer, Cell Factory

Tesla Motors in Fremont, California
Site Reliability Engineer, Data Engineering - USDS

TikTok in Mountain View, California
Sr Manager, Quality Engineering (Reliability Engineering)

Chipotle in Newport Beach, California
Site Reliability Engineer, Recommendation Infrastructure - USDS

TikTok in Los Angeles, California
AIML - Sr Engineering Manager, Siri Performance and Reliability

Apple in Cupertino, California
Site Reliability Engineer

Apple in Cupertino, California
Senior Software Engineering Manager, Reliability Engineering

Roblox in San Mateo, California
Site Reliability Engineer - Solr

Apple in Cupertino, California
Site Reliability Engineer - Data Infrastructure

TikTok in San Jose, California
Reliability Engineer, Semi Chassis Systems

Tesla Motors in Fremont, California

💸 $84000 - $276000 per year
Plant Reliability Engineer

Gables Search Group in San Francisco, California
Site Reliability Engineer, Engineering Technology & Operations

Tesla Motors in Palo Alto, California