Sr. Manager, Site Reliability Engineering

Dallas, Texas


Employer: BRINKER
Industry: 
Salary: Competitive
Job type: Full-Time

Sr. Manager, Site Reliability Engineer

Dallas TX

What does it mean to be a BrinkerHead? We play like a team, take pride in our culture and seek every opportunity to make people feel special. Life is short. Work happy. At Brinker, we connect, serve and give to create the best life for our Team Members, Guests and community. Through our cultural beliefs, Brinker empowers its Team Members to positively impact our 4 Key Results: Engaging Team Members, Bringing Back Guests, Growing Sales and Increasing Profits. Brinker International is an equal opportunity employer; we foster an inclusion environment that promotes respect, diversity of thought and success for all.

Job Summary

As a Senior Site Reliability Manager, you will play a crucial role in ensuring the stability and scalability of our systems. You will be responsible for leading a team of talented engineers, driving initiatives to enhance reliability for our technology systems, streamline operations, and minimize downtime. Your technical expertise, coupled with strong communication skills, and strategic thinking will be instrumental in fostering collaboration across teams and implementing best practices throughout the Digital Guest Experience team.

Your Key Job Functions
  • Lead and mentor a team of Site Reliability Engineers, providing guidance and support, while also implementing best practices and resolving complex technical challenges.
  • Collaborate with cross-functional teams to define reliability requirements, establish service level objectives (SLOs), and develop a strategic vision along with defined action items to hold accountability among the team
  • Monitor system performance, conduct root cause analysis of incidents, implement and document solutions to prevent recurrence.
  • Implement monitoring and alerting systems using tools including, but not limited to, New Relic, Noibu, and GCP Logs/ AWS Cloud Logs to proactively identify issues and reach resolution
  • Develop and maintain incident response plans, including documentation of procedures, solutions and escalation pathways
  • Drive automation initiatives to streamline operations, improve efficiency, and reduce manual intervention.
  • Ensure compliance with relevant regulations and standards, including the Americans with Disabilities Act (ADA), California Consumer Privacy Act (CCPA), and General Data Protection Regulation (GDPR).
  • Create standardized documentation for all systems, processes, and procedures to ensure support and knowledge sharing across the team and ensure it remains current and relevant
  • Have knowledge of industry trends and emerging technologies, evaluating their potential impact on our current systems and utilizing data based recommendations for adoption.

What You Bring to the Team
  • Master's degree and/or bachelor's degree in combination with equivalent experience in Computer Science, Engineering, or related field.
  • 5+ years as a Site Reliability Engineer or similar role, with a demonstrated track record of successfully managing reliability and scalability of large-scale systems.
  • Proven technical proficiency in cloud-based environments, including, but not limited to, Google Cloud Platform (GCP).
  • Proficiency in utilizing tools to monitor and track reliability, systems performance, data gathering to troubleshoot with tools including, but not limited to New Relic, Noibu, Datadog and GCP Logs
  • Demonstrated ability to build and maintain dashboards in tools, including, but not limited to, New Relic and Noibu.
  • Excellent written and verbal communication skills and proven ability to utilize various communication in combination with strong interpersonal skills to explain complex technical concepts to stakeholders and/or team members with varying degrees of technical knowledge
  • Demonstrated leadership experience, with a passion for mentoring and developing team members.
  • Proven ability to problem solve complex issues in a timely fashion
  • Proven ability to quickly adapt and flex to a dynamic environment by being a "self-starter"

• Certifications in cloud computing, including but not limited to, Google Cloud - Professional Cloud Architect

• Familiarity with continuous integration and deployment pipelines and infrastructure (CI/CD) as code (IaC) principles

• Previous experience working within Agile or DevOps organizations

Why Brinker

We offer a competitive benefits package including medical/dental/vision, life insurance, paid vacation/holidays, and 401(k) with company match and generous dining discounts. Every team member working at the Restaurant Support Center (aka Brinker headquarters) is eligible for annual bonus potential.

Our campus includes an onsite gym plus opportunities to increase your wellbeing with onsite Yoga and boot camp programs . Work/Life/Fun balance in a casual and collaborative work environment! Team members enjoy company-wide events and celebrations . Regular volunteer opportunities with our community give back programs

Check our Careers page for more exciting opportunities! Brinker Careers

Join our talent communities! Brinker LinkedIn

#LifeisShortWorkHappy
#brinkerjobs
#brinkerhead

Created: 2024-09-11
Reference: 004I16
Country: United States
State: Texas
City: Dallas
ZIP: 75287


Similar jobs: