Service Reliability Engineer (SRE), Data Infrastructure
Cupertino, California
Summary
The Apple Services Engineering team (ASE) is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it at an extensive scale, meeting our high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries.
These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services.
Thanks to Apple's unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple's privacy policy, one of our core values. Although services are a bigger part of Apple's business than ever before, these teams remain small, and multi-functional, offering greater exposure to the array of opportunities here.
Key Qualifications
At least 5 years in a Service Reliability Engineering (SRE), DevOps or infrastructure focused role
5+ years of running services in a large scale *nix environment
Understanding of SRE principles and goals along with prior on-call experience
The ability to design, author, and release code in any language (Go, Python, Ruby or Java would be a plus)
Deep understanding and experience in one or more of the following - Docker, Mesos, Kubernetes, AWS, Ansible, Hadoop, Spark, Cassandra
Experience and understanding on scaling, capacity planning and disaster recovery
Fast learner with excellent analytical problem solving and interpersonal skills
Experience working on supporting Java applications
Experience using monitoring and logging solutions like Prometheus, Grafana, Splunk etc.
Familiarity with DNS, HTTP, message queues, queueing theory, RPC frameworks, datastore
Experience working with geographically distributed teams and implement high level projects and migrations
Strong communication skills and ability deliver results on time with high quality
Description
The Service Reliability Engineer (SRE) role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on, technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability and availability. We work closely with the systems engineers, network engineers, database administrators, monitoring team and information security team.
If you love designing, running systems and infrastructure that will impact millions of users then this is the place for you!
Responsibilities include:
*Be primary point of contact for the data pipeline that involves Kafka, Hadoop, Cassandra etc and infrastructure components.
*Ensure Service level SLAs are met write, review and develop code and documentation that solves the hardest problems that live on some of the largest and most sophisticated systems in the world.
*Engage and improve life cycle of service from inception and design, to deployment, operation, and migration.
*Code in Python, Golang and Java.
*Passion for quality and automation, ability to understand sophisticated systems and a desire to constantly make things better.
*Set priorities and work efficiently in a fast-paced environment measure and optimize system performance.
Education & Experience
BS degree in computer science or equivalent field with 5+ years or MS degree with 3+ years experience, or equivalent.
The Apple Services Engineering team (ASE) is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it at an extensive scale, meeting our high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries.
These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services.
Thanks to Apple's unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple's privacy policy, one of our core values. Although services are a bigger part of Apple's business than ever before, these teams remain small, and multi-functional, offering greater exposure to the array of opportunities here.
Key Qualifications
At least 5 years in a Service Reliability Engineering (SRE), DevOps or infrastructure focused role
5+ years of running services in a large scale *nix environment
Understanding of SRE principles and goals along with prior on-call experience
The ability to design, author, and release code in any language (Go, Python, Ruby or Java would be a plus)
Deep understanding and experience in one or more of the following - Docker, Mesos, Kubernetes, AWS, Ansible, Hadoop, Spark, Cassandra
Experience and understanding on scaling, capacity planning and disaster recovery
Fast learner with excellent analytical problem solving and interpersonal skills
Experience working on supporting Java applications
Experience using monitoring and logging solutions like Prometheus, Grafana, Splunk etc.
Familiarity with DNS, HTTP, message queues, queueing theory, RPC frameworks, datastore
Experience working with geographically distributed teams and implement high level projects and migrations
Strong communication skills and ability deliver results on time with high quality
Description
The Service Reliability Engineer (SRE) role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on, technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability and availability. We work closely with the systems engineers, network engineers, database administrators, monitoring team and information security team.
If you love designing, running systems and infrastructure that will impact millions of users then this is the place for you!
Responsibilities include:
*Be primary point of contact for the data pipeline that involves Kafka, Hadoop, Cassandra etc and infrastructure components.
*Ensure Service level SLAs are met write, review and develop code and documentation that solves the hardest problems that live on some of the largest and most sophisticated systems in the world.
*Engage and improve life cycle of service from inception and design, to deployment, operation, and migration.
*Code in Python, Golang and Java.
*Passion for quality and automation, ability to understand sophisticated systems and a desire to constantly make things better.
*Set priorities and work efficiently in a fast-paced environment measure and optimize system performance.
Education & Experience
BS degree in computer science or equivalent field with 5+ years or MS degree with 3+ years experience, or equivalent.
Created: 2024-04-16
Reference: 200541296
Country: United States
State: California
City: Cupertino
About Apple
Founded in: 1976
Number of Employees: 154000
Website: https://www.apple.com/
Career site: https://www.apple.com/careers/us/
Wikipedia: https://en.wikipedia.org/wiki/Apple_Inc.
Instagram: https://www.instagram.com/apple/
LinkedIn: https://www.linkedin.com/company/apple
Similar jobs:
-
Senior Software Engineer, TikTok Protected Data Infrastructure
TikTok in San Jose, California -
Technical Program Manager III, Data Infrastructure, Google Cloud
Google in Sunnyvale, California -
AIML - Senior Data Infrastructure Software Engineer, Machine Learning Platform and Technology
Apple in Santa Clara Valley (Cupertino), California -
AI Ops Site Reliability Engineer - Data Infrastructure
TikTok in San Jose, California -
Site Reliability Engineer - Data Infrastructure
TikTok in San Jose, California -
Senior Software Engineer, Infrastructure, Google Cloud Data Management
Google in Sunnyvale, California -
Backend Software Engineer, Recommendation - Data Infrastructure (Storage Systems)
TikTok in San Jose, California -
Principal Engineer, Data/AI Infrastructure (Mailchimp)
Intuit in Mountain View, California -
Backend Software Engineer, Recommendation - Data Infrastructure (Data Lake)
TikTok in San Jose, California -
Finance Manager, Infrastructure Data Center
Meta in Menlo Park, California -
Senior Product Manager, Generative AI Data Quality Infrastructure
Google in Mountain View, California -
Engineering Manager, Infrastructure Data Circulation, Google Assistant
Google in Mountain View, California -
Senior Software Engineer, Data Infrastructure
Roblox in San Mateo, California -
Software Engineering Manager II, Infrastructure, Google Cloud Data Management
Google in Sunnyvale, California -
Senior Site Reliability Engineer - Data Infrastructure
TikTok in San Jose, California -
Senior Product Manager - Data Infrastructure, Risk...
Walmart in SUNNYVALE, California💸 $117000.00 per year -
Software Engineer III, Infrastructure, Google Cloud Data Management
Google in Sunnyvale, California -
Tech Lead Software Engineer, TikTok Protected Data Infrastructure
TikTok in San Jose, California -
Principal Big Data Software Engineer - Streaming Infrastructure
Roblox in San Mateo, California -
AIML - Sr Software Data Infrastructure Engineer, Data and ML Innovation
Apple in Cupertino, California