AIML - Sr Data Engineer, Data and ML Innovation
Cupertino, California
Summary
The AIML Data organization seeks to improve products by using data as the voice of our customers. Within this organization, the Siri Data Engineering team builds systems that process data reliably at scale to generate scalable and high-quality datasets that support confident, data informed decision-making for Siri to be an effective product. We're looking for exceptional data engineers who are passionate about our product and values; who love working with data at scale, and who are committed to continuously improve. As a part of this group, you will work with petabytes of data daily using diverse technologies. You will be expected to effectively partner with upstream engineering teams and downstream consumers, including data scientists and ML engineers.
Key Qualifications
7+ years of technical experience designing, building, and maintaining distributed data processing platforms.
5+ years of industry experience working with batch or streaming distributed data processing technologies (e.g. Hadoop, MapReduce, Spark, Flink, Kafka, Presto, etc.) for building efficient & large-scale data pipelines.
3+ years of data modeling experience designing data warehouse table schemas and logging schemas.
Proficiency in at least one high-level programming language (Java, Scala, Python, Go or equivalent).
Experience with large, complex, highly dimensional data sets; hands-on experience with SQL.
Experience working with cross-functional teams to collect business requirements, build consensus, and manage expectations.
You are self-directed and capable of operating amidst ambiguity.
You are humble, continually growing in self-awareness, and possessing a growth mindset.
You are curious and have excellent written and verbal communication as well as problem-solving skills.
You are excited about digging into massive petabyte-scale semi-structured datasets.
Description
In this role, you will be building ultra large scale batch & streaming datasets to support analytics, experimentation and machine learning and helping to drive our self-serve strategy for reporting on-behalf of data scientists and product engineers as we collectively make product better. You will help design instrumentation required to log data from device and server side and validate data is flowing in the correct shape, frequency, and quality into the Data Warehouse. Curate a high performance and easy to understand data model that meets the needs of the many. Identify common patterns and build self-serve tools to scale data engineering, and automate lifecycle of datasets with highest standards of data quality. Educate your consumers on how to access your products, assuring transparency and understanding in logic definitions and enabling self-service.
Education & Experience
Surprise us! Many will have an MS or BS in CS, Engineering, Math, Statistics, or a related field or equivalent practical experience in data engineering.
The AIML Data organization seeks to improve products by using data as the voice of our customers. Within this organization, the Siri Data Engineering team builds systems that process data reliably at scale to generate scalable and high-quality datasets that support confident, data informed decision-making for Siri to be an effective product. We're looking for exceptional data engineers who are passionate about our product and values; who love working with data at scale, and who are committed to continuously improve. As a part of this group, you will work with petabytes of data daily using diverse technologies. You will be expected to effectively partner with upstream engineering teams and downstream consumers, including data scientists and ML engineers.
Key Qualifications
7+ years of technical experience designing, building, and maintaining distributed data processing platforms.
5+ years of industry experience working with batch or streaming distributed data processing technologies (e.g. Hadoop, MapReduce, Spark, Flink, Kafka, Presto, etc.) for building efficient & large-scale data pipelines.
3+ years of data modeling experience designing data warehouse table schemas and logging schemas.
Proficiency in at least one high-level programming language (Java, Scala, Python, Go or equivalent).
Experience with large, complex, highly dimensional data sets; hands-on experience with SQL.
Experience working with cross-functional teams to collect business requirements, build consensus, and manage expectations.
You are self-directed and capable of operating amidst ambiguity.
You are humble, continually growing in self-awareness, and possessing a growth mindset.
You are curious and have excellent written and verbal communication as well as problem-solving skills.
You are excited about digging into massive petabyte-scale semi-structured datasets.
Description
In this role, you will be building ultra large scale batch & streaming datasets to support analytics, experimentation and machine learning and helping to drive our self-serve strategy for reporting on-behalf of data scientists and product engineers as we collectively make product better. You will help design instrumentation required to log data from device and server side and validate data is flowing in the correct shape, frequency, and quality into the Data Warehouse. Curate a high performance and easy to understand data model that meets the needs of the many. Identify common patterns and build self-serve tools to scale data engineering, and automate lifecycle of datasets with highest standards of data quality. Educate your consumers on how to access your products, assuring transparency and understanding in logic definitions and enabling self-service.
Education & Experience
Surprise us! Many will have an MS or BS in CS, Engineering, Math, Statistics, or a related field or equivalent practical experience in data engineering.
Created: 2024-05-01
Reference: 200548865
Country: United States
State: California
City: Cupertino
About Apple
Founded in: 1976
Number of Employees: 154000
Website: https://www.apple.com/
Career site: https://www.apple.com/careers/us/
Wikipedia: https://en.wikipedia.org/wiki/Apple_Inc.
Instagram: https://www.instagram.com/apple/
LinkedIn: https://www.linkedin.com/company/apple
Similar jobs:
-
AIML - Machine Learning Researcher, MLR
Apple in Santa Clara Valley (Cupertino), California -
AIML - Director of Engineering, Siri Device Ecosystem
Apple in Cupertino, California -
AIML - Sr Manager, Program Office, Data Operations
Apple in Cupertino, California -
AIML - Sr Events & Experiences Project Manager
Apple in Cupertino, California -
AIML - Senior Software Engineer, Privacy - Machine Learning Platform and Technology
Apple in Santa Clara Valley (Cupertino), California -
AIML - Software Engineer, Siri Speech
Apple in Cupertino, California -
AIML - Full Stack Engineer, Siri and Information Intelligence
Apple in Cupertino, California -
AIML - Machine Learning Engineer, MIND
Apple in San Francisco, California -
AIML - Data Engineer, Data and ML Innovation
Apple in Cupertino, California -
AIML - ML Engineer, Siri & Information Intelligence
Apple in Cupertino, California -
AIML - Sr Engineering Manager, Siri Performance and Reliability
Apple in Cupertino, California -
AIML - Senior Data Infrastructure Software Engineer, Machine Learning Platform and Technology
Apple in Santa Clara Valley (Cupertino), California -
AIML - Senior Software Engineer, Privacy - Machine Learning Platform and Technology
Apple in Cupertino, California -
AIML- Machine Learning Engineer, Machine Learning Platform & Infrastructure
Apple in Cupertino, California -
AIML - iOS Engineer Voice Input/Dictation, Siri and Information Intelligence
Apple in Cupertino, California -
AIML - Senior Backend Software Engineer, ML Innovation
Apple in Cupertino, California -
AIML - Sr ML Engineer, Data and ML Innovation
Apple in Cupertino, California -
AIML - Sr Engineering Program Manager, Siri Metrics & Evaluation Methods
Apple in San Francisco, California -
AIML - Communications Specialist, Editorial
Apple in Cupertino, California -
AIML - Research Communications Manager
Apple in San Francisco, California