AIML - Machine Learning Engineer, Machine Learning Platform & Infrastructure
Cupertino, California
Summary
At Apple, the AIML - On-Device Machine Learning group is responsible for accelerating the creation of amazing on- device ML experiences, and we are looking for a senior software engineer to help define and implement features that accelerate and compress large state of the art (SoTA) models (e.g., LLMs) in our on-device inference stack. We are a dedicated team working on ground breaking technology in the field of natural language processing, computer vision and artificial intelligence. We are designing, developing, and optimizing large-scale language/vision/multi-modal models that power on-device inference capabilities across various Apple products and services. This is a unique opportunity to work on powerful new technologies and contribute to Apple's ecosystem, with a commitment to privacy and user experience impacting millions of users worldwide.
Are you someone who can write high-quality, well-tested code and collaborate cross-functionally with partner HW, SW and ML teams across the company? If so, come join us and be part of the team that is helping Machine Learning developers innovate and ship enriching experiences on Apple devices!
Description
As a member of this team, the successful candidate will:
- Build features for our on-device inference stack to support the most relevant accuracy preserving, general purpose techniques that empower model developers to compress and accelerate SoTA models (e.g., LLMs) in apps
- Convert models from a high-level ML framework to a target device (CPU, GPU, Neural Engine) for optimal functional accuracy and performance
- Write unit and system integration tests to ensure functional correctness and avoid performance regressions
- Diagnose performance bottlenecks and work with HW Arch teams to co-design solutions that further improve latency, power, and memory footprint of neural network workloads
- Analyze impact of model optimization (compression/quantization etc) on model quality by partnering with modeling and adaptation teams across diverse product use cases.
At Apple, the AIML - On-Device Machine Learning group is responsible for accelerating the creation of amazing on- device ML experiences, and we are looking for a senior software engineer to help define and implement features that accelerate and compress large state of the art (SoTA) models (e.g., LLMs) in our on-device inference stack. We are a dedicated team working on ground breaking technology in the field of natural language processing, computer vision and artificial intelligence. We are designing, developing, and optimizing large-scale language/vision/multi-modal models that power on-device inference capabilities across various Apple products and services. This is a unique opportunity to work on powerful new technologies and contribute to Apple's ecosystem, with a commitment to privacy and user experience impacting millions of users worldwide.
Are you someone who can write high-quality, well-tested code and collaborate cross-functionally with partner HW, SW and ML teams across the company? If so, come join us and be part of the team that is helping Machine Learning developers innovate and ship enriching experiences on Apple devices!
Description
As a member of this team, the successful candidate will:
- Build features for our on-device inference stack to support the most relevant accuracy preserving, general purpose techniques that empower model developers to compress and accelerate SoTA models (e.g., LLMs) in apps
- Convert models from a high-level ML framework to a target device (CPU, GPU, Neural Engine) for optimal functional accuracy and performance
- Write unit and system integration tests to ensure functional correctness and avoid performance regressions
- Diagnose performance bottlenecks and work with HW Arch teams to co-design solutions that further improve latency, power, and memory footprint of neural network workloads
- Analyze impact of model optimization (compression/quantization etc) on model quality by partnering with modeling and adaptation teams across diverse product use cases.
Created: 2024-09-22
Reference: 200527162
Country: United States
State: California
City: Cupertino
About Apple
Founded in: 1976
Number of Employees: 154000
Website: https://www.apple.com/
Career site: https://www.apple.com/careers/us/
Wikipedia: https://en.wikipedia.org/wiki/Apple_Inc.
Instagram: https://www.instagram.com/apple/
LinkedIn: https://www.linkedin.com/company/apple
Similar jobs:
-
AIML - ML Research Engineer or Scientist
Apple in Cupertino, California -
AIML - Machine Learning Engineer/Scientist, Siri and Information Intelligence
Apple in Cupertino, California -
AIML - Staff Machine Learning Engineer, Search Query Understanding (SII)
Apple in Cupertino, California -
AIML - Sr Data Scientist, Siri and Apple Intelligence, Data and ML Innovation
Apple in Cupertino, California -
AIML - Senior ML Engineering Manager, Data & ML Innovation
Apple in Cupertino, California -
AIML - Sr Engineering Program Manager, Advanced Question Answering Systems
Apple in Cupertino, California -
AIML - Software Engineer
Apple in Cupertino, California -
AIML - Senior Software Engineer- Simulation - AIML Special Projects
Apple in Sunnyvale, California -
AIML - Machine Learning Engineer, Foundation Model Services
Apple in Cupertino, California -
AIML - Head of Technical Production, AI Studio
Apple in Culver City, California -
AIML - Sr. Machine Learning Engineer - Large Language Models and Generative AI, Siri Information and Intelligence
Apple in Cupertino, California -
AIML - Sr Engineering Program Manager, Siri Metrics & Evaluation Methods
Apple in San Francisco, California -
AIML - Sr. Director of Machine Learning Applied Research Data ML Innovation-Engineering
Apple in Cupertino, California -
AIML - Machine Learning Engineer, Foundation Models
Apple in Cupertino, California -
AIML - ML Engineer, MLR
Apple in Cupertino, California -
AIML - Manager, FM Evaluation Product & Program Management
Apple in Cupertino, California -
AIML - Frontend/UI Senior Software Engineer - Simulation, Special Projects
Apple in Sunnyvale, California -
AIML - Machine Learning Researcher, Foundation Models
Apple in Cupertino, California -
AIML - Machine Learning Engineer, Siri Perception
Apple in San Francisco, California -
AIML - ML Engineer, ML Systems and Evaluation Engineering Client Tools and Frameworks
Apple in Cupertino, California