AIML - Machine Learning Engineer, Machine Learning Platform & Infrastructure

Cupertino, California

Employer: Apple

Industry: Machine Learning and AI

Salary: Competitive

Job type: Full-Time

Summary
The AIML - On-Device Machine Learning group is responsible for the creation of amazing on-device ML experiences. The team builds foundational machine learning frameworks and tools to optimize large language/vision/multi-modal models that power on-device ML features across Apple products and services. The group is looking for a senior software engineer to help define and implement features that accelerate and compress large state-of-the-art (SoTA) models (e.g., LLMs) in our on-device inference stack. This is a unique opportunity to work on exciting new technologies and contribute to Apple's ecosystem, with a commitment to privacy and user experience impacting millions of users worldwide.
Are you someone who can write high-quality, well-tested code and collaborate cross-functionally with partner Hardware, Software, Machine Learning, and Research teams across the company? Do you have any experience building Machine Learning compilers/runtimes/kernels/optimization tools? If so, come join us and be a part of the team that is helping Machine Learning developers innovate and ship enriching experiences on Apple devices!

Description
This role sits at the intersection of software engineering and ML engineering. As a member of this team, the successful candidate will:

- Build features for our on-device inference stack to support the most relevant accuracy-preserving, general-purpose techniques that empower model developers to compress and accelerate SoTA models (e.g., LLMs) in apps

- This includes building Machine Learning compilers, runtimes, execution kernels, optimizations on ML models, tooling for debugging/visualization of ML models, etc.

- Convert models from a high-level ML framework to a target device (CPU, GPU, Neural Engine) for optimal functional accuracy and performance

- Write unit and system integration tests to ensure functional correctness and avoid performance regressions

- Diagnose performance bottlenecks and work with hardware and software architecture teams to co-design solutions that further improve latency, power, and memory footprint of neural network workloads

- Analyze the impact of model optimization (compression, quantization, etc.) on model quality by partnering with modeling and adaptation teams across diverse product use cases

Created: 2024-09-10

Reference: 200565771

Country: United States

State: California

City: Cupertino

About Apple

Founded in: 1976

Number of Employees: 154000

Website: https://www.apple.com/

Career site: https://www.apple.com/careers/us/

Wikipedia: https://en.wikipedia.org/wiki/Apple_Inc.

Instagram: https://www.instagram.com/apple/

LinkedIn: https://www.linkedin.com/company/apple

Similar jobs:

AIML - ML Research Engineer

Apple in Cupertino, California
Specialist Sales Rep, GenAI/ML, AGS-NAMER-US-COV-SPEC-DATA&AI-AIML/GenAI

Amazon in San Diego, California

💸 $83100 per year
AIML - Machine Learning Engineer/Scientist, Siri and Information Intelligence

Apple in Cupertino, California
AIML - Staff Machine Learning Engineer, Search Query Understanding (SII)

Apple in Cupertino, California
AIML - Senior Machine Learning Engineer, Computer Vision, Siri and Information Intelligence

Apple in Cupertino, California
AIML - Sr Data Scientist, Siri and Apple Intelligence, Data and ML Innovation

Apple in Cupertino, California
AIML - Director of Business Operations

Apple in Cupertino, California
AIML - Chief of Staff, Data and Machine Learning Innovation

Apple in Cupertino, California
AIML - Sr Engineering Program Manager, Advanced Question Answering Systems

Apple in Cupertino, California
AIML - Sr. Machine Learning Engineer - Large Language Models and Generative AI, Siri Information and Intelligence

Apple in Cupertino, California
AIML - Senior Manager, ML Product Evaluation Engineering

Apple in Cupertino, California
AIML - Sr Engineering Program Manager, Siri Metrics & Evaluation Methods

Apple in San Francisco, California
AIML - Senior Front End Platform Engineer - Full Stack, ML Systems and Evaluation Engineering

Apple in Cupertino, California
AIML - Sr. Engineering Program Manager, Software Features

Apple in Santa Clara, California
AIML - ML Engineer, MLR

Apple in Cupertino, California
AIML - Sr. Software Engineer - AIML Observability

Apple in Cupertino, California
AIML - Program Manager, Business Analyst

Apple in Cupertino, California
AIML - Sr Software Engineer, Siri Runtime Platform

Apple in Cupertino, California
AIML - Manager, FM Evaluation Product & Program Management

Apple in Cupertino, California
AIML - Machine Learning Researcher, Foundation Models

Apple in Cupertino, California