GPU Compute Performance Engineer

Waltham, Massachusetts


Employer: Apple
Industry: Software and Services
Salary: Competitive
Job type: Full-Time

Summary
The Apple Silicon GPU Driver Performance team is responsible for ensuring great GPU performance across our entire product line. This team is involved in all phases of the product development cycle: working with HW teams on our GPU feature and architectural roadmaps, delivering and analyzing the performance of the latest GPU workloads on emerging platforms, developing our state-of-the-art performance analysis capabilities on silicon, and helping internal and external partners achieve the best performance possible on Apple Silicon GPUs.

Members of this team possess deep technical expertise in our GPU architecture and programming models. We use this expertise to develop workflows and tools for deep performance analysis, which we leverage to develop and optimize GPU graphics and compute workloads specifically for Apple GPUs. The team works on benchmarks, games, pro apps, ML, GPU compute, and image processing use cases, optimizing the workloads at both the algorithm and shader level to achieve speed-of-light performance.



Description
The team is seeking extraordinary GPU and machine learning engineers who are passionate about providing robust compute solutions for accelerating machine learning workloads at both the system and GPU programming level on Apple Silicon. The ideal candidate will have a passion for squeezing the best performance possible out of our GPUs and be able to explore the low-level architectural details of the HW to achieve this. They will work closely with our GPU hardware architecture and design teams to help develop our GPU roadmap and to ensure Apple is building the right HW and SW features to make the best - and fastest - GPU products.

This role's responsibilities will include:

* Working with internal partners to analyze and improve GPU and system performance of large-scale ML deployments, such as Apple Intelligence, Private Cloud Compute, and MetalFX.
* Working with internal and external partners to optimize their GPU-based ML algorithm implementations, GPU compute applications, algorithms, and shaders to achieve the best possible performance on Apple platforms.
* Working with internal hardware teams to define a hardware roadmap that continues to deliver best-in-class GPU performance as well as performance analysis capabilities, particularly in the areas of emerging GPU-accelerated ML training and inference and GPGPU use cases and workflows.
* Developing tools and frameworks to support internal and external developers with performance analysis on Apple Silicon GPUs.




Created: 2024-08-27
Reference: 200554934
Country: United States
State: Massachusetts
City: Waltham

About Apple

Founded in: 1976
Number of Employees: 154000

