Machine Learning Engineer - Model Serving Infrastructure - USDS
Mountain View, California
Employer: TikTok
Industry: R&D
Salary: Competitive
Job type: Full-Time
Responsibilities
About TikTok U.S. Data Security
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.
Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
Team Intro
The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. We also drive substantial impact on core businesses of the company. Currently, we are looking for Machine Learning Engineer - Model Serving Infrastructure to join our team to support and advance that mission.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities:
- Responsible for the design and implementation of distributed inference infrastructure for feeds, ads and search ranking models.
- Responsible for building monitoring/managing tools to oversee the reliability and scalability of online inference servers
- Responsible for triaging system inefficiency and bottlenecks and improving system performance
- Responsible for building tools to analyze bottlenecks and sources of instability and then design and implement solutions
- Responsible for collaboration with product teams and providing general solutions to meet their requirements
Qualifications
Minimum Qualifications
- Bachelor's/Master's degree in Computer Science, Computer Engineering, or related fields or equivalent years of experience in a software engineering role
- Proficient in C/C++/CUDA, and have solid programming skills.
- Familiar with deep learning serving frameworks (TensorFlow Serving/TorchScript).
- Experience in GPU performance optimization
Preferred Qualifications
- Experience contributing to an open sourced machine learning framework (tensorflow / jax / pytorch / torchscript / mxnet / tensorrt).
- Experience in developing and deploying large-scale systems.
- Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA) or ML for Systems.
- Ability to work independently and complete projects from beginning to end and in a timely manner.
- Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
D&I Statement
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Accommodation Statement
TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/ktJP6
Data Security Statement
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.
#LI-DS4
About TikTok U.S. Data Security
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.
Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
Team Intro
The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. We also drive substantial impact on core businesses of the company. Currently, we are looking for Machine Learning Engineer - Model Serving Infrastructure to join our team to support and advance that mission.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities:
- Responsible for the design and implementation of distributed inference infrastructure for feeds, ads and search ranking models.
- Responsible for building monitoring/managing tools to oversee the reliability and scalability of online inference servers
- Responsible for triaging system inefficiency and bottlenecks and improving system performance
- Responsible for building tools to analyze bottlenecks and sources of instability and then design and implement solutions
- Responsible for collaboration with product teams and providing general solutions to meet their requirements
Qualifications
Minimum Qualifications
- Bachelor's/Master's degree in Computer Science, Computer Engineering, or related fields or equivalent years of experience in a software engineering role
- Proficient in C/C++/CUDA, and have solid programming skills.
- Familiar with deep learning serving frameworks (TensorFlow Serving/TorchScript).
- Experience in GPU performance optimization
Preferred Qualifications
- Experience contributing to an open sourced machine learning framework (tensorflow / jax / pytorch / torchscript / mxnet / tensorrt).
- Experience in developing and deploying large-scale systems.
- Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/RDMA) or ML for Systems.
- Ability to work independently and complete projects from beginning to end and in a timely manner.
- Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
Candidates for this position must be legally authorized to work in the United States. This position is not eligible for visa sponsorship or support.
D&I Statement
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Accommodation Statement
TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/ktJP6
Data Security Statement
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.
#LI-DS4
Created: 2024-09-07
Reference: A240196
Country: United States
State: California
City: Mountain View
Similar jobs:
-
Staff Machine Learning Engineer, Autobidder
Tesla Motors in Palo Alto, California -
Machine Learning Engineer, Residential Energy
Tesla Motors in Palo Alto, California -
Tech Lead Machine Learning Engineer - Applied AIGC, TikTok Monetization GenAI
TikTok in San Jose, California -
Machine Learning Engineer
Uber in San Francisco, California💸 $158000 per year -
Tech Lead, Machine Learning Engineer-TikTok Multimedia, Data Platform
TikTok in San Jose, California -
Sr. Machine Learning Engineer, Annapurna ML
Amazon in Cupertino, California💸 $151300 per year -
Sr Machine Learning Engineer, Inclusive / Responsible AI
Pinterest in San Francisco, California -
Senior Software Framework Engineer - System Intelligent and Machine Learning, ISE
Apple in Cupertino, California -
Machine Learning Engineer - Maps
Aurora Innovation in Mountain View, California💸 $144000 per year -
Machine Learning Engineer Graduate (E-commerce Governance Algorithms) - 2025 Start (Phd)
TikTok in San Jose, California -
Software Engineering Manager, Machine Learning
Meta in Sunnyvale, California -
Senior Machine Learning Engineer, TikTok Core Feed Recommendation - User Growth
TikTok in San Jose, California -
Senior Machine Learning Engineer, Search Engine, E-Commerce Alliance
TikTok in San Jose, California -
Senior Machine Learning Engineer, Search Recommendation Systems (Multiple Positions)
TikTok in San Jose, California -
Senior Machine Learning Engineer - Search Recommendation, E-commerce
TikTok in San Jose, California -
Staff Software Engineer, Machine Learning, Ads Understanding
Google in Los Angeles, California -
Software Engineer II, Machine Learning
Uber in San Francisco, California💸 $158000 per year -
Software Development Engineer - Machine Learning, Ad Response Prediction
Amazon in Palo Alto, California💸 $129300 per year -
Senior Software Engineer, Machine Learning, Google Research
Google in Mountain View, California -
Sr. Deep Learning Compiler Engineer III, AWS Neuron, Annapurna Labs
Amazon in Cupertino, California💸 $151300 per year