Senior/Tech Lead AI/LLM Network Software Development Engineer - San Jose
San Jose, California
Employer: TikTok
Industry: R&D
Salary: Competitive
Job type: Full-Time
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible.
Together, we inspire creativity and enrich life - a mission we work toward every day.
To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve.
Join us.
About the Team
ByteDance Networking brings together innovative ideas and technologies from network architecture, software-defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking to create hyperscale data-center networking solutions. These solutions power several of the most popular apps in the world, such as Douyin and TikTok, which serve hundreds of millions of users around the globe.
ByteDance Networking is responsible for designing, building, and operating the global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high performance. By joining this team, you will gain marketable software development and/or network operations experience in data center networking at massive scale.
Responsibilities:
- Design, implement, and deploy high-speed network technologies to support AI/LLM applications.
- Design and develop platforms and systems for monitoring, analyzing, and diagnosing large-scale AI/LLM networks.
- Research and develop high-performance AI communication frameworks and network protocol stacks, and co-design optimizations across host, network, and application to improve the scalability, reliability, and performance of AI/LLM networks.
- Build next-generation AI network infrastructure that supports large-scale heterogeneous network hardware with innovative and deployable solutions.
Qualifications
Minimum Qualifications
- Bachelor's degree or higher in computer science, electronic engineering, network engineering, or a related field.
- Proficiency in computer networking and network programming.
- Proficiency in one or more mainstream programming languages, such as C/C++, Python, or Go.
- Familiarity with the latest advances in high-speed network systems, such as RDMA, congestion control, and AI network optimization.
Preferred Qualifications
- Experience in developing high-performance communication frameworks (including NCCL, MPI, and RPC libraries) is a plus.
- Experience in developing software systems for AI network diagnosis and performance optimization is a plus.
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2
Created: 2024-05-18
Reference: RMHP
Country: United States
State: California
City: San Jose
ZIP: 95118
Similar jobs:
- Embedded Software Development Engineer
  Amazon in Sunnyvale, California 💸 $115000 per year
- Software Development Engineer II, AWS Industry Products, Manufacturing
  Amazon in Santa Clara, California 💸 $129300 per year
- Software Development Engineer
  Amazon in San Diego, California 💸 $115000 per year
- Software Development Engineer in Test -II, WWGST Quality Reliability Engineering
  Amazon in Irvine, California 💸 $129300 per year
- Software Development Engineer
  Amazon in Palo Alto, California 💸 $115000 per year
- Senior Full Stack Development Engineer (On-site)
  Cognizant Technology Solutions in Lake Forest, California 💸 $43900 per year
- Sr Software Development Engineer in Test, IS&T Ai & Data Platforms
  Apple in Sunnyvale, California
- Senior Software Development Engineer - Ads QA
  TikTok in San Jose, California
- Sr. Software Development Engineer, Personalization
  Amazon in Irvine, California 💸 $134500 per year
- Senior Software Development Engineer - Data
  TikTok in San Jose, California
- Software Development Engineer, New Security Service
  Amazon in Santa Clara, California 💸 $115000 per year
- Software Development Engineer, DBS Redshift
  Amazon in East Palo Alto, California 💸 $115000 per year
- Systems Development Engineer, Amazon Robotics Business Applications and Solutions Engineering
  Amazon in Santa Monica, California 💸 $136100 per year
- Systems Development Engineer II, Engine
  Amazon in Sunnyvale, California 💸 $103400 per year
- Software Development Engineer, Annapurna Labs
  Amazon in Cupertino, California 💸 $134500 per year
- Software Development Engineer - Prime Video, Partner Licensing Technology
  Amazon in Culver City, California 💸 $115000 per year
- Software Development Engineer, Alexa Echo Spatial Perception
  Amazon in Sunnyvale, California 💸 $115000 per year
- Software Development Engineer, Alexa Home Productivity
  Amazon in Sunnyvale, California 💸 $115000 per year
- Software Development Engineer, EKS Networking
  Amazon in Santa Clara, California 💸 $115000 per year
- Senior Software Development Engineer in Test - Global E-Commerce
  TikTok in San Jose, California