Employer: Meta

Industry:

Salary: Competitive

Job type: Full-Time

Our team has released the Seamless Communication models at the end of 2023, the very first massively multilingual, streaming and expressive multimodal translation systems. We are looking for a Research Engineer, expert in speech generation to take these models to the next level by making them production ready. Overtime, this project will be transitioned fully to an infrastructure team, and the role will support our next research vision to build a personalizable, controllable foundation model for synchronous, multimodal and expressive behavior generation. Meta Fundamental AI Research (FAIR) is a research organization committed to advancing open AI research, and we will push the boundaries of human-centric understanding and generation. Our team's technology will enable next-generation human-to-human and human-to-machine communication.

Research Engineer, Speech Generation - FAIR Responsibilities

Collaborate, and execute on research that pushes forward the state of the art in human-centric understanding and generation.

Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results.

Develop methodology and benchmarks to evaluate different approaches.

Work with a large and globally distributed team.

Minimum Qualifications

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.

Masters Degree in Computer Science or relevant technical field

3+ years of industry, academic or government laboratory experience

Experience holding an industry, faculty, or government researcher position

Experience developing machine learning algorithms or machine learning infrastructure in Python

Experience writing software and executing complex experiments involving large AI models and datasets

Experience in speech generation and text-to-speech

Preferred Qualifications

A PhD in AI, computer science, data science, or related technical fields.

Direct experience in generative AI, and LLM research.

First author publications experience at peer-reviewed AI conferences (NeurIPS, CVPR, ICML, ICLR, ICCV, ACL, EMNLP, Interspeech, etc.))

Experience in multimodal generation modeling, in particular human motion generation modeling.

Start preparing
Learn about how to prepare for your interview with our interview guide, tips, and interactive experiences.
Visit interview prep

Created: 2024-07-04

Reference: 1189148802113537

Country: United States

State: Pennsylvania

City: Pittsburgh

ZIP: 15216

Choose your perfect job from jobs we have!

Research Engineer, Speech Generation - FAIR

Pittsburgh, Pennsylvania