Data Science Associate

Pittsburgh, Pennsylvania


Employer: Compunnel
Industry: 
Salary: Competitive
Job type: Part-Time

Description

Our team is experimenting with Generative AI (LLMs) to innovate on and improve traditional AI/ML techniques, for example through LLM-based classification, LLM-based natural language processing, emerging technologies such as Graph RAG, and deriving insights from gigabytes of data.

We are looking for candidates who want to take part in these proof-of-concept (PoC) efforts.

We are looking for associate-level candidates with 1-3 years of experience.

Candidates need solid Python programming skills, some SQL experience, and familiarity with common data science libraries such as TensorFlow, PyTorch, Scikit-learn, and Pandas.

Ideal candidates will have a GitHub or similar code repository to share before the interview.

Key areas in Data Science we are seeking: Machine Learning, Deep Learning Modules, Large Language Models (LLM) and Generative AI (GenAI). 

Our client is looking for a data science associate who is passionate about working with large language models and machine learning, and who can help build robust and scalable pipelines for automated processes. 

You will be part of a dynamic and innovative team that leverages cutting-edge technologies to deliver high-quality solutions for engineering and internal stakeholders.

Responsibilities

Develop, train, and deploy large language models and machine learning algorithms for various natural language processing tasks.

Build and maintain pipelines for data ingestion, processing, analysis, and visualization.

Perform data quality checks, data cleansing, and data wrangling.

Optimize and troubleshoot the performance and scalability of the models and pipelines.

Collaborate with other data scientists, engineers, and stakeholders to understand the business requirements and deliver the best solutions.

Qualifications

Bachelor's degree or higher in computer science, data science, analytics, statistics, or a related field (or equivalent experience).

Strong knowledge of the fundamentals of machine learning, natural language processing, and statistics.

Proficiency in Python and common data science libraries such as TensorFlow, PyTorch, Scikit-learn, and Pandas.

Excellent communication and teamwork skills.

Willingness to learn and explore new technologies and methodologies.

Desired:

Experience in building and deploying pipelines for data science and machine learning projects.

At least one year of professional experience in data science, machine learning, analytics, or natural language processing.

Experience working with large language models such as Mistral, Mixtral, GPT, etc.

Education: Bachelor's Degree

Created: 2024-08-22
Reference: PATDC5034201
Country: United States
State: Pennsylvania
City: Pittsburgh
ZIP: 15216

