Pyspark Developer

New York, New York

Employer: Virtusa Corporation

Industry:

Salary: Competitive

Job type: Full-Time

Description

Experience in building Pyspark process. Proficient in understanding distributed computing principles. Experience in managing Hadoop cluster with all services. Experience with Nosql Databases and Messaging systems like Kafka. Designing building installing configuring and supporting Hadoop Perform analysis of vast data stores. Good understanding of cloud technology. Must have strong technical experience in Design Mapping specifications HLD LLD. Must have the ability to relate to both business and technical members of the team and possess excellent communication skills. Leverage internal tools and SDKs, utilize AWS services such as S3, Athena, and Glue, and integrate with our internal Archival Service Platform for efficient data purging. Lead the integration efforts with the internal Archival Service Platform for seamless data purging and lifecycle management. Collaborate with the data engineering team to continuously improve data integration pipelines, ensuring adaptability to evolving business needs.
Develop and maintain data platforms using Pyspark
Work with AWS and Big Data, design and implement data pipelines, and ensure data quality and integrity
Collaborate with crossfunctional teams to understand data requirements and design solutions that meet business needs
Implement and manage agents for monitoring, logging, and automation within AWS environments
Handling migration from PySpark to AWS

Created: 2024-08-22

Reference: CREQ193162

Country: United States

State: New York

City: New York

ZIP: 10036

Similar jobs:

Pyspark Developer

Virtusa Corporation in New York, New York
Back End Developer (Java/Pyspark)

Insight Global in New York, New York

Pyspark Developer

New York, New York

Similar jobs:

Pyspark Developer

Back End Developer (Java/Pyspark)