Senior Python Big Data Engineer
[mclean, Va, 22012], Virginia
Employer: Saxon Global
Industry:
Salary: Competitive
Job type: Part-Time
Seeking a Senior Data Developer to implement data processing and ingestion of structured and semi-structured data as a member of the Innovation in Data Engineering and Analytics (IDEA) team.
Responsibilities include:
• Cleanse, manipulate and analyze large datasets (Semi-Structured and Unstructured data - XMLs, JSONs, CSVs, PDFs) using python and Snowflake database.
• Develop Python scripts to filter/cleanse/map/aggregate data.
• Manage and implement data processes (Data Quality reports)
• Develop data profiling, deduping logic, matching logic for analysis
• Programming Languages experience in Python, PySpark and SQL for data ingestion
• Present ideas and recommendations on data handling and data parsing technologies to management
Qualifications:
• 5+ years of experience in processing large volumes and variety of data (Structured and semi-structured data, writing code for parallel processing, shredding XMLS, JSONs and reading PDFs) - Mandatory
• 3+ years of programming experience in Python for data processing and analysis - Mandatory
• 2+ years of experience with Snowflake, preferable parsing JSON and XML files- Desirable
• Strong SQL experience is a must - Mandatory
• 3+ years of experience - using Hadoop platform and performing analysis. Familiarity with Hadoop cluster environment and configurations for resource management for analysis work - Optional
• 2+ years of programming experience in PySpark for data processing and analysis - Optional
• Detail oriented. Excellent communication skills (verbal and written)
• Must be able to manage multiple priorities and meet deadlines
• Degree in Computer Science , Statistics, Mathematics, or related field
Required Skills : Python, SQL, Snowflake
Basic Qualification : -Must be local to Mclean, VA or willing to relo -Years of experience: 5+ -Degree in Computer Science , Statistics, Mathematics, or related field
Additional Skills : -Must be local to Mclean, VA or willing to relo -Years of experience: 5+ -Degree in Computer Science , Statistics, Mathematics, or related field
Background Check :Yes
Drug Screen :Yes
Notes :Staff Aug
Selling points for candidate :
Project Verification Info :
Candidate must be your W2 Employee :Yes
Exclusive to Apex :No
Face to face interview required :No
Candidate must be local :No
Candidate must be authorized to work without sponsorship ::No
Interview times set : :No
Type of project :Development/Engineering
Master Job Title :VMS Access Entry
Branch Code :DC Metro Commercial
Responsibilities include:
• Cleanse, manipulate and analyze large datasets (Semi-Structured and Unstructured data - XMLs, JSONs, CSVs, PDFs) using python and Snowflake database.
• Develop Python scripts to filter/cleanse/map/aggregate data.
• Manage and implement data processes (Data Quality reports)
• Develop data profiling, deduping logic, matching logic for analysis
• Programming Languages experience in Python, PySpark and SQL for data ingestion
• Present ideas and recommendations on data handling and data parsing technologies to management
Qualifications:
• 5+ years of experience in processing large volumes and variety of data (Structured and semi-structured data, writing code for parallel processing, shredding XMLS, JSONs and reading PDFs) - Mandatory
• 3+ years of programming experience in Python for data processing and analysis - Mandatory
• 2+ years of experience with Snowflake, preferable parsing JSON and XML files- Desirable
• Strong SQL experience is a must - Mandatory
• 3+ years of experience - using Hadoop platform and performing analysis. Familiarity with Hadoop cluster environment and configurations for resource management for analysis work - Optional
• 2+ years of programming experience in PySpark for data processing and analysis - Optional
• Detail oriented. Excellent communication skills (verbal and written)
• Must be able to manage multiple priorities and meet deadlines
• Degree in Computer Science , Statistics, Mathematics, or related field
Required Skills : Python, SQL, Snowflake
Basic Qualification : -Must be local to Mclean, VA or willing to relo -Years of experience: 5+ -Degree in Computer Science , Statistics, Mathematics, or related field
Additional Skills : -Must be local to Mclean, VA or willing to relo -Years of experience: 5+ -Degree in Computer Science , Statistics, Mathematics, or related field
Background Check :Yes
Drug Screen :Yes
Notes :Staff Aug
Selling points for candidate :
Project Verification Info :
Candidate must be your W2 Employee :Yes
Exclusive to Apex :No
Face to face interview required :No
Candidate must be local :No
Candidate must be authorized to work without sponsorship ::No
Interview times set : :No
Type of project :Development/Engineering
Master Job Title :VMS Access Entry
Branch Code :DC Metro Commercial
Created: 2024-04-30
Reference: SG - 75508
Country: United States
State: Virginia
City: [mclean, Va, 22012]
Similar jobs:
-
Data Engineer & Solutions Integrator
Ascent Services Group in Alexandria, Virginia💸 $75 - $85 per hour -
Data Center Regional Mechanical Engineer (Field Engineering), Field Engineering
Amazon in Herndon, Virginia -
Data Center Regional Mechanical Engineer (Field Engineering), Field Engineering
Amazon in Herndon, Virginia -
Senior Data Engineer (Python, Scala, or Spark)
eSmartloan in Richmond, Virginia💸 $165100 - $188500 per year -
Senior Data Engineer
Deloitte in Rosslyn, Virginia -
Data Center Engineering Operations Tech, DCC Communities
Amazon in Ashburn, Virginia -
Data Center Regional Fire Protection Engineer (Field Engineering), DCC Communities
Amazon in Herndon, Virginia -
Controls Engineer, Deployment, Data Center Capacity Delivery
Amazon in Stafford, Virginia -
Data Center Engineering Operations Technician, DCEO
Amazon in Herndon, Virginia -
Data Center Regional Mechanical Engineer (Field Engineering), Field Engineering
Amazon in Herndon, Virginia -
Sr. Data Engineer
Esolvit, Inc. in Richmond, Virginia -
Voice and Data Engineer
Leidos Holding in Arlington, Virginia💸 $55250.00 per year -
Sr. Director, Data Engineering - Card Technology
eSmartloan in McLean, Virginia -
Principal Associate, Data Loss Prevention (DLP) Engineer
eSmartloan in McLean, Virginia💸 $165100 - $188500 per year -
Data Center Electrical Engineer
Google in Reston, Virginia -
Lead Software Engineer, Back End (Python, AWS, Data Streaming)
eSmartloan in Richmond, Virginia -
Senior Data Engineer
eSmartloan in Richmond, Virginia -
Electrical Field Engineer, Data Center Field Engineering
Amazon in Herndon, Virginia -
Data Center Operations Engineer - TS/SCI with Polygraph required
General Dynamics Corporation in Warrenton, Virginia -
Electrical Field Engineer, Data Center Field Engineering
Amazon in Herndon, Virginia