Engineer_Java Spark_Neustar - Hyderabad / Secunderabad, Telangana - confidential
Description
Job Summary:
We are seeking a skilled Java Spark Engineer to join our Data & Analytics team. The ideal candidate will work on building next-generation data products on Hadoop and Spark platforms, handling large-scale datasets, and collaborating with cross-functional teams to deliver high-quality data solutions.
Roles & Responsibilities:
Data Processing & Transformation:
- Prototype and implement data transformations and entity resolutions on petabyte-scale data using Hadoop and Spark clusters.
- Debug and optimize Spark code for performance improvements.
Machine Learning Support:
- Understand, code, and debug ML models in Spark for analytics and data processing workflows.
Data Quality & Analytics:
- Analyze data errors and identify root causes to improve overall data quality.
- Work with relational, NoSQL, and graph databases to extract, transform, and query data efficiently.
Collaboration & Communication:
- Collaborate closely with teams generating and consuming data to ensure accurate and reliable data outputs.
- Communicate findings and solutions clearly, both in writing and orally, to stakeholders and team members.