- Install, configure, and manage Apache Spark (open-source) clusters on Ubuntu, including Spark master/worker nodes and Spark environment files.
- Configure and manage Spark UI and Spark History Server for monitoring jobs, analyzing DAGs, stages, tasks, and troubleshooting performance.
- Develop, optimize, and deploy PySpark ETL/ELT pipelines using DataFrame API, UDFs, window functions, caching, partitioning, and broadcasting.
- Deploy PySpark jobs using spark-submit in client/cluster mode with proper logging and error handling.
- Install, configure, and manage Apache Airflow including UI, scheduler, webserver, connections, and variables.
- Create, schedule, and monitor Airflow DAGs for PySpark jobs using SparkSubmitOperator, BashOperator, or PythonOperator.
- Configure and manage cron jobs for scheduling data processing tasks where needed.
- Install, configure, and optimize Trino (PrestoSQL) coordinator and worker nodes; configure catalogs such as S3, MySQL, or PostgreSQL.
- Maintain Linux/Ubuntu servers including services, logs, environment variables, memory usage, and port conflict resolution.
- Design and implement scalable data architectures using Azure Data Services including ADF, Synapse, ADLS, Azure SQL, and Databricks.
- Develop, manage, and automate ETL/ELT pipelines using Azure Data Factory (Pipelines, Mapping Dataflows, Dataflows).
- Monitor, troubleshoot, and optimize data pipelines across Spark, Airflow, Trino, and Azure platforms.
- Work with structured, semi-structured, and unstructured data across multiple data sources and formats.
- Implement data analytics, transformation, backup, and recovery solutions.
- Perform data migration, upgrade, and modernization using Azure and database tools.
- Implement CI/CD pipelines for data solutions using Azure DevOps and Git.
- Ensure data quality, governance, lineage, metadata management, and security compliance across cloud and big data environments.
- Design and optimize data models using star and snowflake schemas; build data warehouses, Delta Lake, and Lakehouse systems.
- Develop and rebuild reports/dashboards using Power BI, Tableau, or similar tools.
- Collaborate with internal teams, clients, and business users to gather requirements and deliver high-quality data solutions.
- Provide documentation, runbooks, and operational guidance.
1. Apache Spark (Open Source) & PySpark - Must
- Apache Spark installation & cluster configuration (Ubuntu/Linux)
- Spark master/worker setup (standalone & cluster mode)
- Spark UI & History Server configuration and debugging
- PySpark development (ETL pipelines, UDFs, window functions, DataFrame API)
- Performance tuning (partitioning, caching, shuffles)
- spark-submit deployment with monitoring and logging
- Airflow installation & configuration (UI, scheduler, webserver)
- Creating and scheduling DAGs (SparkSubmitOperator, BashOperator, PythonOperator)
- Retry logic, triggers, alerting, and log management
- Cron job scheduling & process automation
- Trino coordinator & worker node setup
- Catalog configuration (S3, RDBMS sources)
- Distributed SQL troubleshooting & performance optimization
- Azure Data Factory
- Azure Synapse Analytics
- Azure SQL / Cosmos DB
- Azure Data Lake Storage (Gen2)
- Azure Databricks (Delta, Notebooks, Jobs)
- Azure Event Hubs / Stream Analytics
- Lakehouse
- Warehouse
- Dataflows
- Notebooks
- Pipelines
- Python
- PySpark
- SQL
- Scala
- Star schema modeling
- Snowflake schema modeling
- Fact/dimension modeling
- Data warehouse & Lakehouse design
- Delta Lake / Lakehouse architectures
- Git / GitHub / Azure Repos
- Azure DevOps pipelines (CI/CD)
- Automated deployment for Spark, Airflow, ADF, Databricks, Fabric
- Power BI
- Tableau
- Report building, datasets, DAX
- Shell scripting
- Service management
- Logs & environment variables
- Excellent problem solving and communication skills
- Able to work well in a team setting
- Excellent organizational and time management skills
- Taking end-to-end ownership
- Production support & timely delivery
- Self-driven, flexible and innovative
- Microsoft Certified: Azure Data Engineer Associate (DP-203 / DP-300)
- Knowledge of DevOps and CI/CD pipelines in Azure
- BSc/BA in Computer Science, Engineering or a related field
Sr. Data Engineer - Vadodara - Exigo Tech
Description
Exigo Tech is a Sydney-based Technology Solutions Provider that delivers solutions across three major verticals (Infrastructure, Cloud, and Application) to businesses across Australia. We help companies reach operational efficiencies by empowering them with technology solutions that drive their business processes.
Exigo is looking for a full-time Sr. Data Engineer.
Roles and Responsibilities
Technical Skills:
2. Apache Airflow & Job Orchestration - Must
3. Trino (PrestoSQL) - Must
4. Azure Data Services (Nice to have)
5. Microsoft Fabric (Nice to have)
6. Programming & Querying
7. Data Modeling & Warehousing
8. DevOps & CI/CD
9. BI Tools (Nice to have)
10. Linux/Ubuntu Server Knowledge
Soft Skills:
Education:
Work Location: Vadodara, Gujarat, India
