-
Data Scientist
5 days ago
Cloud Raptor Bangalore Metropolitan Area, IndiaAtCloud Raptor , we specialise in providing scalable and efficient solutions to businesses across · many industries. Focused on cloud technologies and technical expertise, we help organisations · navigate the ever-changing landscape of the digital era. · With dedicated Centres of ...
-
Associate Principal – Data Engineering
6 days ago
MathCo Bangalore Metropolitan Area, IndiaAs an Associate Principal – Data Engineering, you'll have an opportunity to work on the universe of data and solve some very interesting problems by creating and maintaining scalable data pipelines dealing with petabytes of data. All our projects entail working on cutting edge te ...
-
Biocon Biologics Bangalore Metropolitan Area, IndiaAbout the company: · Biocon Biologics is a subsidiary of Biocon Ltd, an innovation led global biopharmaceuticals company. Biocon Biologics is engaged in developing high quality, affordable biosimilars that can expand access to a cutting-edge class of therapies to patients globall ...
Lead Databricks Engineer - Bangalore Metropolitan Area, India - Tredence Inc.
Description
About Tredence:
Tredence is a global analytics services and solutions company. We are one of the fastest growing private companies in the country for three straight years according to the Inc. 5000 and we continue to set ourselves apart from our competitors by attracting the greatest talent in the data analytics and data science space. Our capabilities range from Data Visualization, Data Management to Advanced analytics, Big Data and Machine Learning. Our uniqueness is in building Scalable Big Data Solutions on Onprem/GCP/Azure cloud in a very cost effective and easily scalable manner for our clients. We also come in with some strong IP and pre-built analytics solutions in data mining, BI and Big Data.
The candidate must understand the usage of data Engineering tools for solving business problems and help clients in their data journey. Must have knowledge of emerging technologies used in companies for data management including data governance, data quality, security, data integration, processing, and provisioning. The candidate must possess required soft skills to work with teams and lead medium to large teams.
Candidate should be comfortable with taking leadership roles, in client projects, pre-sales/consulting, solutioning, business development conversations, execution on data engineering projects.
Mandatory Skills : Azure Databricks, Azure Data Factory, Azure Delta Lake, Pyspark.
Experience Range - 5 years to 12 years.
Location - Bangalore, Chennai, Gurgaon, Pune, Kolkata
Role Description:
● Developing Modern Data Warehouse solutions using Databricks and Azure Stack
● Ability to provide solutions that are forward-thinking in data engineering and analytics space
● Collaborate with DW/BI leads to understand new ETL pipeline development requirements.
● Triage issues to find gaps in existing pipelines and fix the issues
● Work with business to understand the need in reporting layer and develop data model to fulfill reporting needs
● Drive technical discussion with client architect and team members
● Orchestrate the data pipelines in scheduler via Airflow Skills and Qualifications:
● Bachelor's and/or master's degree in computer science or equivalent experience.
● Must have total 9+ yrs. of IT experience experience in Data warehouse/ETL projects.
● Deep understanding of Star and Snowflake dimensional modelling.
● Strong knowledge of Data Management principles
● Good understanding of Databricks Data & AI platform and Databricks Delta Lake Architecture
● Should have hands-on experience in SQL, Python and Spark (PySpark)
● Candidate must have experience in Azure stack
● Desirable to have ETL with batch and streaming (Kinesis).
● Experience in building ETL / data warehouse transformation processes
● Experience with Apache Kafka for use with streaming data / event-based data
● Experience with other Open-Source big data products Hadoop (incl. Hive, Pig, Impala)
● Experience with Open Source non-relational / NoSQL data repositories (incl. MongoDB, Cassandra, Neo4J)
● Experience working with structured and unstructured data including imaging & geospatial data. ● Experience working in a Dev/Ops environment with tools such as Terraform, CircleCI, GIT.
● Proficiency in RDBMS, complex SQL, PL/SQL, Unix Shell Scripting, performance tuning and troubleshoot
● Databricks Certified Data Engineer Associate/Professional Certification (Desirable).
● Comfortable working in a dynamic, fast-paced, innovative environment with several ongoing concurrent projects
● Should have experience working in Agile methodology
● Strong verbal and written communication skills.
● Strong analytical and problem-solving skills with a high attention to detail.
Mandatory Skills
Azure Databricks, Azure Data Factory, Azure Delta Lake, Pyspark.
Job Location - Bangalore , Chennai , Pune , Gurgaon , Kolkata
Notice Period Preferred - Immediate to 30 days