Data Engineer II - Bengaluru, India - Honeywell

    Description

    Responsibilities
    · Work on complex data science and analytics projects in support of the Commercial organization.
    · Work with the product owner to identify data requirements, and design, maintain, and optimize data pipelines that ingest, transform, and load structured and unstructured data from various sources into the data warehouse or data lake.
    · Design and implement data models and schemas to support analytical and reporting requirements.
    · Collaborate with data scientists and analysts to define and structure data for effective analysis and reporting.
    · Develop and maintain ETL (Extract, Transform, Load) processes.
    · Administer, optimize, and manage databases, data warehouses, and data lakes to ensure performance, reliability, and scalability.
    · Enforce data governance policies, standards, and best practices to maintain data quality, privacy, and security.
    · Create and maintain comprehensive documentation for data architecture, processes, and systems.
    · Troubleshoot and resolve data-related problems and optimize system performance.
    · Partner with the IT support team on production processes, continuous improvement, and production deployments.

    YOU MUST HAVE
    · Two or more years of relevant experience in Data Engineering, ETL Development, Database Administration.
    · Experience in Azure Databricks, CI/CD, and DevOps processes.
    · Expertise in scripting and query languages such as Python, SQL, and PySpark.
    · Experience with both structured and unstructured data.
    · Experience in Snowflake.
    · SFDC business/technical knowledge.
    · Knowledge of Agile development methodology.

    WE VALUE
    · Experience working with at least one NoSQL system (HBase, Cassandra, MongoDB).
    · Knowledge of databases, data warehouse platforms (Snowflake), and cloud-based tools.
    · Experience in using data integration tools for ETL processes.
    · Knowledge of data modelling techniques, including schema design for both relational and NoSQL databases.
    · Understanding of the Hadoop ecosystem (including HDFS) and Spark for processing and analyzing large-scale datasets.
    · Demonstrated experience with cutting-edge packages and tools such as SciKit, TensorFlow, PyTorch, GPT, PySpark, and Bitbucket.
    · Ability to develop and communicate a technical vision for projects and initiatives that can be understood by customers and management.
    · Proven mentoring ability to drive results and technical growth in peers.
    · Effective communication skills (verbal, written, and presentation) for interacting with customers and peers.
    · Demonstrated application of statistics, statistical modeling, and statistical process control.