Data Analyst - Bangalore, India - Varite India

    Default job background
    permanent Technology / Internet
    Description

    About the job :

    Company Name :
    VARITE India Private Limited


    About The Client :

    • A global information technology, consulting, and business process services company, headquartered in India, provides a broad spectrum of services, including IT consulting, application development, business process outsourcing, and digital solutions.
    • Serving clients across diverse industries and in over 167 countries, the company offers technologydriven solutions to enhance efficiency and innovation.
    • With a global presence, it has emerged as a key player in the IT services and consulting space, contributing to the digital transformation of businesses worldwide.

    Qualifications :

    • Acquiring data and their access from primary or secondary data sources .
    • Proven working experience as a Data Analyst or Business Data Analyst
    • Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy
    • Good in documentation Skills
    Good Communication Skills

    Technology :

    • SQL(5+ Years Experience)
    • PYSPARK (5+ Years Experience)

    Roles and Responsibilities :

    Data Collection and Extraction :

    • Utilize PySpark to extract and collect data from various sources such as databases, data lakes, and streaming platforms.
    • Write complex SQL queries to retrieve relevant data from structured and unstructured datasets.

    Data Cleaning and Preprocessing :

    • Perform data cleaning and preprocessing using PySpark transformations and SQL queries to ensure data quality and consistency.
    • Handle missing values, outliers, and data anomalies effectively.

    Data Analysis and Interpretation :

    • Use PySpark and SQL to conduct exploratory data analysis (EDA) to identify patterns, trends, and anomalies in the data.
    • Apply statistical analysis and hypothesis testing techniques to derive actionable insights from the data.

    Data Transformation and Modeling :

    • Transform raw data into meaningful insights through data manipulation techniques using PySpark.
    • Develop and implement data models and algorithms to support analytical requirements.

    Data Visualization :

    • Create visualizations using PySpark and SQL results to present analysis findings effectively.
    • Utilize visualization tools like Matplotlib, Seaborn, or Tableau for creating insightful visual representations of data.

    Reporting and Documentation :

    • Prepare comprehensive reports and documentation summarizing data analysis findings and insights.
    • Document data analysis processes, methodologies, and results for reproducibility and future reference.

    Collaboration and Communication :

    • Collaborate with crossfunctional teams, including data engineers, business analysts, and stakeholders, to understand business requirements and deliver insights.
    • Communicate complex technical concepts and analysis results to nontechnical stakeholders in a clear and understandable manner.

    Data Governance and Quality Assurance :

    • Ensure adherence to data governance policies and standards in data handling and analysis.
    • Conduct quality assurance checks to validate data accuracy, completeness, and integrity.

    Continuous Learning and Improvement :

    • Stay updated with the latest advancements in PySpark and SQL technologies and methodologies.
    • Continuously improve skills in data analysis, programming, and statistical techniques.

    Problem Solving and Decision Making :

    • Identify and solve complex data analysis problems using PySpark and SQL.
    • Make datadriven decisions and provide actionable recommendations to support business objectives.

    Project Management :

    • Manage multiple data analysis projects simultaneously, ensuring timely delivery and quality outcomes.
    • Prioritize tasks and allocate resources effectively to meet project deadlines.

    Training and Mentorship :

    • Provide guidance, training, and mentorship to junior data analysts on PySpark and SQL skills and best practices.
    )