Data Engineer (2-3 YEARS) - 5 POSITIONS - Hyderabad, India - Rightpath Computer Technologies Pvt Ltd

    Description

    Role:

    Data Engineer (2-3 YEARS) - 5 POSITIONS

    Responsibilities:

    The Data Engineer will work on collecting, storing, processing, and analyzing huge sets of data. The primary focus will be on choosing optimal solutions for these purposes, then implementing, maintaining, and monitoring them.

    You will also be responsible for integrating them with the architecture used across the company in various products.


    Key Skills:

    1. Proficient understanding of distributed computing principles
    2. Ability to build, run and manage large clusters
    3. Hadoop v2, MapReduce, HDFS
    4. Java, Python
    5. Large-scale crawling: Scrapy, Nutch and custom crawling solutions
    6. Experience with Apache Solr/Lucene
    7. NoSQL databases, such as MongoDB, HBase, Cassandra
    8. Knowledge of various ETL techniques and frameworks, such as Flume
    9. Experience with NLP tools and systems for POS, NER, and information extraction
    10. Experience with Machine Learning: Regression, Classification, Decision Trees
    11. Experience with Linux / AWS

    Key Technologies:

    – Web Scraping
    – NLP
    – NoSQL (MongoDB, Cassandra)
    – Python/Java, R
    – Machine Learning (Basics)
    – Big Data Tools (Hadoop, Spark, Pig, Hive)