Site Reliability Engineer - Hyderabad, India - Ampcus Tech Pvt. Ltd

    Ampcus Tech Pvt. Ltd Hyderabad, India

    2 weeks ago

    Full time
    Description

    Job Title: Site Reliability Engineer (SRE)

    Work Location: Hyderabad (work from office)

    Type: Full-time, Permanent

    Notice Period: 30 Days

    Experience: 5+ Years

    JD:

    Responsibilities:

    Build and maintain our cloud platforms and support applications, demonstrating agile and dynamic application support capabilities.

    Contribute to our continuous improvement and continuous delivery while increasing the maturity of DevOps practices.

    Contribute to developing and implementing automated DevOps capability for our application.

    Take part in design discussions and provide input on designing a fully automated, robust, and secure infrastructure.

    Collaborate closely with other internal technical teams and business users on investigation, testing, and deployments.

    Handle Release Management, including raising Change Requests and scheduling the implementation of fixes and enhancements.

    Work effectively in collaboration with different teams, whether local or remote.

    Work towards 100% availability of our applications by putting the right monitoring in place.

    Support our production environment with strong performance tuning, end-to-end troubleshooting, and networking fundamentals skills.

    Be willing to work efficiently during events to make sure each event is a success.

    Skills:

    Minimum 5-7 years of experience as a DevOps engineer supporting cloud platforms such as AWS and GCP, IaC tools such as Terraform, and CI/CD.

    Ensure that software packages and programs are well documented and have reasonable test coverage.

    Root cause analysis, management communication and client relationship management in partnership with Infrastructure Service Support team members.

    Ensure all production changes are made in accordance with life-cycle methodology and risk guidelines.

    Application support and deployment of releases, patches, and fixes on the platform.

    Analyze application performance, perform tuning, and ensure high availability and stability of the platform.

    Knowledge of Batch Processing systems and tools

    Knowledge of Unix/Linux systems, containerization, and container orchestration tools and platforms (e.g., Docker, Cloud Foundry, OpenShift, Kubernetes).

    Strong scripting skills (shell, Python) and the ability to automate manual tasks that can easily be converted to a script.

    Familiarity with observability tools such as Grafana, Kibana, and AppDynamics.

    Experience with AWS/GCP public cloud services.

    Hands-on experience with any of the CI/CD tools (e.g., Jenkins, CircleCI, GitHub Actions) and the ability to understand and define different deployment strategies.

    Hands-on experience with Git, including managing deployments and branching within Git.

    Good understanding of YAML and JSON.