Site Reliability Engineer - Hyderabad, India - SID Global Solutions
Description
Experience in Cloud administration and troubleshooting(GCP is recommended) or AWS or AZURE
• Experience with Kubernetes or comparable technology.
• APIGEE OR API Management experience is Mandatory.
• Experience with CI/CD technologies like Jenkins, GitLab
• Experience with observability stacks like Dynatrace, Prometheus, ELK stack.( Create and manage alert )
• Understanding of Linux system and logging
• Expertise in AWS/GCP cloud infrastructure desirable
• Strong technical analytical, troubleshooting, and problem-solving skills.
• Strong sense of ownership, urgency, and drive Excellent verbal, written, communication and receptive listening skills (English).
• Respond to cases within SLA, with the appropriate level of urgency. Escalate issues to internal teams promptly when required. Familiarity with Jira will be a plus.
• AWS/GCP Certifications are highly desirable.
• An understanding of End-to-End business infrastructure components and maintain their health.
• Eager to learn new technology
• Should be a good team player Job Responsibilities:
• Provide world class customer service experience both technically and with soft skills.
• Assist customers troubleshooting cloud services.
• First point of contact for internal cloud platform users. Apply technical acumen and customer-facing skills to effectively represent the public cloud team.
• Manage the lifecycle of incidents, acting as the single point of contact for the customer ,impact analysis, internal stakeholder coordination and communication.
• Implement key post escalation actions from post incident reports.
• Follow best practices of incident response and site reliability.