No more applications are being accepted for this job
- Implement and maintain monitoring solutions using Dynatrace and/or IBM Instana to monitor the performance and availability of our systems and applications.
- Configure and customize monitoring dashboards, alerts, and reports to provide actionable insights and ensure proactive incident detection and resolution.
- Collaborate with development and operations teams to identify performance bottlenecks, troubleshoot issues, and optimize system performance.
- Conduct root cause analysis of incidents and implement preventive measures to minimize the risk of recurrence.
- Automate repetitive tasks and processes using scripting languages (e.g., Python, Bash) and configuration management tools (e.g., Ansible, Puppet).
- Participate in capacity planning, performance testing, and disaster recovery planning activities to ensure scalability, reliability, and resilience of our systems.
- Stay updated with industry trends and best practices in SRE, monitoring, and observability, and contribute to continuous improvement Bachelor's degree in Computer Science, Information Technology, or related field.
- 5 to 7 years of experience as a Site Reliability Engineer (SRE) or similar role, with a focus on service monitoring and observability.
- Handson experience with Dynatrace and/or IBM Instana for monitoring distributed systems, microservices, and cloud environments.
- Proficiency in scripting languages such as Python, Bash, or PowerShell for automation and tooling.
- Strong understanding of cloud computing platforms (e.g., AWS, Azure, Google Cloud) and containerization technologies (e.g., Docker, Kubernetes).
- Solid knowledge of DevOps principles, CI/CD pipelines, and infrastructure as code (IaC) concepts.
- Excellent problemsolving and troubleshooting skills, with the ability to analyze complex systems and identify performance bottlenecks.
- Strong communication and collaboration skills, with the ability to work effectively in crossfunctional teams and communicate technical concepts to nontechnical stakeholders.
- Certification in Dynatrace or IBM Instana.
- Experience with other monitoring and observability tools such as Prometheus, Grafana, ELK Stack, or New Relic.
- Familiarity with Agile methodologies and tools (e.g., Jira, Confluence).
- Knowledge of security best practices and compliance standards (e.g., GDPR, PCI-DSS).
Leuwint Technologies - Delhi, India - Leuwint Technologies Pvt Ltd
Description
Responsibilities :