- Work closely with team members to ensure best practices and strategic goals are incorporated into development work.
- Collaborate with other engineering teams to identify and anticipate changing requirements and opportunities to improve the development environment.
- Monitoring at scale with VictoriaMetrics and the like.
- Orchestrating and managing with K8S and the like.
- Implementing best practices, challenging the status quo, and tab on industry and technical trends, changes, and developments to ensure the team is always striving for bestinclass work.
- Manage capacity, build security into every layer, and reduce cost.
- Implement secure networking, key management, user management, access management, process management, and image management.
- Effectively lead and manage team deliverable (short/long term) project planning and coaching, quarterly reviews, participation in the selection process for new hires, and technical and nontechnical guidance to the team.
- Proven experience in handling large infrastructure and distributed systems like Yarn, Kubernetes, Elasticsearch, Kafka, etc.
- Familiarity with Pythonrelated technologies and frameworks like Falcon, Django, or Pyramid.
- Experience with Unix/Linux operating systems internals and administration (e. g. filesystems, inodes, system calls, etc. ) or networking (e. g. TCP/IP, routing, network topologies, and hardware, SDN, etc. ).
- Familiarity with the cloud computing infrastructure, preferably Azure.
- Familiarity with task queue frameworks like Celery or Pika is a plus.
- Source code management and Implementation of security best practices.
- Deep understanding of modern software architectures, including loadbalancing, queueing, caching, distributed systems failure modes generally, microservices, and big data technologies.
- Knowhow in gathering metrics across distributed systems (instances/container) and generating automated notifications, and reports.
- Prowess in analyzing App bottlenecks, and performance degradation, and implementing automated processes/tools to detect such anomalies.
- Excellent programming (Python, Go, Ruby, or preferred scripting languages) and automation skills.
- Deep understanding of container orchestration technologies
- Kubernetes.
- Should have had prior experience in migrating high throughput services to Kubernetes.
- Expertise in any CI/CD tools build, artifact, packaging, and service discovery management tools. Gitops preferred.
- Expertise in skillsets for centralized logging systems, metrics, and tooling frameworks such as ELK, Prometheus/VictoriaMetrics, and Grafana.
- Great communication, interpersonal, and teamwork skills.
- Experience with AWS/Azure cost explorer, billing analysis, and various cost optimization techniques.
- Awareness of Cloud Security concepts.
- Awareness of Information Security Concepts and Best Practices.
- AWS/Azure cloud certification preferred.
- Certification in Kubernetes Administrator (CKA).
- Certification in Kubernetes Application Developer (CKAD).
- Experience with configuration management tools and strong code analysis skills in Python.
- Experience in working with APMbased tools like New Relic.
-
Site Reliability Engineer
4 days ago
Waytogo Consultants Bangalore, India permanentJob Description : · As an SRE Lead (Site Reliability Engineering Lead), you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and services. · He/ She will lead a team of SREs (Site Reliability Engineers) and collaborate closely wit ...
-
Site Reliability Engineer
1 week ago
Integra Connect Bangalore Urban, IndiaAbout IntegraConnect · Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud plat ...
-
Site Reliability Engineer
4 days ago
Jobeefie pvt ltd Bangalore, India permanentResponsibilities : · - Establish instrumentation to measure SLI (Service Level Indicators), define SLO (Service Level Objectives), Alerting mechanisms, review with Stakeholders · - Ensure the reliability, scalability and performance of our cloud-based systems and On-Prem Systems. ...
-
Senior Reliability Engineer
2 days ago
Zyoin group Bangalore, India permanentWe are seeking a highly skilled and experienced Senior Reliability Engineer to join our team in Bangalore and play a crucial role in ensuring the reliability, scalability, and performance of our critical software systems. · Location : Bangalore (Hybrid) · Responsibilities : · - D ...
-
Site Reliability Engineer
4 days ago
Zyoin group Bangalore, India permanentResponsibilities : · - Work with the Kubernetes, and Service Mesh team to manage our growing fleet of clusters globally, across multiple Cloud providers. · - Work with the Development Tools and Service Mesh teams to implement and measure SLAs, SLOs, and MTTD/R for our services fa ...
-
Site Reliability Engineer
1 week ago
Signify Netherlands B.V. Bangalore, India Full timeSite Reliability Engineer · Signify, the new company name of Philips Lighting, is the global leader in lighting building on 125+ years of innovations. Our purpose is to unlock the extraordinary potential of light for brighter lives and a better world. · We are proud to be ahead o ...
-
Site Reliability Engineer
6 days ago
Qualcomm Bangalore, India Paid WorkCompany: · Qualcomm India Private Limited · Job Area: · Information Technology Group, Information Technology Group > IT Software Engineer · Qualcomm Overview: · Qualcomm is a company of inventors that unlocked 5G ushering in an age of rapid acceleration in connectivity and new po ...
-
Site Reliability Engineer
1 week ago
Edge In Asia Recruitment Private Limited Bangalore, India permanentOur client is a global investment banking company having its headquarters in the US with employees globally and is looking for a SRE to join their Bangalore regional team. · Note : Looking for candidates who can join within 30days · Work Mode : Hybrid · Experience : 7 to 10 year ...
-
Site Reliability Engineer
1 day ago
Magna International Inc. Bangalore, IndiaJob Number: 65448 · Group: Magna Corporate · Division: Magna Corporate R&D India · Job Type: Permanent/Regular · Location: BANGALORE · Work Style: · About us · We see a future where everyone can live and move without limitations. That's why we are developing technologies, s ...
-
Lead Site Reliability Engineer
4 days ago
The HRBPs Bangalore, India permanentLead Site Reliability Engineer - Bangalore · Exp - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing customers ...
-
Senior Site Reliability Engineer
4 days ago
Dashhire Bangalore, India permanentActively seeking a Senior Site Reliability Engineer (Senior SRE) to elevate the reliability, scalability, and performance of our cloud management platform. · You will be at the forefront of developing and maintaining sophisticated tools and systems to automate and optimize the ma ...
-
Lead Site Reliability Engineer
4 days ago
The HRBPs Bangalore, India permanentLead Site Reliability Engineer - Bangalore · Experience - 8 to 12 years · Responsibilities : · - Collaborating with customer success managers and solutions engineers to bring deep technical expertise in implementing intelligent automation solutions for customers. · - Providing cu ...
-
Senior Site Reliability Engineer
4 days ago
Orange Shark Bangalore, India permanentPosition : Senior Site Reliability Engineer (SRE) · Experience : 5+ Years · Location : Bangalore (Hybrid) · Notice Period : Immediate to 30 Days Notice · Job Description : · Key Responsibilities : · - Design and implement sophisticated automation solutions for the management and ...
-
Site Reliability Engineer II
4 days ago
Meesho Bangalore, India permanentSite Reliability Engineer II · Bangalore, Karnataka Tech Infrastructure /Full Time Employee /On-Site · About the Team : · When 5% of Indian households shop with us, its important to build resilient systems to manage millions of orders every day. Weve done this with zero downtime ...
-
Site Reliability Engineer- On Premises
3 days ago
Smarsh Bangalore Urban, IndiaSmarsh is the leader in communications compliance, archiving, and analytics. We provide compliance across the broadest set of communications channels with insights on what's being captured. Smarsh customers manage over 500 million daily conversations across 80 channels and growin ...
-
Senior Network Reliability Engineer
4 days ago
Winfort Services Pvt ltd Bangalore, India permanentJob Description : · Building and maintaining network monitoring, orchestration and automation solutions, including automated inventory reconciliation and remediation, · - Monitor the performance of our network infrastructure and develop automated solutions to address any issues. ...
-
Site Reliability Engineer III
4 days ago
Novopay Bangalore, India permanentAbout Trustt : · Trustt (formerly Novopay) was founded by Srikanth Nadhamuni (Founder CTO Aadhaar) and Gautam Bandyopadhyay (a FinTech industry veteran and former Head of Finacle Innovation Hub at Infosys). Vinod Khosla, the legendary silicon-valley venture capitalist, is our chi ...
-
Site Reliability Engineer III
5 days ago
Grizmo Labs Bangalore, India permanentResponsibilities : · - Architect and deploy scalable infrastructure and platform services (Monitoring, logging, etc) with a focus on simplicity and automation. · - Own the performance and reliability of backend services, data pipelines, platform services, etc, and work with devel ...
-
Senior Site Reliability Engineer
4 days ago
TalentXo Bangalore, India permanentRole & Responsibilities : · - As a member of the cloud engineering team you get to build the cloud infrastructure in which our Jobvite applications run · - You will get to build the tools that monitor, deploy and manage our web applications and backend systems · - Participate in ...
-
Wipro Bangalore Urban, IndiaPrincipal Site Reliability Engineer · We are seeking a highly skilled and experienced Principal Site Reliability Engineer (SRE) to join Lab45 team in Wipro. As a Principal SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems ...
Lead Site Reliability Engineer - Bangalore, India - GetHyr
Description
Job Description :
Maintain services once they are live by measuring and monitoring availability, latency, and overall system reliability.
Mandatory Skills :
years of Experience on the AWS/Azure platform.