Director, Site Reliability Engineering (BB-8C15E)
Found in: Talent IN
Description:

The team is responsible for driving service uptime and quality in 24x7 environments and for meeting client expectations of accessible data services. The team will enhance monitors, troubleshoot Windows and Linux applications, support AWS infrastructure, execute tools and automation, and influence the design of the future platform architecture. Additionally, the team will develop support standards for all applications and adhere to those standards to provide the necessary level of production SLAs.

The Avalara product suite is a mix of Windows/SQL environments and a wide variety of open source applications running in a Linux environment. The role supports multiple projects and requires interaction and partnership with teams and personnel across multiple business locations, skill sets, and backgrounds. As such, the position requires strong communication skills in addition to a solid foundation of technical skills, analytical abilities, and end-to-end troubleshooting techniques.

Essential Operational Engineering Skills
- Recommend application changes to improve application performance.
- Work with Development to transition applications from one platform to another when called for by the business.
- Cloud services experience with AWS or Azure.
- Experience troubleshooting applications running on a Windows .NET stack using IIS and on Linux-based open source stacks in an AWS environment. You must have strong experience in one and a willingness to learn the other.

Other Duties
- Provide technical escalation and share in the on-call rotation.
- Work independently while also being a strong team player, as required for the project at hand.
- Review existing processes and recommend changes or institute new processes as necessary, including in the areas of monitoring, upgrades, and tuning.
- Generate high-quality project documentation, such as architecture designs, implementation plans, design documents, and test plans.
- Participate in the 24/7 on-call rotation.

Technical Qualifications
- At least 7 years' experience in a SaaS operating environment.
- Deep expertise in the mentality, processes, and tools needed to deliver five-nines SLAs.
- Ability to communicate with and influence indirect engineering teams through documentation, brown bags, training, etc.
- Experience with CI/CD processes.
- Strong working knowledge of Windows or Linux and its underlying components, system statistics, performance tuning, file systems, and I/O.
- Experience with either .NET or Linux in AWS.
- Scripting experience in PowerShell, Python, or Perl and a history of automating workloads.
- Experience with production deployment, monitoring, and operational support for enterprise-class applications.
- Experience in performance diagnostics, capacity planning, performance architecture design, performance tuning, and performance monitoring.
- Experience working with load-balanced, high-traffic solutions and services.
- Experience in SQL and database services.
- BS in Computer Science or STEM equivalent with 5-8 years of relevant work experience.
- Good verbal and written communication skills.

Technologies you are likely to be working with
AWS and AWS PaaS and SaaS technologies, GitLab, Atlassian suite, PowerShell, Python, Terraform and the HashiCorp suite, Jenkins, C#, SQL, Linux, containerization technologies, etc.

Preferred Qualifications
- A can-do attitude: no problem is too big or too small.
- A desire to delight customers.
- A systematic problem solver with the ability to think outside the box.
- Good data analysis skills to pick up trends before they become major problems.
- Previous experience as an enterprise-class Site Reliability Engineer.
- A strong mix of Software Engineering and Operations Support skills.
- Eager to learn new technologies and programming languages.

Posted: 1 day ago