Associate Site Reliability Engineer

๐ŸŒ Remote, USA ๐ŸŽฏ Full-time ๐Ÿ• Posted Recently

Job Description

Why We Need You! Site Reliability Engineering (SRE) is a growing team that partners closely with Product Engineering, Security, and Support. We are responsible for the reliability, deployment, and continuous operation of the Ivanti Cloud services. We need your help to take our existing platform to the next level with observability, release automation, chaos engineering, and more. The Associate SRE role is a blend of infrastructure, networking, operating systems, automation, development, and application administration. It is a hands-on technical position in a fast-paced atmosphere. The ideal candidate has prior experience managing cloud-based SaaS applications and strives to solve traditional operations problems through automation and software. More so, the candidate must possess a high standard of excellence, have a strong customer focus, and is capable of technical deep dives into code, app servers, databases, load balancers, operating systems, and networks. What You Will Be Doing ยท Work Monday through Friday, approximately 2 PM to 11 PM U.S. Pacific Time, including lunch) ยท US citizenship and must be located domestically in the U.S. ยท Deploying, managing, and securing Ivantiโ€™s production Software-as-a-Service (SaaS) environments in AWS and Azure ยท Working with geographically dispersed, cross-departmental teams to solve difficult problems ยท Automating common and repetitive tasks ยท Write documentation and training material ยท Train other colleagues. ยท Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolution To Be Successful in The Role, You Will Have ยท A BSc in Computer Science, a related field, or equivalent practical experience ยท 3+ years of relevant industry experience (2+ with an achieved BSc in Computer Science or Equivalent Degree) ยท Proficiency with Python and experience with one of the following languages: o Java o Golang o C# ยท Proficiency working with Bash or PowerShell programmatically ยท Familiarity with public cloud platforms (AWS or Azure preferred) ยท Experience troubleshooting Java and .NET applications ยท Experience troubleshooting network and storage infrastructure issues ยท Experience working with core Linux distributions (Debian, RHEL, SUSE, Slackware) Experience working with Windows ยท Experience working with one or more: SQL Server, PostgreSQL, Redis, Kafka, MongoDB, Elasticsearch, or similar ยท Ability to configure and fine tune at least one: HA Proxy, Apache, Nginx, IIS, or similar ยท Ability to configure: New Relic, DataDog, Splunk, or similar monitoring tools ยท Familiarity with container orchestration technologies (AWS EKS or Azure AKS preferred) ยท Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions ยท Proficiency working and developing Infrastructure as Code (IaC) ยท A desire to adopt and implement emergent technologies and best practices ยท Strong verbal and written communication skills in English for the purposes of global collaboration โ€˜Nice-to-havesโ€™ include: ยท Prior experience as a Site Reliability Engineer or DevOps Engineer ยท Certificates in one or more of the following categories, or demonstrated certificate-equivalent knowledge: o Cloud Development and architecture o Kubernetes Administration o Linux Administration o Software engineering disciplines ยท Experience with compliance frameworks such as SOC 2 Type 2, ISO-27001, FedRAMP, or IRAP and privacy regulations such as GDPR and PIPEDA Roadmap for Success 90 Days: ยท Onboarding and role-training is complete ยท You're building foundational knowledge of the SRE-run product portfolio ยท You hold general knowledge of how SRE manages our SaaS environments ยท You've gotten to know the team and are building relationships with SRE peer teams 6 Months: ยท Self-sufficiency in core job functions and existing processes ยท Participating in SRE on-call rotations ยท Contributing to handling SRE tickets to fulfillment and responsible for individual SRE tasks ยท Active participation in SRE stability discussions with direct interaction with SRE peers 1 Year: ยท Contribute independently to improve reliability and compliance in our SaaS environments ยท Demonstrate ownership of SRE ticket management including triage and resolution ยท Lead one or more well-defined projects under guidance from Senior SRE members ยท Identify areas where performance, scalability, security, and reliability can be improved in production systems and environments #LI-Remote Apply tot his job

Ready to Apply?

Don't miss out on this amazing opportunity!

๐Ÿš€ Apply Now

Similar Jobs

Recent Jobs

You May Also Like