Job Description
- Job Description:
- Ensuring maximum uptime for our global infrastructure
- Automating processes and building tools to improve operational efficiency
- Coordinating operational work across teams to improve the platform with minimal impact
- Requirements:
- Solid experience with Linux operating systems or Networking and day to day upkeep
- Familiarity with virtualization technologies and troubleshooting virtual machine instances
- Familiarity with containerization technologies and troubleshooting containers
- Familiarity with IPv4 Networking and troubleshooting (CCNA equivalent)
- Basic storage concepts and technologies
- Experience with monitoring systems and incident management
- Experience scripting in one or more of the following languages: Bash, Python, or Go
- Experience with GPU hardware or AI/ML, and Kubernetes
- A passion for good documentation and open communication
- Proven ability to learn!
- Benefits:
- Competitive salary
- Flexible time off policy
- Employee Assistance Program
- Reimbursement for relevant conferences, training, and education
- Access to LinkedIn Learning's 10,000+ courses
- Employee Stock Purchase Program
- Equity grants
Apply tot his job
Apply To this Job