Level 1 (L1) Cloud Operations Specialist

🌍 Remote, USA 🎯 Full-time 🕐 Posted Recently

Job Description

Knox Systems, Inc. is a leading provider of secure cloud and AI environments for the U.S. government. The Cloud Operations Specialist (L1) role involves first-line monitoring, triage, and rapid incident response across multi-tenant and single-tenant cloud environments to ensure system availability, security, and compliance.


Responsibilities

  • Monitor infrastructure, applications, and network health using tools such as Grafana, Wiz, Datadog, and CrowdStrike Falcon
  • Detect, triage, and escalate alerts based on severity and business impact
  • Document incident timelines, actions, and resolutions in ticketing systems (ServiceNow, Jira Service Management)
  • Follow established FedRAMP incident handling and escalation procedures
  • Execute predefined runbooks for system checks, restarts, and health verifications
  • Validate post-maintenance and deployment health of systems and services
  • Assist with system patching coordination, log collection, and audit evidence gathering
  • Maintain situational awareness of system uptime, customer impact, and scheduled changes
  • Support basic troubleshooting for hosted applications
  • Validate API connectivity and assist in identifying failed integrations or logic errors
  • Collaborate with developers and CloudOps engineers to verify deployment health after releases
  • Escalate application-related issues with complete context: affected users, tenant ID, and integration dependencies
  • Ensure all activities follow change control, access management, and incident response procedures
  • Record detailed incident notes and maintain compliance-ready audit trails
  • Participate in Continuous Monitoring (ConMon) reporting and FedRAMP evidence collection

Skills

  • 1–3 years of experience in a NOC, SOC, application support, or support center environment, supporting production systems or customer-facing web applications
  • Experience supporting customer-facing web applications, including alert triage, incident documentation, and escalation to engineering or platform teams
  • Familiarity with Linux administration and command-line tools
  • Familiarity with AWS, Azure, or GCP infrastructure services
  • Understanding of network, compute, and application monitoring fundamentals
  • General application troubleshooting experience, including familiarity with web-based applications, common application architectures, and foundational web technologies such as HTTP, REST APIs, and JSON
  • Strong attention to detail, communication, and documentation skills
  • Bachelor's degree in Information Technology, Computer Science, Cybersecurity, or a related technical field, or equivalent practical experience supported by relevant industry certifications
  • Due to the nature of our work with federal government clients and compliance with applicable regulations, this position requires U.S. citizenship
  • Candidates must be able to provide documentation verifying U.S. citizenship status as part of the background check process
  • Any offer of employment is contingent upon the successful completion of all required pre-employment screenings, including a background check, in accordance with applicable laws and government contract requirements
  • Relevant industry certifications, such as CompTIA Security+, Linux+, ITIL v4, AWS Cloud Practitioner, or Microsoft Azure Fundamentals (AZ-900)

Benefits

  • Medical
  • Dental
  • Vision
  • Life & Disability
  • Unlimited PTO
  • Employee-funded 401(k) plan

Company Overview

  • Knox Systems' tagline is "FedRAMP in 90 Days for 90% less." The company was founded in 2024 and is headquartered in Austin, Texas, USA, with a workforce of 201-500 employees. Its website is http://knoxsystems.com.
