Logo
Knox Systems

Cloud Operations Engineer

Knox Systems, Woburn, Massachusetts, us, 01813

Save Job

*US CITIZENSHIP REQUIRED*

Apply promptly! A high volume of applicants is expected for the role as detailed below, do not wait to send your CV.

The *Cloud Operations Engineer (L2)* is responsible for advanced troubleshooting, system administration, and application environment support across Knoxs *cloud infrastructure*. This role bridges operations, automation, and development support maintaining system stability, executing changes, and ensuring compliance within *FedRAMP Moderate, High, and DoD IL5* environments.

The ideal candidate brings strong *Linux, cloud, and automation experience*, with an understanding of application architecture and *low-code/no-code platform operations*.

*Key ResponsibilitiesIncident Management & System Troubleshooting*

* Perform advanced troubleshooting for infrastructure, OS, and application issues. * Analyze system logs, metrics, and telemetry from monitoring platforms (Grafana, Datadog, Wiz, CloudWatch). * Coordinate with Platform/DevOps Engineers on root cause analysis and long-term remediation. * Ensure timely resolution of escalated incidents in accordance with SLAs.

*Cloud Administration & Maintenance*

* Manage and maintain *AWS, Azure, and hybrid environments* following configuration baseline controls (CM-2, CM-6). * Execute system patching, upgrades, and configuration changes via automation or scripts. * Perform health checks, deployment validations, and post-change verifications. * Maintain infrastructure documentation and system configuration inventories.

*Application Support & Deployment Assistance*

* Support *low-code/no-code and custom applications* during deployment and maintenance windows. * Troubleshoot app-layer issues such as API failures, integration errors, or misconfigurations. * Work with DevOps/Platform teams to optimize *CI/CD deployment workflows* and rollback plans. * Ensure adherence to *change management and deployment authorization* processes.

*Automation & Scripting*

* Create or modify automation scripts (Bash, Python, PowerShell) for maintenance and reporting tasks. * Leverage *Terraform, Ansible, or cloud-native tools* for provisioning and environment consistency. * Proactively identify opportunities to automate recurring operational processes. * Document system changes and incident response details for FedRAMP audits.

*Qualifications*

* 35 years of experience in *cloud operations, system administration, or infrastructure support*. * Proficiency in *Linux administration* and *command-line troubleshooting*. * Strong working knowledge of *AWS and/or Azure infrastructure services*. * Familiarity with *CI/CD pipelines* and deployment automation tools. * Understanding of *low-code/no-code platforms* (Power Platform, ServiceNow, Salesforce) and related integration troubleshooting. * Experience writing and maintaining scripts (Bash, Python, PowerShell) * Familiarity with *FedRAMP, NIST 800-53*, or similar compliance environments. * US Citizenship required

*Preferred Certifications:* AWS SysOps Administrator, Microsoft Azure Administrator, Terraform Associate, CompTIA Security+, ITIL v4.

*Success Indicators*

* Improved uptime and service reliability across assigned systems. * Faster incident resolution and RCA completion times. * Demonstrated automation improvements in recurring operations. * Positive collaboration feedback from DevOps and Security teams. * Support *Continuous Monitoring (ConMon)* activities through vulnerability reporting and patch compliance tracking. * Assist in maintaining logs, baselines, and access control evidence.

Job Types: Full-time, Contract

Pay: $105,000.00 - $135,000.00 per year

Work Location: Hybrid remote in Woburn, MA 01801