ClearanceJobs
Site Reliability Engineer
Alaska Northstar Federal is looking for a Site Reliability Engineer to join the team on a long-term project to be awarded soon onsite at Warner Robins AFB, GA. The Site Reliability Engineer (SRE) will provide comprehensive engineering and operational support for cloud and application services hosted in Oracle Cloud Infrastructure (OCI), ensuring sustained system reliability, scalability, and security to meet demanding government program requirements. This position targets candidates with a record of excellence delivering both remote and onsite technical solutions in secure Federal/DoD environments, leveraging industry best practices in automation, modern DevOps, and continuous improvement. This position requires on-site support for customers located at Robins AFB, Warner Robins, GA. Responsibilities for the Site Reliability Engineer will include, but not be limited to: Cloud Infrastructure Operations: Deploy, administer, monitor, and sustain cloud infrastructure and application components in Oracle Cloud Infrastructure (OCI), including the design and management of resilient, highly available solutions aligned to organizational requirements. Automation & Engineering Excellence: Develop and maintain Infrastructure as Code (IaC) scripts, automation pipelines, and configuration management code for deployment, patch management, and cloud operations, leveraging tools such as Terraform and Ansible. DevOps & CI/CD: Collaborate with Development and Security teams to implement and optimize automated build, integration, deployment, and testing pipelines for application releases; drive continuous improvement of DevOps/DevSecOps practices. Service Monitoring & Incident Response: Implement end-to-end monitoring and alerting solutions for application/infrastructure health using OCI native services and industry-standard tools. Triage and resolve production incidents to minimize downtime and user impact, conduct root cause analysis, and drive preventive measures. Cloud Security & Compliance: Support implementation of security controls IAW DoD RMF, NIST, and FedRAMP guidelines; coordinate Authority to Operate (ATO) sustainment; perform vulnerability remediations and ensure compliance with cyber mandates, including scanning, patching, incident response, and audit trail management. Collaboration & Reporting: Provide cross-functional technical consultation to project stakeholders, system engineers, security professionals, and end users; produce clear, actionable technical documentation, status reports, recommendations, and compliance deliverables. Continuous Improvement: Proactively identify opportunities to automate and enhance operational procedures, enabling increased system performance, reduced operational risk, and improved end user experience. Knowledge Management: Maintain and update technical resources, procedures, and troubleshooting guides (e.g., SharePoint, internal wikis), and support knowledge transfer events within the engineering team and to customer stakeholders. Requirements: Candidate must be a U.S. Citizen Candidate must have an active DoD Secret Clearance Candidate must have an active Security+ CE (or equivalent) Candidate must have at least 4 years of experience in the following: Supporting cloud environments, with specific focus on Oracle Cloud Infrastructure (OCI) or major IaaS providers Managing production workloads in a regulated/Federal environment. Automation & Scripting: Proficiency in modern scripting languages (Python, Bash, or PowerShell) and Infrastructure as Code (e.g. Terraform, Ansible). Monitoring & Observability: Strong experience with cloud monitoring solutions (e.g., OCI Monitoring, Splunk, Grafana) and application performance management tools. Security & Compliance: Familiarity with DoD, NIST, and FedRAMP requirements; practical experience supporting RMF/ATO processes and continuous cybersecurity operations, including vulnerability management and log/audit management. Incident Management: Proven skills in operational troubleshooting, root cause analysis, and the implementation of durable corrective/preventive actions. Collaboration & Communication: Excellent written and verbal communications; ability to clearly document processes, procedures, and technical findings for diverse stakeholders, including program managers, government users, and auditors. Desired Knowledge, Skills, Abilities: Bachelor's degree Certifications such as OCI Certified Architect Associate/Professional, AWS/Azure/GCP Cloud certifications. ITIL Foundation and/or familiarity with ITIL-based incident management. Supporting large, distributed DoD or Federal program environments. Container orchestration (Kubernetes, Docker), CI/CD tools (Jenkins, GitLab), and Agile methodologies. Windows/Linux administration in a cloud enterprise context. About ANF Alaska Northstar Federal (ANF) maintains an outstanding work environment that includes competitive compensation, outstanding benefits, and challenging work assignments with opportunities for advancement/career growth. To be considered for employment opportunities you must complete an online application. EEO Statement ANF is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or protected veteran status. U.S. Citizenship is required for most positions. ANF is an advocate of preferential hiring and professional development of qualified Shee Atik Inc shareholders, their spouses and descendants, and Alaska Natives in accordance with Public Law 93-638.
Alaska Northstar Federal is looking for a Site Reliability Engineer to join the team on a long-term project to be awarded soon onsite at Warner Robins AFB, GA. The Site Reliability Engineer (SRE) will provide comprehensive engineering and operational support for cloud and application services hosted in Oracle Cloud Infrastructure (OCI), ensuring sustained system reliability, scalability, and security to meet demanding government program requirements. This position targets candidates with a record of excellence delivering both remote and onsite technical solutions in secure Federal/DoD environments, leveraging industry best practices in automation, modern DevOps, and continuous improvement. This position requires on-site support for customers located at Robins AFB, Warner Robins, GA. Responsibilities for the Site Reliability Engineer will include, but not be limited to: Cloud Infrastructure Operations: Deploy, administer, monitor, and sustain cloud infrastructure and application components in Oracle Cloud Infrastructure (OCI), including the design and management of resilient, highly available solutions aligned to organizational requirements. Automation & Engineering Excellence: Develop and maintain Infrastructure as Code (IaC) scripts, automation pipelines, and configuration management code for deployment, patch management, and cloud operations, leveraging tools such as Terraform and Ansible. DevOps & CI/CD: Collaborate with Development and Security teams to implement and optimize automated build, integration, deployment, and testing pipelines for application releases; drive continuous improvement of DevOps/DevSecOps practices. Service Monitoring & Incident Response: Implement end-to-end monitoring and alerting solutions for application/infrastructure health using OCI native services and industry-standard tools. Triage and resolve production incidents to minimize downtime and user impact, conduct root cause analysis, and drive preventive measures. Cloud Security & Compliance: Support implementation of security controls IAW DoD RMF, NIST, and FedRAMP guidelines; coordinate Authority to Operate (ATO) sustainment; perform vulnerability remediations and ensure compliance with cyber mandates, including scanning, patching, incident response, and audit trail management. Collaboration & Reporting: Provide cross-functional technical consultation to project stakeholders, system engineers, security professionals, and end users; produce clear, actionable technical documentation, status reports, recommendations, and compliance deliverables. Continuous Improvement: Proactively identify opportunities to automate and enhance operational procedures, enabling increased system performance, reduced operational risk, and improved end user experience. Knowledge Management: Maintain and update technical resources, procedures, and troubleshooting guides (e.g., SharePoint, internal wikis), and support knowledge transfer events within the engineering team and to customer stakeholders. Requirements: Candidate must be a U.S. Citizen Candidate must have an active DoD Secret Clearance Candidate must have an active Security+ CE (or equivalent) Candidate must have at least 4 years of experience in the following: Supporting cloud environments, with specific focus on Oracle Cloud Infrastructure (OCI) or major IaaS providers Managing production workloads in a regulated/Federal environment. Automation & Scripting: Proficiency in modern scripting languages (Python, Bash, or PowerShell) and Infrastructure as Code (e.g. Terraform, Ansible). Monitoring & Observability: Strong experience with cloud monitoring solutions (e.g., OCI Monitoring, Splunk, Grafana) and application performance management tools. Security & Compliance: Familiarity with DoD, NIST, and FedRAMP requirements; practical experience supporting RMF/ATO processes and continuous cybersecurity operations, including vulnerability management and log/audit management. Incident Management: Proven skills in operational troubleshooting, root cause analysis, and the implementation of durable corrective/preventive actions. Collaboration & Communication: Excellent written and verbal communications; ability to clearly document processes, procedures, and technical findings for diverse stakeholders, including program managers, government users, and auditors. Desired Knowledge, Skills, Abilities: Bachelor's degree Certifications such as OCI Certified Architect Associate/Professional, AWS/Azure/GCP Cloud certifications. ITIL Foundation and/or familiarity with ITIL-based incident management. Supporting large, distributed DoD or Federal program environments. Container orchestration (Kubernetes, Docker), CI/CD tools (Jenkins, GitLab), and Agile methodologies. Windows/Linux administration in a cloud enterprise context. About ANF Alaska Northstar Federal (ANF) maintains an outstanding work environment that includes competitive compensation, outstanding benefits, and challenging work assignments with opportunities for advancement/career growth. To be considered for employment opportunities you must complete an online application. EEO Statement ANF is committed to hiring and retaining a diverse workforce. We are proud to be an Equal Opportunity/Affirmative Action-Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or protected veteran status. U.S. Citizenship is required for most positions. ANF is an advocate of preferential hiring and professional development of qualified Shee Atik Inc shareholders, their spouses and descendants, and Alaska Natives in accordance with Public Law 93-638.