CACI International Inc
Overview
Join to apply for the
System Reliability Engineer
role at
CACI International Inc .
The Opportunity
Join NAVSEA 03S as a Site Reliability Engineer for the Navy Maintenance and Modernization Enterprise Solution (NMMES)
Play a crucial role in ensuring the reliability, performance, and scalability of IT systems supporting naval ship and submarine maintenance operations
Work on complex systems serving over 45,000 users across global Navy facilities, including Navy Shipyards and Intermediate Maintenance Facilities
Bridge the gap between development and operations teams to improve system reliability and efficiency
Contribute to the modernization of NMMES infrastructure, balancing legacy systems with new technology integration
Implement and maintain robust monitoring, alerting, and incident response processes
Apply SAFe Agile and DevOps methodologies to improve system reliability and team efficiency
Responsibilities
Design, implement, and maintain scalable and reliable infrastructure for NMMES applications and services
Develop and implement automation solutions for deployment, scaling, and management of NMMES systems
Monitor system performance, availability, and capacity, and proactively address potential issues
Implement and maintain robust logging, monitoring, and alerting systems
Participate in on-call rotations to provide 24/7 support for critical NMMES systems
Collaborate with development teams to improve application performance and reliability
Conduct post-incident reviews and implement improvements to prevent future incidents
Develop and maintain documentation for system architecture, operations procedures, and disaster recovery plans
Implement and champion SAFe Agile and DevOps practices within the NMMES program
Required Qualifications
Bachelor's degree in Computer Science, Information Systems, or related field
At least 10 years of experience in systems engineering, DevOps, or site reliability engineering
Strong knowledge of Linux/Unix systems administration
Experience with configuration management tools (e.g., Ansible, Puppet, Chef)
Proficiency in scripting languages (e.g., Python, Bash)
Familiarity with containerization technologies (e.g., Docker, Kubernetes)
Experience with cloud platforms (e.g., AWS, Azure, GCP)
SAFe Agilist (SA) certification or higher
Must be a US Citizen with an active Secret clearance
Desired Qualifications
Experience working with DoD/Navy programs or similar complex government IT systems
Knowledge of both legacy systems and modern web application technologies
Familiarity with network protocols and security best practices
Experience with database administration (e.g., MySQL, PostgreSQL, Oracle)
Knowledge of monitoring and logging tools (e.g., Prometheus, ELK stack)
Advanced SAFe certifications such as SAFe DevOps Practitioner
Understanding of cybersecurity requirements for DoD systems
Experience with high-availability and disaster recovery strategies
Strong problem-solving skills and ability to work effectively in a team environment
Job Details
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: IT Services and IT Consulting
Pay Range The Proposed Salary Range For This Position Is
$98,500-$206,800
EEO and Other Statements CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic.
#J-18808-Ljbffr
System Reliability Engineer
role at
CACI International Inc .
The Opportunity
Join NAVSEA 03S as a Site Reliability Engineer for the Navy Maintenance and Modernization Enterprise Solution (NMMES)
Play a crucial role in ensuring the reliability, performance, and scalability of IT systems supporting naval ship and submarine maintenance operations
Work on complex systems serving over 45,000 users across global Navy facilities, including Navy Shipyards and Intermediate Maintenance Facilities
Bridge the gap between development and operations teams to improve system reliability and efficiency
Contribute to the modernization of NMMES infrastructure, balancing legacy systems with new technology integration
Implement and maintain robust monitoring, alerting, and incident response processes
Apply SAFe Agile and DevOps methodologies to improve system reliability and team efficiency
Responsibilities
Design, implement, and maintain scalable and reliable infrastructure for NMMES applications and services
Develop and implement automation solutions for deployment, scaling, and management of NMMES systems
Monitor system performance, availability, and capacity, and proactively address potential issues
Implement and maintain robust logging, monitoring, and alerting systems
Participate in on-call rotations to provide 24/7 support for critical NMMES systems
Collaborate with development teams to improve application performance and reliability
Conduct post-incident reviews and implement improvements to prevent future incidents
Develop and maintain documentation for system architecture, operations procedures, and disaster recovery plans
Implement and champion SAFe Agile and DevOps practices within the NMMES program
Required Qualifications
Bachelor's degree in Computer Science, Information Systems, or related field
At least 10 years of experience in systems engineering, DevOps, or site reliability engineering
Strong knowledge of Linux/Unix systems administration
Experience with configuration management tools (e.g., Ansible, Puppet, Chef)
Proficiency in scripting languages (e.g., Python, Bash)
Familiarity with containerization technologies (e.g., Docker, Kubernetes)
Experience with cloud platforms (e.g., AWS, Azure, GCP)
SAFe Agilist (SA) certification or higher
Must be a US Citizen with an active Secret clearance
Desired Qualifications
Experience working with DoD/Navy programs or similar complex government IT systems
Knowledge of both legacy systems and modern web application technologies
Familiarity with network protocols and security best practices
Experience with database administration (e.g., MySQL, PostgreSQL, Oracle)
Knowledge of monitoring and logging tools (e.g., Prometheus, ELK stack)
Advanced SAFe certifications such as SAFe DevOps Practitioner
Understanding of cybersecurity requirements for DoD systems
Experience with high-availability and disaster recovery strategies
Strong problem-solving skills and ability to work effectively in a team environment
Job Details
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: IT Services and IT Consulting
Pay Range The Proposed Salary Range For This Position Is
$98,500-$206,800
EEO and Other Statements CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic.
#J-18808-Ljbffr