CACI International Inc

System Reliability Engineer

CACI International Inc, Norfolk, Virginia, United States, 23500

The Opportunity

Join NAVSEA 03S as a Site Reliability Engineer for the Navy Maintenance and Modernization Enterprise Solution (NMMES)

Play a crucial role in ensuring the reliability, performance, and scalability of IT systems supporting naval ship and submarine maintenance operations

Work on complex systems serving over 45,000 users across global Navy facilities, including Navy Shipyards and Intermediate Maintenance Facilities

Bridge the gap between development and operations teams to improve system reliability and efficiency

Contribute to the modernization of NMMES infrastructure, balancing legacy systems with new technology integration

Implement and maintain robust monitoring, alerting, and incident response processes

Apply SAFe Agile and DevOps methodologies to improve system reliability and team efficiency

Responsibilities

Design, implement, and maintain scalable and reliable infrastructure for NMMES applications and services

Develop and implement automation solutions for deployment, scaling, and management of NMMES systems

Monitor system performance, availability, and capacity, and proactively address potential issues

Implement and maintain robust logging, monitoring, and alerting systems

Participate in on-call rotations to provide 24/7 support for critical NMMES systems

Collaborate with development teams to improve application performance and reliability

Conduct post-incident reviews and implement improvements to prevent future incidents

Develop and maintain documentation for system architecture, operations procedures, and disaster recovery plans

Implement and champion SAFe Agile and DevOps practices within the NMMES program

Required Qualifications

Bachelor's degree in Computer Science, Information Systems, or related field

At least 10 years of experience in systems engineering, DevOps, or site reliability engineering

Strong knowledge of Linux/Unix systems administration

Experience with configuration management tools (e.g., Ansible, Puppet, Chef)

Proficiency in scripting languages (e.g., Python, Bash)

Familiarity with containerization technologies (e.g., Docker, Kubernetes)

Experience with cloud platforms (e.g., AWS, Azure, GCP)

SAFe Agilist (SA) certification or higher

Must be a US Citizen with an active Secret clearance

Desired Qualifications

Experience working with DoD/Navy programs or similar complex government IT systems

Knowledge of both legacy systems and modern web application technologies

Familiarity with network protocols and security best practices

Experience with database administration (e.g., MySQL, PostgreSQL, Oracle)

Knowledge of monitoring and logging tools (e.g., Prometheus, ELK stack)

Advanced SAFe certifications such as SAFe DevOps Practitioner

Understanding of cybersecurity requirements for DoD systems

Experience with high-availability and disaster recovery strategies

Strong problem-solving skills and ability to work effectively in a team environment

Job Details

Seniority level: Mid-Senior level

Employment type: Full-time

Job function: Engineering and Information Technology

Industries: IT Services and IT Consulting

Pay Range

The proposed salary range for this position is $98,500-$206,800.

EEO and Other Statements

CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic.
