Ingram Barge Company
Overview
Ingram Barge is seeking a Senior DevOps Engineer to join our dynamic DevSecOps team. This role will work alongside our Systems Architect, Application Development Architect, and Security Engineer to operationalize cloud-native infrastructure, enhance CI/CD pipelines, ensure system reliability and resilience, and provide 24x7 operational support.
What you will be doing
Pipeline & Automation
Design and implement advanced CI/CD pipeline features using GitLab
Develop and maintain Terraform modules for infrastructure provisioning
Create and optimize Ansible playbooks for configuration management and deployment automation
Integrate security scanning and compliance checks into deployment pipelines
Container & Kubernetes Operations
Build, configure, and maintain Azure Kubernetes Service (AKS) clusters
Develop and optimize Helm charts for application deployments
Implement and manage GitOps workflows
Monitor and troubleshoot containerized applications and cluster performance
Infrastructure & Reliability
Implement Infrastructure as Code best practices using Terraform and Ansible
Design and execute disaster recovery procedures and business continuity plans
Perform system patching, upgrades, and maintenance activities
Establish and maintain comprehensive monitoring, alerting, and observability solutions using Prometheus and Grafana
Cost Optimization & Resource Management
Monitor and analyze Azure cloud spending patterns and resource utilization
Implement cost optimization strategies including right-sizing, reserved instances, and auto-scaling policies
Develop dashboards and reports for cost tracking and forecasting
Collaborate with teams to optimize resource allocation and eliminate waste
Monitoring & Observability
Design and implement comprehensive monitoring solutions using Prometheus for metrics collection
Build and maintain Grafana dashboards for infrastructure, application, and business metrics
Configure intelligent alerting rules and escalation procedures
Establish SLIs, SLOs, and error budgets for critical services
24x7 Support & Incident Response
Participate in on-call rotation for 24x7 production support
Lead Tier 3 incident response efforts for production outages and system issues
Perform root cause analysis and implement preventive measures
Collaborate with development teams on performance optimization and troubleshooting
Maintain runbooks and documentation for operational procedures
Qualifications
Knowledge, Skills, and Abilities:
Technical Expertise (5+ years)
Strong experience with Kubernetes (AKS preferred) and container orchestration
Proficiency in Infrastructure as Code: Terraform and Ansible
Advanced GitLab CI/CD pipeline development and optimization
Experience with GitOps methodologies and leading toolsets like Helm, Flux and/or ArgoCD
Python scripting for automation and pipeline tasks
Azure cloud services and networking concepts
Monitoring & Cost Management
Hands-on experience with Prometheus for metrics collection and alerting
Proficiency in Grafana for dashboard creation and data visualization
Experience with Azure Cost Management tools and FinOps practices
Knowledge of resource optimization techniques and auto-scaling strategies
Understanding of cloud pricing models and cost allocation methods
DevOps & SRE Practices
Incident management and post-mortem processes
24x7 on-call experience with escalation procedures
Disaster recovery planning and implementation
Security best practices in CI/CD and infrastructure
Experience with chaos engineering and resilience testing
Collaborative Skills
Experience working with cross-functional teams
Strong troubleshooting and problem-solving abilities under pressure
Documentation and knowledge sharing practices
Comfortable with 24x7 on-call rotation responsibilities
Preferred Qualifications
Azure certifications (AZ-104, AZ-400, or AKS-related)
Experience with message bus systems (Azure Service Bus)
Knowledge of .NET applications and Angular frontend deployments
Familiarity with secret management solutions (Delinea or similar)
Experience with additional monitoring tools (Azure Monitor, Application Insights)
FinOps certification or cost optimization experience
Experience with alerting tools and PagerDuty integration
Additional Information
Why You Should Apply:
Professional and financial growth opportunities
Medical benefits
Retirement benefits
All your information will be kept confidential according to EEO guidelines.
If you are requesting reasonable accommodation or disability assistance in submitting your application, you may email us at Recruiting@ingrambarge.com
Ingram Marine Group and its affiliates (“Company”) is an Affirmative Action/Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, work-related mental or physical disability, veteran status, sexual orientation, gender identity, or genetic information.
EEO/AA Employer/Vet/Disabled
We participate in E-Verify.
http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf
http://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeosp.pdf