UNIX Administrator & Cloud Automation Engineer
Atlanticus - Atlanta
Job Location
Atlanta, GA
Description
Job Title: UNIX Administrator & Cloud Automation Engineer
Location: Atlanta / Hybrid
Department: IT Infrastructure & Operations
Experience: 7-10 years of experience in UNIX administration and Infrastructure as Code
Reports to: VP of Infrastructure & Cloud Services
Position Summary
We are looking for a highly skilled UNIX Administrator & Cloud Automation Engineer with a strong background in AWS cloud environments, containerization, and Infrastructure as Code (IaC). This role is instrumental in supporting customer-facing fintech platforms, maintaining highly available UNIX systems, and driving automation using Terraform, CloudFormation, and Ansible.
The ideal candidate brings a strong SRE/ITIL mindset, thrives in high-pressure environments, and demonstrates curiosity and adaptability, especially toward learning and adopting AI-based operational technologies.
Key Responsibilities
UNIX/Linux Administration & Application Support:
- Manage and administer UNIX/Linux systems (e.g., RHEL, CentOS, Ubuntu) supporting production fintech workloads.
- Support and troubleshoot critical customer-facing applications in partnership with development teams.
- Perform system patching, tuning, upgrades, and security hardening.
- Ensure endpoint protection and compliance using tools like SentinelOne.
- Deploy and maintain applications in AWS ECS, EKS, and Fargate container environments.
- Build, monitor, and troubleshoot container workloads, images, and registries (Docker, ECR).
- Implement cloud-native designs for scalability, fault tolerance, and cost optimization.
- Design, develop, and manage reusable Terraform modules and AWS CloudFormation templates to provision scalable and secure AWS infrastructure.
- Use Ansible playbooks to automate configuration management, deployments, and compliance tasks.
- Implement Terraform pipelines in CI/CD for repeatable infrastructure provisioning across development, staging, and production environments.
- Set up, configure, and manage monitoring and alerting using Nagios, Datadog, and CloudWatch.
- Participate in on-call rotations and rapid response for infrastructure incidents.
- Document root cause analyses (RCAs) and lessons learned in JIRA, and collaborate on continuous improvement.
- Administer NetApp storage solutions, ensuring availability, performance, and backup integrity.
- Work in alignment with ITIL and SRE frameworks to support change, incident, and problem management.
- Partner with DevOps, App Dev, InfoSec, and product teams for seamless integration of infrastructure and application needs.
- Maintain detailed documentation including SOPs, architecture diagrams, and automation workflows.
Qualifications
- 7-10 years of UNIX/Linux system administration in high-availability environments.
- Strong hands-on experience with AWS services, including EC2, ECS, EKS, IAM, VPC, S3, and CloudWatch.
- 3+ years of experience managing containers (Docker, Kubernetes) in AWS environments.
- Deep hands-on expertise with Terraform and CloudFormation, including state management, remote backends, workspaces, and module-based architecture.
- Strong knowledge of Ansible for configuration management and orchestration.
- Experience with NetApp storage and enterprise data lifecycle operations.
- Proficiency in scripting languages such as Bash (or other shells) and Python.
- Familiarity with SentinelOne, Nagios, Datadog, and JIRA.
- Solid understanding of ITIL and SRE principles.
- Strong troubleshooting, documentation, and critical thinking skills.
Preferred Qualifications
- AWS Certifications (e.g., Solutions Architect, SysOps, DevOps Engineer).
- Kubernetes certifications (CKA, CKAD) are a plus.
- Contributions to Terraform module registries, GitHub projects, or internal IaC libraries are considered a strong plus.
- Experience in the fintech or financial services industry.
- Exposure to AI-based automation or interest in AIOps platforms.
- Strong communication and cross-functional collaboration abilities.