Georgia Staffing

Enterprise Observability & Automation Administrator

Georgia Staffing, Suwanee, Georgia, United States, 30174

Observability & Automation Administrator

GCPS is seeking a highly skilled Observability & Automation Administrator to design, implement, and support monitoring, logging, and automation solutions across our infrastructure. This role is critical to improving system visibility, reducing incident response times, and streamlining operations through intelligent automation. The ideal candidate will be a technical expert with a strong background in observability tools, scripting, and infrastructure automation.

Key Responsibilities:
Deploy and maintain our primary observability tool, Dynatrace.
Develop and maintain monitoring dashboards, alerts, and SLO/SLA reporting.
Design and implement automated workflows using tools like Ansible, Terraform, or ServiceNow.
Collaborate with infrastructure and application teams to integrate telemetry and logging standards across environments.
Use AI/ML features in observability platforms to reduce alert noise and accelerate root cause analysis.
Participate in incident response, including after-action reviews and proactive improvements.
Document observability standards, automation runbooks, and integrations.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances.

Skills and Requirements:
3+ years of experience in IT infrastructure, DevOps, or SRE roles with a focus on monitoring and automation.
Hands-on experience with Dynatrace, Datadog, Grafana, or similar platforms.
Proficiency in scripting languages such as Python, PowerShell, or Bash.
Familiarity with ITSM platforms like ServiceNow, including workflow development.
Strong understanding of telemetry concepts: logs, metrics, traces, events.
Experience supporting hybrid cloud environments (AWS, Azure, on-prem).
Comfortable working across Linux, Windows, and containerized environments.