Dexian
Observability Engineer (3448) - Tampa/Dallas
Location: Dallas or Tampa | Hybrid: 3 days onsite | Contract: 6-month contract-to-hire

We are seeking a senior-level Observability & AIOps Engineer with hands-on experience in Java and Python to enhance enterprise IT observability, resilience, and reliability. This role blends hands-on engineering with architectural guidance to optimize monitoring, performance, and reliability across IT systems.

Key Responsibilities
Design, prototype, test, and document observability and reliability solutions.
Publish technology strategies, observability standards, and best practices.
Translate business goals into technical solutions that meet non-functional requirements.
Create Observability Driven Development procedures and promote adoption of open-standard frameworks (OTel, MELTS).
Implement AI-augmented testing strategies for federated execution and enterprise governance.
Collaborate with SREs and production support teams to improve distributed tracing, trade processing reliability, and chaos testing.
Design and implement full-stack applications for operational predictability and prescriptive disruption response.
Establish monitoring and alerting standards for performance, scalability, availability, and reliability.

Experience & Qualifications
Distributed Applications: 10+ years designing and implementing distributed systems.
Networking & Infrastructure: 5+ years in networking, middleware, infrastructure, and database architecture.
Highly Available Architecture: 5+ years implementing highly available solutions.
Disaster Recovery: 5+ years with disaster recovery methodologies and patterns.
Hands-On Development: Senior-level expertise in Java and Python for observability and reliability engineering.

Knowledge & Skills
Strong problem-solving and independent work capabilities.
Familiarity with public cloud environments (AWS, Azure) is a plus.
Performance analysis, tuning, and engineering experience is desirable.
Knowledge of monitoring/observability tools: Dynatrace, Splunk, Grafana, Prometheus, OpenTelemetry, CloudWatch, CloudTrail.
Ability to design solutions that improve resilience, reliability, and operational efficiency.

Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. Dexian was formed in 2023 through the merger of DISYS and Signature Consultants. Dexian connects talent, technology, and organizations to produce results. Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development and Dexian IT Solutions. Visit the Dexian website to learn more.

Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.