Logo
Insight Global

INTL - SRE System Monitor- 3bced1c5

Insight Global, San Jose

Save Job

Job Description

n

Insight Global is seeking a skilled LLM System Monitor to support the LLM Proxy team. You will be the person monitoring and interpreting the Grafana dashboards that will signal failures and problems in order to manage the incident communication. On a day-to-day basis you will be the SRE monitoring the observability dashboards. You will either begin an incident report yourself from an automated alert, or will be pulled into a chat zone by someone who has created a ticket. From here you will be the main point of contact exhibiting great communication to the end customer and the incident commander. You will give frequent updates of the status of the incident to all parties.

n

We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to Human Resources Request Form ( . The EEOC "Know Your Rights" Poster is available here ( .

n

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: .

n

Skills and Requirements

n

-3 years of experience responding and monitoring a globally deployed web application (keeping track of permutations)

n

-Experience working with microservices that run on a Kubernetes background

n

-Metrics forward thought process and a strong understanding of observability tools focusing on operational Metrics: Quantiles, P99, and Prometheus

n

-Familiarity with AWS services or any cloud provider foundational understanding

n

Very Strong Communication and Customer service skills LLM or AI Experience null

n

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion,sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by applicable laws, regulations, andordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to