Snowrelic Inc

AI/ML Engineer

Snowrelic Inc, San Jose, California, United States, 95199

Overview

Role: AI/ML Engineer
Location: San Jose, CA
Work arrangement: 5 days per week, work from office (WFO)
Notice period: 2 weeks
Visa: Any (except OPT and CPT)

Responsibilities

Design and implement AI agents to optimize cloud resource allocation, auto-scaling, and performance tuning.
Develop predictive models for failure detection, incident management, and system health monitoring.
Automate operational workflows using machine learning and intelligent scripting.
Integrate AI-driven insights with existing cloud monitoring tools.
Collaborate with DevOps and SRE teams to deploy, monitor, and improve ML models in production environments.
Conduct anomaly detection for security, cost optimization, and performance analytics.
Continuously evaluate emerging AI technologies and tools for operational improvements.
Maintain documentation and best practices for AI/ML integration in cloud systems.

Minimum Requirements

Bachelor's or master's degree in Computer Science, Data Science, or a related technical field, or equivalent experience.
Proven experience building and deploying ML models, including at least 2 years focused on infrastructure or cloud operations.
Solid knowledge of hybrid cloud technologies (AWS, GCP, OpenStack, Kubernetes).
Experience with Python, Jupyter, and ML libraries such as PyTorch, TensorFlow, or scikit-learn.
Familiarity with cloud-native monitoring, logging, and automation tools (e.g., Terraform, Ansible, Prometheus, Splunk, AppDynamics).
Comfortable working with streaming data, APIs, and telemetry systems.
Strong communication and cross-functional collaboration skills.
Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), version control systems (e.g., Git), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins).
Proficiency in general-purpose programming languages (Python, Go, Bash, and/or C/C++) and related development platforms and technologies.

Preferred Qualifications

Deep understanding of operating systems and experience with Cisco technologies (UCS, Nexus, ThousandEyes).
Established record of leading technical initiatives and delivering results, with a commitment to fostering a supportive work environment.
Hard-working and dedicated to providing quality support for customers.
