ServiceNow

Senior Machine Learning Engineer AI Inferencing

ServiceNow, Santa Clara, California, United States, 95050

Senior Machine Learning Engineer AI Inferencing

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today

ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone. Job Description

This role is based in our Santa Clara office and requires two days in the office. PLATO (Platform Engineering and AI Technology Organization) at ServiceNow is a customer-focused innovative group building intelligent software using a variety of technology stacks to enable end-to-end, industry-leading work experiences for our customers. We are a group of people deeply invested in the success of our customers that happen to have expertise and knowledge in advanced technologies and software engineering best practices. We are data driven, structured, committed and we enjoy what we are doing. We prioritize robustness, performance and user experience over the technology stack and tools. What you get to do in this role: You will play a major part in building AI and Machine Learning (ML) platform that transform the user experience and workflow efficiency of enterprise services. Your skills will enable us to build and optimize a high-performance inferencing platform, allowing our core platform to provide high quality AI solutions to our enterprise customers globally. Utilize your expertise in Python and Golang to develop high-performance components of the AI Platform. Collaborate with cross-functional teams to integrate AI capabilities seamlessly into workflows and user experiences. Ensure reliability and performance of AI models by applying best practices in software engineering and AI inferencing. Stay ahead of the curve by quickly learning emerging technologies and applying them to enhance the AI Platform. Qualifications

To be successful in this role you have: Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. Low Latency Optimization: Experience in optimizing models for low latency inference. High Throughput Optimization: Knowledge of maximizing inference throughput. Real-time Systems: Understanding the constraints of real-time systems on model inference. Model Quantization and Compression: Practical experience in reducing model size and computational cost. Proficient in prompt engineering and developing LLM based features Experience in using AI productivity tools such as Cursor, Windsurf, etc. Minimum 5 years of experience working in Software Development role. Proficiency in Python and Golang, with a strong grasp of software engineering principles. Hands-on experience with prompt engineering: ability to craft, test, and optimize prompts for task accuracy and efficiency. Demonstrated ability to thrive in fast-paced, dynamic environments. Knowledge of unit testing, profiling, and code tuning For positions in this location, we offer a base pay of $158,500 - $269,500, plus equity (when applicable), variable/incentive compensation and benefits. Sales positions generally offer a competitive On Target Earnings (OTE) incentive compensation structure. Please note that the base pay shown is a guideline, and individual total compensation will vary based on factors such as qualifications, skill level, competencies, and work location. We also offer health plans, including flexible spending accounts, a 401(k) Plan with company match, ESPP, matching donations, a flexible time away plan and family leave programs. Compensation is based on the geographic location in which the role is located and is subject to change based on work location.