Google

Software Engineer III, Multimodal Machine Learning, Glasses

Google, San Jose, California, United States, 95199

Software Engineer III, Multimodal Machine Learning, Glasses 1 week ago Be among the first 25 applicants

Minimum Qualifications

Bachelor’s degree or equivalent practical experience.

2 years of experience with software development in one or more programming languages.

1 year of experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).

1 year of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.

1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

Preferred Qualifications

Master’s degree or PhD in computer science, mathematics, applied stats, machine learning or equivalent practical experience.

3 years of experience with data structures/algorithms.

1 year of experience working in a complex, matrixed organization involving cross‑functional or cross‑business projects.

Experience conducting applied research to enable new functionality and improve the quality and efficiency of large language and multimodal models.

About The Job Google's software engineers develop the next‑generation technologies that change how billions of users connect, explore, and interact with information and one another. Our projects handle information at massive scale and extend beyond web search. We seek engineers with expertise ranging from information retrieval, distributed computing, large‑scale system design, to AI, natural language processing and UI design. You will work on a specific project critical to Google’s needs and may switch teams as the business evolves.

We are developing agentic AI solutions for smart glasses, utilizing Gemini Live and Astra to create a multimodal conversational experience. The role focuses on building lightweight XR devices paired with AI to augment human intelligence.

Responsibilities

Design, develop, and deploy scalable and robust agentic AI solutions for high‑value, real‑world multimodal conversational AI use cases on smart glasses.

Gain a deep understanding of the Gemini Live and Astra tech stack and infrastructure. Optimize agent architecture/orchestration to ensure efficient deployment and operation at scale, focusing on inference cost.

Take ownership of AI quality for production systems, defining technical metrics, implementing evaluation frameworks, analyzing loss patterns, and driving improvements through data collection and model enhancements.

Implement, optimize, and advance state‑of‑the‑art AI techniques, focusing on multimodal conversational quality, tool use and goal‑oriented reasoning.

Drive progress through rapid experimentation, proposing and validating hypotheses, implementing and testing new ideas, and iterating quickly to find optimal solutions.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

#J-18808-Ljbffr