AI is reshaping every industry—and semiconductor design is next. At our partner company, you’ll work at the intersection of cutting-edge ML and hardware innovation, helping bring frontier models specialized for semiconductors into the hands of chip designers. From deploying optimized foundation models to building low-latency inference infrastructure, your work will define how engineers interact with intelligence. If you're excited to make research real and shape the future of design, this role is for you.
What you'll do
- Fine-tune, evaluate, and deploy LLMs and foundation models for specialized chip engineering tasks.
- Build scalable inference systems, agentic workflows, and RAG pipelines to support intelligent, interactive design tools.
- Optimize model performance through batching, caching, quantization, and streaming to achieve sub-second latency.
- Collaborate closely with AI scientists to operationalize experimental techniques and deliver robust, production-grade systems.
- Contribute to architecture decisions around scalable, reliable ML infrastructure in a cloud-native environment.
- Own the end-to-end lifecycle of ML features—from prototyping to deployment and observability in production.
What You Bring
- 2+ years of ML engineering experience, including a track record of shipping production ML systems.
- 2+ years of hands-on experience working with LLMs, including prompt tuning, fine-tuning, and evaluation.
- Deep expertise in Python, PyTorch, and the surrounding ML tooling ecosystem.
- Familiarity with orchestration frameworks and agentic workflows (e.g., LangChain, LlamaIndex).
- Strong understanding of vector search, semantic embeddings, and RAG systems.
- Solid grasp of scalable ML architecture and the principles behind high-availability inference systems.
Bonus Points
- Experience working directly with ML researchers or in applied research settings.
- Background in electrical engineering, chip design tools, or physical design flows.
- Startup experience or contributions to high-velocity, cross-functional teams.
- Open-source contributions to ML tooling or infrastructure projects.
- Experience deploying models in cloud environments (e.g. AWS SageMaker, Bedrock, vLLM, or SGLang)
What it's like here
We’re a fast-moving AI startup with a collaborative, high-trust culture. You’ll work side-by-side with top engineers and scientists, translating cutting-edge research into real product impact. We value curiosity, speed, and precision—engineering systems that are elegant, pragmatic, and built to last. If you’re passionate about bringing frontier AI to the real world, you’ll feel right at home.
Logistics
This position is available in Silicon Valley’s Redwood Shores, CA and in Toronto, ON. We believe a mindmelt between engineers and scientists leads to our professional growth and to great products; we have a hybrid schedule with four days in office, one day remote. Cognichip accepts applications that need H1B transfer.
How to apply
Don’t meet every single requirement? Feel over-qualified? That’s okay—if you're excited about our mission, we’d still love to hear from you. We are growing fast and need a world class team of various experience levels.
Seniority level
Seniority level
Mid-Senior level
Employment type
Employment type
Full-time
Job function
Job function
Engineering and ResearchIndustries
Semiconductor Manufacturing and Software Development
Referrals increase your chances of interviewing at OCTOPYD by 2x
Inferred from the description for this job
Medical insurance
Vision insurance
401(k)
Get notified when a new job is posted.
Sign in to set job alerts for “Machine Learning Engineer” roles.
AI/ML Engineer (Multiple roles and seniority levels)
San Francisco, CA $149,998.00-$250,000.00 2 weeks ago
San Bruno, CA $117,000.00-$234,000.00 2 weeks ago
San Jose, CA $137,500.00-$236,500.00 3 weeks ago
San Jose, CA $120,700.00-$228,600.00 4 days ago
Mountain View, CA $124,500.00-$186,500.00 4 days ago
Sunnyvale, CA $167,000.00-$185,500.00 2 days ago
Palo Alto, CA $2,000.00-$2,500.00 3 weeks ago
San Francisco, CA $140,000.00-$235,000.00 1 day ago
Machine Learning Engineer (I, II, or Sr.)
Research Engineer - Machine Learning (ML)
New Grads 2025 - Software Engineer - Computer Vision/Deep Learning
San Jose, CA $120,000.00-$165,000.00 10 months ago
Data Scientist, Analytics - Safety Response
Software Engineer, AI Platform - New Grad
Mountain View, CA $145,000.00-$170,000.00 1 day ago
San Francisco, CA $180,000.00-$240,000.00 1 month ago
Machine Learning Scientist, NLP (All Levels)
San Francisco, CA $200,000.00-$300,000.00 6 months ago
San Jose, CA $120,700.00-$228,600.00 4 days ago
Machine Learning Scientist, NLP (All Levels)
San Francisco, CA $200,000.00-$300,000.00 5 months ago
Research Engineer, Applied AI/ML - Early Stage Project
Mountain View, CA $136,000.00-$185,000.00 5 days ago
San Jose, CA $169,500.00-$291,500.00 3 weeks ago
San Francisco, CA $140,000.00-$180,000.00 7 months ago
Machine Learning Engineer (I, II, or Sr.)
Sunnyvale, CA $158,200.00-$185,000.00 1 month ago
San Francisco, CA $164,700.00-$191,400.00 2 days ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr