Zyphra
Get AI-powered advice on this job and more exclusive features.
About This Role
You will be a core contributor on Zyphra’s Vision Team building the next generation of vision-language models which can understand natural scenes with a focus on web, desktop, and mobile UIs. You will be deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies.
Key responsibilities include:
- Large‑scale vision encoder and vision‑language training runs
- Performance optimization of our training stack
- Image and video dataset collection, processing, and evaluation
- Architecture and training methodology ablations and improvements
Requirements
- Strong research taste and intuition, and the ability to take a project from conception to execution and write‑up.
- Strong implementation and prototyping ability—capable of turning an idea into quick experimentation.
- Excellent teamwork and communication skills in a fast‑paced research setting.
Good to Have
- Experience with training and evaluating vision‑language models.
- Experience creating and collecting large‑scale machine‑learning datasets, especially in the visual modality.
- Experience training vision encoders using contrastive learning or similar methods.
- Experience with supervised fine‑tuning, preference‑learning, and reinforcement‑learning methods.
- Strong intuitive ability to understand model behaviours and correct them through iterative fine‑tuning.
- Interest in data engineering and synthetic data generation.
- Postgraduate degree in Computer Science, Mathematics, Physics, Machine Learning, etc.
- Previously published machine‑learning research in well‑respected venues.
- Highly proficient with PyTorch and Python.
- Ability to quickly learn new fields and implement new ideas.
- Excellent communication and collaboration skills for research and engineering at scale.
Culture
- We value grounded, methodical research and engineering excellence equally.
- We encourage bold ideas and are willing to bet big on them.
- We move fast and lower the bar to impact.
- We all enjoy what we do and love discussing AI.
Benefits and Perks
- Medical, dental, vision, and FSA plans.
- Competitive salary, equity, and 401(k).
- Relocation and immigration support on a case‑by‑case basis.
- On‑site meals prepared by a dedicated culinary team; Thursday Happy Hours.
General Requirements
- Willing to be in‑person at our Palo Alto office.
- U.S. authorization to work; we consider O‑1 visa sponsorship for the right candidate.