xAI
Member of Technical Staff, Multimodal Understanding (Visual / Audio)
xAI, New York, New York, us, 10261
Member of Technical Staff – Multimodal Understanding (Visual / Audio)
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. The team is small, highly motivated, and focused on engineering excellence. We operate with a flat organizational structure. All employees are expected to be hands‑on and to contribute directly to the company’s mission, showing initiative and consistently delivering excellence. Strong work ethic, prioritization skills, and communication skills are essential.
Focus
Creating and driving an engineering agenda toward superhuman multimodal capabilities, covering multimodal understanding and multimodal generation across image, video, and audio modalities.
Improving data quality, developing data filtering/generation techniques, and performing data studies at pre‑training scale.
Creating evaluation frameworks and internal benchmarks.
Designing and implementing efficient algorithms to achieve state‑of‑the‑art model performance.
Ideal Experience
Hands‑on experience with visual, audio or multimodal pre‑training.
Track record of leading engineering efforts that significantly improve neural‑network capability and performance, whether via better data or better modeling.
Experience in data‑driven experiment design and systematic analysis for iterative model debugging.
Experience developing or working with large‑scale distributed machine‑learning systems.
Ability to do whatever is necessary to deliver the best end‑to‑end user experience.
Tech Stack
Python
Jax
Rust
Interview Process After submitting your application, the team reviews your CV and a statement of exceptional work. If your application passes this stage, you are invited to a 15‑minute introductory call. Upon clearance you enter the main process, which includes four technical interviews:
One‑on‑one engineering discussion & coding interviews (three sessions).
Meet the team: Present your past exceptional work and your vision for xAI to a small audience.
All interviews are conducted via Google Meet.
Location The role is based in the New York area. Candidates are expected to be located near New York City or open to relocation.
Annual Salary Range $180,000 - $440,000 USD
Benefits Base salary, equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short‑ and long‑term disability insurance, life insurance, and various other discounts and perks.
Equal Opportunity Statement xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.
Seniority Level Mid‑Senior level
Employment Type Full‑time
Job Function Engineering and Information Technology
Industries Technology, Information and Internet
#J-18808-Ljbffr
Focus
Creating and driving an engineering agenda toward superhuman multimodal capabilities, covering multimodal understanding and multimodal generation across image, video, and audio modalities.
Improving data quality, developing data filtering/generation techniques, and performing data studies at pre‑training scale.
Creating evaluation frameworks and internal benchmarks.
Designing and implementing efficient algorithms to achieve state‑of‑the‑art model performance.
Ideal Experience
Hands‑on experience with visual, audio or multimodal pre‑training.
Track record of leading engineering efforts that significantly improve neural‑network capability and performance, whether via better data or better modeling.
Experience in data‑driven experiment design and systematic analysis for iterative model debugging.
Experience developing or working with large‑scale distributed machine‑learning systems.
Ability to do whatever is necessary to deliver the best end‑to‑end user experience.
Tech Stack
Python
Jax
Rust
Interview Process After submitting your application, the team reviews your CV and a statement of exceptional work. If your application passes this stage, you are invited to a 15‑minute introductory call. Upon clearance you enter the main process, which includes four technical interviews:
One‑on‑one engineering discussion & coding interviews (three sessions).
Meet the team: Present your past exceptional work and your vision for xAI to a small audience.
All interviews are conducted via Google Meet.
Location The role is based in the New York area. Candidates are expected to be located near New York City or open to relocation.
Annual Salary Range $180,000 - $440,000 USD
Benefits Base salary, equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short‑ and long‑term disability insurance, life insurance, and various other discounts and perks.
Equal Opportunity Statement xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.
Seniority Level Mid‑Senior level
Employment Type Full‑time
Job Function Engineering and Information Technology
Industries Technology, Information and Internet
#J-18808-Ljbffr