Logo
IMAGE FRAME INVESTMENT (UK) LIMITED

Research Scientist - Speech & Audio Understanding (Speech Generation)

IMAGE FRAME INVESTMENT (UK) LIMITED, Bellevue, Washington, us, 98009

Save Job

Research Scientist - Speech & Audio Understanding (Speech Generation) page is loaded Research Scientist - Speech & Audio Understanding (Speech Generation) Apply remote type Onsite locations US-Washington-Bellevue time type Full time posted on Posted 8 Days Ago job requisition id R105612

Business Unit

Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia,TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

What the Role Entails

Job Responsibilities: 1. Track the latest research in speech generation algorithms, explore next-generation paradigms for speech/audio generation, and push the boundaries of speech generation capabilities. 2. Investigate cutting-edge multimodal voice foundation model technologies to enhance voice interaction experiences by integrating text, speech, and vision. 3. Lead the technical R&D of voice foundation models, driving model performance improvements and innovative applications. Who We Look For

Job Requirements: 1. Master’s or Ph.D. in Computer Science, Artificial Intelligence, Electronic Engineering, Signal Processing, or related fields. 2. Research or development experience in one or more areas: voice foundation models, speech synthesis, speech recognition, audio generation, voice conversion, or speech codec. 3. Familiarity with mainstream voice-enabled large models (e.g., GPT4o, GLM-4-Voice, Qwen2.5-Omni, Voila). Prior project experience is preferred. 4. Proficient in deep learning frameworks (e.g., PyTorch). Experience with large-scale model training frameworks (Megatron/Deepspeed) is a plus. 5. Solid understanding of large model architectures and principles. Experience in large-scale pretraining or post-training is preferred. Location State(s) US-Washington-Bellevue The expected base pay range for this position in the location(s) listed above is $149,000.00 to $279,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience.Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis.Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year.Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals. Similar Jobs (3)

Research Scientist – Speech and Audio Understanding (Large Models & Multimodal Systems) remote type Onsite locations US-Washington-Bellevue time type Full time posted on Posted 8 Days Ago Senior Researcher: Artificial General Intelligence (Natural Language Processing) remote type Onsite locations US-Washington-Bellevue time type Full time posted on Posted 30+ Days Ago Vision Researcher – Multimodal Understanding & Generation in Foundation Models remote type Onsite locations US-Washington-Bellevue time type Full time posted on Posted 8 Days Ago Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world. Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

#J-18808-Ljbffr