Samsung Electronics America
Senior Staff Research Engineer, Speech Machine Learning (ASR)
Samsung Electronics America, Mountain View, California, US, 94039
Lab Summary
Bixby is an intelligent personal assistant that is available only as a built-in application on Samsung flagship devices and wearables. The application uses Natural Language Understanding to perform tasks on these devices by voice or text, including but not limited to making phone calls, sending text messages, setting up meetings, opening apps, setting alarms and timers, getting directions, answering general questions, and providing information about restaurants and other businesses.
Position Summary
For this position, we are expanding our Advanced Intelligence Labs (AIL) voice technology and features to include advanced research and projects in wake word detection, Automatic Speech Recognition (ASR), including acoustic and language modeling, and personalization. We also work on language and gender detection from speech signals, and on speaker identification, verification, and diarization techniques. At AIL, we perform state-of-the-art research on multilingual and accented speech and bring those research ideas to production. We are looking for candidates with extensive expertise in digital signal/speech processing with a specialization in speech recognition, research expertise demonstrated by publications in reputable journals and conferences, and excellent knowledge of deep learning and machine learning, with 7+ years of industry experience. Candidates are expected to work in a fast-paced environment.
Position Responsibilities
Architect and design end-to-end Automatic Speech Recognition products, applications, and solutions for specific business needs, and provide implementation guidance during delivery
Leverage, customize, and implement ASR models, algorithms, and methodologies to improve the overall quality of ASR in various applications and systems
Analyze and evaluate the performance of ASR systems and provide design recommendations
Analyze and make the right technological choices for generative AI solutions
Design and prototype reusable components for LLM-based solutions for ASR
Architect components of an ASR solution to address Responsible AI & Security
Collaborate seamlessly with diverse, cross-functional teams to accurately identify and prioritize requirements, ensuring that the language model meets the needs and expectations of various stakeholders
Create and maintain comprehensive technical documentation that clearly captures the details of the language model, supporting understanding, efficient troubleshooting, and future development
Apply the transformer architecture, a deep learning model widely used in natural language processing and computer vision, to optimize the language model's performance and efficiency
Use transformer architectures to process and transform large volumes of data, improving the language model's accuracy and versatility
Ensure ethical AI development practices, prioritizing fairness, transparency, and privacy
Required Skills
MS or Ph.D. in Computer Science or Digital Signal Processing or equivalent combination of education, training, and experience
7+ years of relevant professional experience in Machine Learning or a related field
Experience with TensorFlow, PyTorch, or similar frameworks
Experience with advanced architectures such as Transformers, Conformers, and other advanced models for ASR systems
Working experience with ASR in large-scale production systems
Experience in modeling ML algorithms on GPUs at scale
Experience with multilingual speech and low-resource speech research and architectures
Working experience deploying recognition engines on both server and edge devices
Experience with acoustic modeling, noise and ambient modeling, and their effects on ASR
Knowledge of state-of-the-art Large Language Models such as DeepSeek, GPT, and BERT variants, and of deep fusion techniques, is essential
Experience with WFST, n-gram, and other shallow fusion techniques for named entity recognition
Experience with speaker recognition, wake-up, and audio-based language recognition is desirable
Experience improving ASR performance in far-field and noisy environments
Working experience with masking-based and spectral-restoration-based noise suppression and speech enhancement techniques
Experience developing advanced classification models such as ECAPA-TDNN for speaker and gender classification
Ability to develop project plans and experience executing them
Research expertise in ML and written research publications
Programming experience in C/C++, Python, and Java
Ability to lead a mid-size team
Additional Information
Disclosure of Trade Secrets
Samsung has a strict policy on trade secrets. In applying to Samsung and progressing through the recruitment process, you must not disclose any trade secrets of a current or previous employer.
Essential Job Functions
This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.
Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email sratalent@samsung.com.
Equal Employment Opportunity
At Samsung, we believe that innovation and growth are driven by an inclusive culture and a diverse workforce. We aim to create a global team where everyone belongs and has equal opportunities, inspiring our talent to be their true selves. Together, we are building a better tomorrow for our customers, partners, and communities.
Samsung Research America is committed to employing a diverse workforce and provides Equal Employment Opportunity for all individuals regardless of race, color, religion, gender, age, national origin, marital status, sexual orientation, gender identity, status as a protected veteran, genetic information, status as a qualified individual with a disability, or any other characteristic protected by law.
For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to this link: Pay Transparency