ByteDance
Research Scientist in Large Model System
Responsibilities

Leveraging substantial data and computing resources, and through continued investment in these domains, we have developed a proprietary general-purpose model with multimodal capabilities. In the Chinese market, Doubao models power over 50 ByteDance apps and business lines, including Doubao, Coze, and Dreamina, and are available to external enterprise clients via Volcano Engine. Today, the Doubao app stands as the most widely used AIGC application in China.

Responsibilities include:
Develop machine learning systems for the company's large-scale models, and research new applications and solutions of related technologies in areas such as search, recommendation, advertising, content creation, conversation, and customer service.
Design and develop the architecture of large-scale machine learning systems, solving technical challenges such as high concurrency, high reliability, and high scalability.
Cover the sub-directions of machine learning systems, including resource scheduling, model training, model inference, data management, and workflow orchestration.
Research and introduce advanced technologies in machine learning systems, such as the latest hardware architectures, heterogeneous computing systems, and compiler-based optimization technologies.
Work closely with the algorithm teams to jointly optimize algorithms and systems.

Qualifications

Minimum Qualifications:
Excellent coding ability, a solid foundation in data structures and basic algorithms, and proficiency in C/C++ or Python.
Familiarity with at least one mainstream machine learning framework (TensorFlow/PyTorch).
Mastery of the principles of distributed systems.
Strong sense of responsibility, good learning ability, communication ability, and self-motivation.
Good communication and collaboration skills; able to explore new technologies with the team and promote technological progress.
Preferred Qualifications:
Prior experience with large-scale projects, or highly influential papers, in the field of large models.
Familiarity with LLM- and CV-related algorithms and technologies, and experience in large model training and RL algorithms.
Experience in one of the following fields: CUDA, RDMA, AI Infrastructure, HW/SW Co-Design, High-Performance Computing, ML Hardware Architecture (GPU, Accelerators, Networking), ML for System, and Distributed Storage.

Job Information

The base salary range for this position in the selected city is $177,688 - $341,734 annually. Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives and restricted stock units. Benefits may vary depending on the nature of employment and the country work location. Employees have day-one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, and wellbeing benefits, among others.

EEO Statement

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.

Reasonable Accommodation

ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws.
If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request