ByteDance
Join us as we work together to inspire creativity and enrich life around the globe.
Location: Seattle
Team: Infrastructure
Employment Type: Regular
Responsibilities
The ByteDance DPU (Data Processing Unit) team is building the foundational computing infrastructure for ByteDance and Volcano Engine Public Cloud. Our mission is to advance the architecture, development, and research of next-generation software-hardware technologies across compute, networking, and storage for cloud and AI computing. Our technology stack spans cloud virtualization & hypervisors, high-performance user-space network protocols (DPDK, RDMA, etc.), high-speed interconnect and virtual switching, distributed storage acceleration, and GPU virtualization and scheduling for AI/ML workloads. Responsibilities include: Design and develop DPU network software with a focus on high performance, low latency, and reliability. Collaborate with hardware teams to build software-hardware co-design solutions for networking and storage acceleration. Explore AI/ML infrastructure acceleration, leveraging DPUs, GPUs, and custom hardware to optimize distributed training and inference. Drive end-to-end performance optimization, from OS kernels and drivers to user-space runtime systems. Contribute to architecture design, technical proposals, and long-term research directions. Qualifications
Minimum Qualifications: B.S./M.S. in Computer Science, Computer Engineering, or related fields; or Ph.D. with strong research/publications. 3+ years of relevant industry experience (exception for Ph.D. with strong background). Proficiency in C/C++ development and debugging. Strong Linux systems development experience. Solid understanding of compute, network architecture, and operating systems. Background in at least one of: software-hardware co-design, distributed systems, high-performance networking, or AI/ML systems. Preferred Qualifications: Ph.D. in related fields with research training and publications. Experience with software-hardware co-design (networking, storage, or distributed compute). Hands-on experience with network virtualization (OVS, SR-IOV, eBPF). Familiarity with DPDK and high-performance user-space networking. Bonus points for hardware acceleration experience, FPGA/ASIC/GPU/CUDA. Bonus points for experience with NCCL Collectives along with AI communication patterns and parallelization techniques. Proven experience designing and building AI/ML infrastructure related but not limited to inference kv cache system, data preprocessing system. About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request
#J-18808-Ljbffr
The ByteDance DPU (Data Processing Unit) team is building the foundational computing infrastructure for ByteDance and Volcano Engine Public Cloud. Our mission is to advance the architecture, development, and research of next-generation software-hardware technologies across compute, networking, and storage for cloud and AI computing. Our technology stack spans cloud virtualization & hypervisors, high-performance user-space network protocols (DPDK, RDMA, etc.), high-speed interconnect and virtual switching, distributed storage acceleration, and GPU virtualization and scheduling for AI/ML workloads. Responsibilities include: Design and develop DPU network software with a focus on high performance, low latency, and reliability. Collaborate with hardware teams to build software-hardware co-design solutions for networking and storage acceleration. Explore AI/ML infrastructure acceleration, leveraging DPUs, GPUs, and custom hardware to optimize distributed training and inference. Drive end-to-end performance optimization, from OS kernels and drivers to user-space runtime systems. Contribute to architecture design, technical proposals, and long-term research directions. Qualifications
Minimum Qualifications: B.S./M.S. in Computer Science, Computer Engineering, or related fields; or Ph.D. with strong research/publications. 3+ years of relevant industry experience (exception for Ph.D. with strong background). Proficiency in C/C++ development and debugging. Strong Linux systems development experience. Solid understanding of compute, network architecture, and operating systems. Background in at least one of: software-hardware co-design, distributed systems, high-performance networking, or AI/ML systems. Preferred Qualifications: Ph.D. in related fields with research training and publications. Experience with software-hardware co-design (networking, storage, or distributed compute). Hands-on experience with network virtualization (OVS, SR-IOV, eBPF). Familiarity with DPDK and high-performance user-space networking. Bonus points for hardware acceleration experience, FPGA/ASIC/GPU/CUDA. Bonus points for experience with NCCL Collectives along with AI communication patterns and parallelization techniques. Proven experience designing and building AI/ML infrastructure related but not limited to inference kv cache system, data preprocessing system. About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. ByteDance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://tinyurl.com/RA-request
#J-18808-Ljbffr