Postdoctoral Appointee Data Infrastructure and Software for AI
Argonne National Laboratory - Posen, Illinois, United States, 60469
Work at Argonne National Laboratory
Overview
- View job
Overview
The Argonne Leadership Computing Facility's (ALCF) mission is to accelerate major scientific discoveries and engineering breakthroughs for humanity by designing and providing world-leading computing facilities in partnership with the computational science community. We help researchers solve some of the world's largest and most complex problems with our unique combination of supercomputing resources and computational science expertise. The ALCF has an opening for a postdoctoral position in data management targeting AI applications at scale. This postdoc will join the AL/ML group, a vibrant multidisciplinary team of scientists and High-Performance Computing (HPC) engineers. In the AL/ML group, we work at the forefront of HPC to push scientific boundaries, carrying out research and development in state-of-the-art data management, machine learning and statistics techniques. With the advancement of Exascale systems and the variety of novel AI hardware designed to accelerate both training and inference, the ALCF is studying the application of these techniques to a variety of our science applications, including but not limited to: Computational Chemistry, Plasma Physics, High Energy Physics, analysis of Light Source data such as that from the Advanced Photon Source, Biology, Astronomy, and other science disciplines. This postdoc will have a rare chance to work on exascale supercomputing systems and novel AI hardware to help solve significant real-world problems using machine learning and deep learning. ALCF researchers work in a highly collaborative environment involving science application teams, academia, and industry, as well as other national labs and agencies, to solve some of the world's largest and most complex problems in science and engineering. Objective
The goal for this postdoctoral position is to work on development and scaling of the data infrastructure and software for AI applications on supercomputing systems and AI testbed systems. The postdoc will work on multimodal data management for science applications, focusing on integrating, organizing, and analyzing diverse data types to accelerate scientific discovery and innovation. A primary goal will be to research and develop data management infrastructure leveraging Aurora's storage system, DAOS, a Distributed Asynchronous Object Storage system, to meet the needs of AI-driven applications. Another goal would be to evaluate vector databases and retrieval-augmented generation (RAG) on supercomputers to address the requirements of AI fine-tuning and inference applications. The candidate will work with diverse applications and on diverse systems, including object stores, memory-X and filesystems, to research and develop strategies to evaluate, profile and optimize AI applications at scale. Another key objective is to help us with design of future systems and data-management solutions to meet needs of our applications. Benefit To Alcf
This postdoc position will help ALCF evaluate and integrate data infrastructure needed to better facilitate AI models, including training, fine-tuning and inferencing, at scale. It will help us better understand and improve DAOS to meet the needs of AI-driven science applications. We expect the postdoc to help prototype, benchmark, and evaluate strategies to better support these workloads for Aurora. Position Requirements
A recent PhD (within 5 years) in computer science, computational science, a physical science, or engineering or related field. Comprehensive experience programming in one or more programming languages such as Python, C/C++. Experience with one of the AI frameworks, such as PyTorch or TensorFlow. Effective written and oral communications skills. Ability to model Argonne's Core Values: Impact, Safety, Respect, Integrity, and Teamwork. Experience with MPI on supercomputers. Experience in writing technical papers and presentations. Ability to create, maintain, and support high-quality software is essential. The successful candidate will be expected to work with and contribute to domain-specific software and models. Experience with version control software such as git is essential. Postdoctoral Appointee Long-Term (Fixed Term) Full time The expected hiring range for this position is $70,758.00 - $110,379.55. Please note that the pay range information is a general guideline only. The pay offered to a selected candidate will be determined based on factors such as, but not limited to, the scope and responsibilities of the position, the qualifications of the selected candidate, business considerations, internal equity, and external market pay for comparable jobs. Additionally, comprehensive benefits are part of the total rewards package. Argonne National Laboratory is committed to a safe and welcoming workplace that fosters collaborative scientific discovery and innovation. Argonne encourages everyone to apply for employment. Argonne is committed to nondiscrimination and considers all qualified applicants for employment without regard to any characteristic protected by law.