Logo
Bank of America

Software Engineer III -Gen AI Inferencing

Bank of America, Newark, Delaware, United States, 19711

Save Job

Software Engineer III -Gen AI Inferencing

Software Engineer III -Gen AI Inferencing at Bank of America. This role focuses on design, build, and operation of reusable toolkits for Gen AI RAG capabilities and delivering secure, scalable, high-performance AI capabilities across the organization. Position Summary

Join a groundbreaking team at Bank of America, at the forefront of innovation in AI. We are building the next generation of Gen AI platform, empowering new AI initiatives across Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that enables secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what’s possible with AI. This position is focused on design, build, and operate of reusable toolkits for Gen AI RAG capabilities. This job is responsible for developing and delivering complex requirements to accomplish business goals, ensuring software meets functional, non-functional, and compliance requirements, with maintainability, ease of integration, and testing built-in from the outset. Job expectations include strong knowledge of development and testing practices and architectural patterns. Responsibilities

Codes solutions and unit tests to deliver a requirement/story per defined acceptance criteria and compliance requirements Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity Mentors other software engineers and coaches team on CI/CD practices and automating tool stacks Executes story refinement, defines requirements, and estimates work necessary to realize a story through the delivery lifecycle Performs spike/proof of concept as necessary to mitigate risk or implement new ideas Automates manual release activities Designs, develops, and maintains automated test suites (integration, regression, performance) Utilizes multiple architectural components across data, application, and business layers in design and development Manages multiple priorities and engages with multiple teams Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle Be vocal and actively participate in all sessions with business stakeholders and agile teams Collaborates with product teams, data analysts, and data scientists to design and build solutions Required qualifications

5+ years of OOP in Python/Scala/Java with expert level development skills Experience with AI/ML/GenAI lifecycle management and development; hands-on experience building frameworks using MLOps, fine-tuning techniques, and inference frameworks Experience deploying models using vLLM/Triton Inference Server in containers in production with automation; CI/CD; performance tuning for higher throughput Track record of maintaining large-scale Python/Unix-based systems Hands-on experience with GenAI RAG processes for use cases including chunking, embedding, retrieval, reranking, and summarization Experience in application development with MongoDB, Redis, Angular/React, containerization, API-based applications using FASTAPI, JWT, API Gateway Develop efficient utilities, automation frameworks, and data science platforms for across multiple Data Science teams Experience working in large teams on a shared multi-repo codebase with IDEs, CI/CD, and testing Strong automation, scripting, Python development, and DevOps experience with tools such as GIT/Bitbucket, Jenkins, SonarQube/pytest, Artifactory, Ansible Desired Qualifications

Experience building and deploying Gen AI inferencing platform with open-source toolsets, including AI Gateway, Policy store, and Observability for RAG/MCP use cases Experience driving quality, innovation, and experimentation culture Research on new tools/capabilities for UI/UX in an advanced analytics platform; prototyping and participation in user forums Skills

Application Development Automation Influence Solution Design Technical Strategy Development Architecture Business Acumen DevOps Practices Result Orientation Solution Delivery Process Analytical Thinking Collaboration Data Management Risk Management Test Engineering Shift

1st shift (United States of America) Hours Per Week

40 Seniority level

Mid-Senior level Employment type

Full-time Job function

Engineering and Information Technology Industries: Banking

#J-18808-Ljbffr