Bank of America
Software Engineer III-Gen AI Inferencing
Bank of America, Newark, Delaware, United States, 19711
Overview
The Software Engineer III-Gen AI Inferencing role at Bank of America focuses on the design, build, and operation of reusable toolkits for Gen AI RAG capabilities, and on developing and delivering complex requirements to accomplish business goals. It emphasizes meeting functional, non-functional, and compliance requirements with maintainability, ease of integration, and testability built in from the outset.
Position Summary
Join a groundbreaking team at Bank of America, at the forefront of innovation in AI. We are building the next-generation Gen AI platform, empowering new AI initiatives across the Consumer, Small Business, Global Banking, and Wealth organizations. This is a unique opportunity to contribute to a critical platform that enables secure, scalable, and high-performance AI capabilities across the organization. We value curiosity, collaboration, and a passion for pushing the boundaries of what's possible with AI. This position is focused on the design, build, and operation of reusable toolkits for Gen AI RAG capabilities. This job is responsible for developing and delivering complex requirements to accomplish business goals. Key responsibilities include ensuring software is developed to meet functional, non-functional, and compliance requirements, and that solutions are well designed with maintainability, ease of integration, and testing built in from the outset.
Responsibilities
Codes solutions and unit tests to deliver a requirement/story per the defined acceptance criteria and compliance requirements
Designs, develops, and modifies architecture components, application interfaces, and solution enablers while ensuring principal architecture integrity is maintained
Mentors other software engineers and coaches the team on CI/CD practices and automating tool stacks
Executes story refinement, definition of requirements, and estimation of the work necessary to realize a story through the delivery lifecycle
Performs spikes/proofs of concept as necessary to mitigate risk or implement new ideas
Automates manual release activities
Designs, develops, and maintains automated test suites (integration, regression, performance)
Supports multiple platforms and lines of business for data science and GenAI use cases
Utilizes multiple architectural components (across data, application, business) in design and development of client requirements
Participates in estimating work necessary to realize a story/requirement through the delivery lifecycle
Manages multiple priorities and engages with multiple teams
Is vocal and actively participates in sessions with business stakeholders and agile teams
Collaborates with product teams, data analysts, and data scientists to design and build solutions
Required Qualifications
5+ years of OOP programming experience in Python/Scala/Java with expert-level development skills
Experience with model serving platforms, i.e., AI/ML/GenAI lifecycle management and development and its ecosystem
Hands-on experience building frameworks using MLOps, fine-tuning techniques, and inference frameworks
Track record of maintaining large-scale Python/Unix-based systems
Hands-on experience and knowledge of generative AI RAG processes for various use cases, including chunking, embedding, retrieval, reranking, and summarization
Hands-on experience in application development in one or more areas: MongoDB, Redis, Angular/React, containerization, building API-based applications leveraging FastAPI, JWT integration, API Gateway
Experience developing efficient utilities, automation frameworks, and data science platforms usable across multiple Data Science teams for AI/ML and GenAI work
Experience in large teams collaborating on a shared multi-repo codebase using IDEs (e.g., VS Code), CI, CD, and Continuous Testing
Strong automation, scripting, and Python development skills
Hands-on DevOps with global enterprise tools: Version Control (Git/Bitbucket), Build Orchestration (Jenkins), Code Quality (SonarQube and pytest), Artifact Management (Artifactory), Deployment (Ansible)
Desired Qualifications
Experience developing Gen AI training and inferencing platforms with open-source models, building inferencing and servicing capabilities supporting RAG/MCP modules
Experience driving and maintaining a culture of quality, innovation, and experimentation
Research on new tools and capabilities for UI/UX improvements for an advanced analytics platform, rapid prototyping and demonstration of features, and participation in user forums
Skills
Application Development, Automation, Influence, Solution Design, Technical Strategy Development, Architecture, Business Acumen, DevOps Practices, Result Orientation, Solution Delivery Process, Analytical Thinking, Collaboration, Data Management, Risk Management, Test Engineering
Shift
1st shift (United States of America)
Hours Per Week
40
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
Banking