American International Group
Production Support Lead, GenAI
American International Group, Atlanta, Georgia, United States, 30383
At AIG, we’re reshaping how the world manages risk, and we’re inviting you to be a key part of that transformation. As our **Production Support Tech Lead - GenAI**, you will have the opportunity to make a meaningful impact, leveraging and further developing your skills to guide groundbreaking AI initiatives. If you’re looking for a place to grow your career and where your contributions will shape the future, AIG is where you belong.
As **Production Support Lead**, you will be at the forefront of creating the Operational Excellence and Business Continuity plan for all applications under the GenAI Assist program: staying compliant and managing risk to the production environment, maintaining clear and detailed knowledge management, monitoring cost and usage, driving Innovation & Continuous Improvement, and defining and establishing Metrics & Performance. This is a unique opportunity to shape our AI-driven future, fostering innovation through modern development practices and delivering high-impact software that will drive business growth.

* Apply industry best practices to establish a standard support model and process for our GenAI Assist Platform.
* Test, debug, and analyze incidents reported by users, as needed.
* Coordinate change management for production releases and upgrades.
* Ensure that production changes are reviewed and validated in production to avoid or minimize production issues.
* Develop and maintain monitoring dashboards for the various components of the application.
* Problem Management: support and drive detailed root cause analysis of production issues, and work with the Application Delivery, platform, DevOps, and Business teams to address and remediate them.
* Provide regular verbal and written communications regarding status, risks, and issues, and make recommendations for remediation or change.
* Work with development teams to ensure functionality deployments are completed without incident and in adherence to proper procedure.
* Identify opportunities for application tuning.
* Review, maintain, and enhance application Run Books.

* 10+ years of experience supporting application platforms at the enterprise level, including incident management, patches, vulnerabilities, change management, monitoring, and continued platform improvement.
* Good knowledge of the process for handling L2/L3 production issues.
* Hands-on experience collaborating with multiple stakeholders and managers to perform incident management, conduct RCAs, and drive continuous improvement in application stability.
* Experience configuring and maintaining observability capabilities across critical applications and infrastructure, ensuring stability and visibility into application health and usage.
* Hands-on experience monitoring production applications with tools such as Dynatrace, Splunk, and CloudWatch.
* Experience identifying performance bottlenecks and issues.
* Experience developing and supporting Java J2EE applications using Spring, Spring Boot, Microservices, REST APIs, Angular, XML, and JSON.
* Experience developing detailed run book and Knowledge Management documentation.
* Good experience with Splunk search, dashboard creation, and alert generation.
* Experience working on global Agile development teams.
* Experience providing end-user communications for critical issues with ETAs, and conducting daily application health checks to ensure the smooth running of the application.
* You are knowledgeable about ethical standards and controls for Generative AI and have supported AI/ML-based enterprise applications within large enterprises.
* You have development experience on enterprise applications.