Fidelity National Information Services Inc
Site Reliability Engineer Specialists
Fidelity National Information Services Inc, Jacksonville, Florida, United States, 32290
Site Reliability Engineer Specialist
FIS Management Services, LLC seeks Site Reliability Engineer Specialists in Jacksonville, FL to apply developed knowledge of advanced software reliability engineering (SRE) principles to develop infrastructure processes that enhance the reliability and scalability of the organization's service and software systems. Implement systems for facilitating application maintenance tasks that play a pivotal role in ensuring the organization's software applications and systems remain reliable amidst frequent updates. Act as an SRE specialist, benchmarking industry best practices and methodologies for minimizing downtime and improving software system reliability in terms of availability and performance. Coordinate capacity planning efforts to ensure that systems can handle current and future loads, working with cross-functional teams to scale infrastructure resources appropriately based on demand. Consult on incident response efforts and root-cause investigation activities, diagnosing and resolving complex issues promptly to ensure minimal downtime and mitigate system interruptions. Interface regularly with development teams to embed reliability and scalability considerations into the design and integration of new software components and system features. Implement monitoring processes, alerting systems, and response tools to detect and respond to anomalies in system behavior, driving proactive issue identification and prompt responses. Coordinate and communicate with security teams to ensure that reliability efforts align with the organization's security policies and adhere to external regulatory compliance standards. Coach and mentor junior-level engineers, providing guidance on SRE best practices for building reliable and scalable system components. Ensure that technological duties associated with cloud computing, including design, planning, management, maintenance, and support are completed. Define application SLA/SLO/SLI goals and metrics in coordination with Product and Development teams. REQUIREMENTS: Bachelor's degree or foreign equivalent in Computer Science, Computer Engineering, or related field and seven (7) years of progressively responsible experience in the job offered or a related occupation: utilizing Site Reliability Engineering principles and methodologies to identify and improve architecture and engagement processes; understanding network architecture security and manages network certificate policies, renewals, and processes; leading efforts for upgrades and improvement of architecture, by keeping up-to-date on current industry standards; utilizing in-depth understanding of REST and XML APIs to support the API usage, of both proprietary and third-party vendor and customized gateway applications; working with general banking environment and how banks utilize digital banking products; working with Microsoft Azure and Amazon AWS public cloud environments; utilizing Akamai Global Traffic Management (GTM) tools to secure applications and route traffic to maintain performance and in disaster recovery scenarios; and utilizing Harness and Jenkins for Continuous Integration/Continuous Delivery platform tools. Telecommuting and/or working from home may be permissible pursuant to company policies.
FIS Management Services, LLC seeks Site Reliability Engineer Specialists in Jacksonville, FL to apply developed knowledge of advanced software reliability engineering (SRE) principles to develop infrastructure processes that enhance the reliability and scalability of the organization's service and software systems. Implement systems for facilitating application maintenance tasks that play a pivotal role in ensuring the organization's software applications and systems remain reliable amidst frequent updates. Act as an SRE specialist, benchmarking industry best practices and methodologies for minimizing downtime and improving software system reliability in terms of availability and performance. Coordinate capacity planning efforts to ensure that systems can handle current and future loads, working with cross-functional teams to scale infrastructure resources appropriately based on demand. Consult on incident response efforts and root-cause investigation activities, diagnosing and resolving complex issues promptly to ensure minimal downtime and mitigate system interruptions. Interface regularly with development teams to embed reliability and scalability considerations into the design and integration of new software components and system features. Implement monitoring processes, alerting systems, and response tools to detect and respond to anomalies in system behavior, driving proactive issue identification and prompt responses. Coordinate and communicate with security teams to ensure that reliability efforts align with the organization's security policies and adhere to external regulatory compliance standards. Coach and mentor junior-level engineers, providing guidance on SRE best practices for building reliable and scalable system components. Ensure that technological duties associated with cloud computing, including design, planning, management, maintenance, and support are completed. Define application SLA/SLO/SLI goals and metrics in coordination with Product and Development teams. REQUIREMENTS: Bachelor's degree or foreign equivalent in Computer Science, Computer Engineering, or related field and seven (7) years of progressively responsible experience in the job offered or a related occupation: utilizing Site Reliability Engineering principles and methodologies to identify and improve architecture and engagement processes; understanding network architecture security and manages network certificate policies, renewals, and processes; leading efforts for upgrades and improvement of architecture, by keeping up-to-date on current industry standards; utilizing in-depth understanding of REST and XML APIs to support the API usage, of both proprietary and third-party vendor and customized gateway applications; working with general banking environment and how banks utilize digital banking products; working with Microsoft Azure and Amazon AWS public cloud environments; utilizing Akamai Global Traffic Management (GTM) tools to secure applications and route traffic to maintain performance and in disaster recovery scenarios; and utilizing Harness and Jenkins for Continuous Integration/Continuous Delivery platform tools. Telecommuting and/or working from home may be permissible pursuant to company policies.