IBM

Site Reliability Engineer

IBM, San Jose, California, United States, 95199

Join to apply for the

Site Reliability Engineer

role at

IBM Join to apply for the

Site Reliability Engineer

role at

IBM Get AI-powered advice on this job and more exclusive features. Your Role And Responsibilities

Site Reliability Engineer, IBM Corporation, San Jose, CA: Ensure the reliability, scalability, and performance of the data analysis product. Design, develop, and optimize scalable data collection and visualization pipelines to enable efficient analysis and insights. Build and refine advanced forecasting and anomaly detection models to drive data-driven decision-making and improve system performance. Design and implement real-time dashboards using React and Node.js to provide critical performance insights. Respond to and manage incidents as part of an on-call rotation, troubleshoot system issues, and implement resolutions to enhance reliability. Automate infrastructure provisioning, configuration, and deployment through scripting and CI/CD pipelines, while establishing robust monitoring and alerting systems to proactively address anomalies. Employ Infrastructure as Code (IaC) principles to provision and maintain servers, databases, networking, and cloud resources, while planning for capacity and scalability to meet growing demands. Ensure security and compliance by implementing best practices, managing updates, and participating in audits. Collaborate with software developers, data analysts, and other stakeholders to optimize system performance, while maintaining accurate documentation and creating runbooks for operational excellence. Drive continuous learning and innovation by identifying opportunities for optimization, leveraging emerging technologies, and implementing solutions to improve system reliability and efficiency. Utilize: Data Analytics, Python, PL/SQL, Data Warehousing, ETL (Extract, Transform and Load), GIT, NumPy, Pandas, Scikit-learn.

Your Role And Responsibilities

Site Reliability Engineer, IBM Corporation, San Jose, CA:

Ensure the reliability, scalability, and performance of the data analysis product. Design, develop, and optimize scalable data collection and visualization pipelines to enable efficient analysis and insights. Build and refine advanced forecasting and anomaly detection models to drive data-driven decision-making and improve system performance. Design and implement real-time dashboards using React and Node.js to provide critical performance insights. Respond to and manage incidents as part of an on-call rotation, troubleshoot system issues, and implement resolutions to enhance reliability. Automate infrastructure provisioning, configuration, and deployment through scripting and CI/CD pipelines, while establishing robust monitoring and alerting systems to proactively address anomalies. Employ Infrastructure as Code (IaC) principles to provision and maintain servers, databases, networking, and cloud resources, while planning for capacity and scalability to meet growing demands. Ensure security and compliance by implementing best practices, managing updates, and participating in audits. Collaborate with software developers, data analysts, and other stakeholders to optimize system performance, while maintaining accurate documentation and creating runbooks for operational excellence. Drive continuous learning and innovation by identifying opportunities for optimization, leveraging emerging technologies, and implementing solutions to improve system reliability and efficiency. Utilize: Data Analytics, Python, PL/SQL, Data Warehousing, ETL (Extract, Transform and Load), GIT, NumPy, Pandas, Scikit-learn.

Required: Master’s degree or equivalent in Computer Science, Engineering or related (employer will accept a Bachelor's degree plus five (5) years of progressive experience in lieu of a Master’s degree) and one (1) year of experience as a Software Developer or related. One (1) year of experience must include utilizing Data Analytics, Python, PL/SQL, Data Warehousing, ETL (Extract, Transform and Load), GIT, NumPy, Pandas, Scikit-learn. $226158 to $267100 per year. Full time. AV161.

Required Technical And Professional Expertise

Master’s degree or equivalent in Computer Science, Engineering or related (employer will accept a Bachelor's degree plus five (5) years of progressive experience in lieu of a Master’s degree) and one (1) year of experience as a Software Developer or related. One (1) year of experience must include utilizing Data Analytics, Python, PL/SQL, Data Warehousing, ETL (Extract, Transform and Load), GIT, NumPy, Pandas, Scikit-learn. Seniority level

Seniority level Mid-Senior level Employment type

Employment type Full-time Job function

Job function Engineering and Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at IBM by 2x Get notified about new Site Reliability Engineer jobs in

San Jose, CA . Full Stack Software Engineer (L4), Product Localization Engineering

Palo Alto, CA $100,000.00-$200,000.00 1 month ago Software Engineer, AI Platform - New Grad

Mountain View, CA $145,000.00-$170,000.00 1 day ago Site Reliability Engineer, Global E-Commerce

San Jose, CA $136,800.00-$259,200.00 1 week ago New Grads 2025 - General Software Engineer

San Jose, CA $120,000.00-$165,000.00 6 months ago Santa Clara, CA $85,000.00-$155,000.00 1 day ago San Jose, CA $133,900.00-$242,000.00 4 days ago Santa Clara, CA $144,000.00-$270,250.00 4 days ago Palo Alto, CA $180,000.00-$440,000.00 2 weeks ago Redwood City, CA $100,000.00-$150,000.00 2 days ago Associate Site Reliability Engineer/Site Reliability Engineer

Redwood City, CA $116,000.00-$168,000.00 1 week ago Mountain View, CA $145,000.00-$170,000.00 5 hours ago New Grads 2025 - Software Engineer, Algorithm

San Jose, CA $120,000.00-$165,000.00 11 months ago Mountain View, CA $134,000.00-$241,900.00 5 days ago Mechanical Product Design Engineer, Platforms, University Graduate

Sunnyvale, CA $105,000.00-$151,000.00 2 weeks ago Sunnyvale, CA $104,400.00-$171,000.00 3 weeks ago Santa Clara, CA $89,000.00-$165,600.00 2 days ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr