Alibaba Cloud
Site Reliability Engineer-Database-OLTP platform
Alibaba Cloud, Sunnyvale, California, United States, 94087
Base pay range: $133,200.00/yr - $219,600.00/yr
This range is provided by Alibaba Cloud. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
The Alibaba Cloud Database Team is at the forefront of cloud database technology, driving innovation and excellence in providing robust, scalable, and secure database solutions. Our OLTP group focuses on developing cutting-edge transaction processing systems that power millions of businesses worldwide. Our team thrives on innovation, pushing the boundaries of what's possible in cloud-native database solutions. We have pioneered several groundbreaking features, including storage-compute separation and shared-storage architectures, which allow us to deliver exceptional performance and scalability. Furthermore, our commitment to excellence has been recognized through numerous awards and accolades.

We believe in fostering a culture of continuous learning and growth. As a member of our OLTP group, you will collaborate closely with leading experts in database technology and contribute to projects that impact millions of users worldwide. Whether it's optimizing query performance, enhancing concurrency control mechanisms, or designing new features to support emerging use cases, your work will make a tangible difference.

We are looking for a Site Reliability Engineer (SRE) specialized in the database domain to support the stable operation of Alibaba Cloud's OLTP platform. This role combines software and systems engineering to ensure the reliable operation of the platform and to provide stable OLTP database services to customers.

Responsibilities include but are not limited to:
· Ensuring System Stability and High Availability: Perform health checks on components within the database foundational platform, develop maintenance tools for routine inspections, and identify and resolve potential risks in advance.
· Development of Operations Platforms and Tools: Design and implement automated operations platforms that can maintain large-scale online clusters. Monitor and maintain operational metrics, optimizing the system through data analysis. Participate in solving capacity, performance, and stability issues in production systems.
· Ensuring System Stability and High Availability: Design and implement high-availability systems, such as automatic fault localization, automatic recovery, and adaptive disaster recovery, and adopt cloud-native technologies to ensure continuous business availability.
· Incident Handling and Emergency Response: During major events like promotional sales, ensure a smooth user experience under massive peak loads while maintaining cost control. Handle live network issues, including fault diagnosis, disaster recovery, intelligent scheduling, elastic scaling, and anti-attack measures.
· Close Collaboration with Development Teams: Work closely with product teams to promptly identify and optimize technical architectures, improving service response latency and performance, and enhancing service availability. Actively participate in the discussion and design of business solutions, promoting optimization and improvement of services.

Qualifications:
· Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience.
· 4+ years of work experience as a Site Reliability Engineer in the domain of databases or other cloud products.
· Familiarity with the basic principles of the Linux kernel and with common tools and commands, plus good diagnostic and optimization skills.
· Proficiency in at least one of the following languages: Java, Python, Go, C++, with experience in developing operations and maintenance tools.
· Familiarity with open-source cloud platforms such as Kubernetes, OpenStack, and Cloud Foundry.
· Experience with relational databases like MySQL, SQL Server, and PostgreSQL, as well as open-source databases and queue products like Redis, MongoDB, HBase, Cassandra, Kafka, and Elasticsearch; knowledge of their principles or operational experience is a plus.
· Experience in operating large-scale distributed systems, with proficiency in at least one major cloud platform.
· Excellent problem-solving and analytical skills.

The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
Seniority level: Entry level
Employment type: Full-time
Industries: Software Development