Bloomberg
Senior Software Engineer/SRE - Automated Disaster Recovery
Bloomberg, New York, New York, us, 10261
Senior Software Engineer/SRE - Automated Disaster Recovery
Apply for the
Senior Software Engineer/SRE - Automated Disaster Recovery
role at
Bloomberg .
Base pay range $160,000.00/yr - $240,000.00/yr
Location New York
Role Description The Platform Database Services Disaster Recovery as a Service (DRaaS) SRE team administers end‑to‑end testing of Bloomberg’s datacenters for disaster recovery scenarios across numerous services that support Bloomberg’s products. The team is involved in inventing, engineering, developing, building, coding, troubleshooting, and maintaining a wide range of tools, monitors, frameworks, interfaces, protocols, solutions, and best practices around Disaster Recovery. These components stitch together a robust suite of automated and self‑healing systems that manage the services the Platform Database Services SRE team provides to the rest of the firm.
What’s In It For You You will help meet company and regulatory defined Disaster Testing standards. Manage and develop solutions for disaster recovery tools, integrating the services they provide into Bloomberg’s operational environment and products. These in‑house tools are required to test our clusters and managed services in an automated, scalable, self‑driven fashion with comprehensive metrics and transparency tools for internal and external clients. Tooling is expected to be written with end‑to‑end unit testing and continuous integration to provide the highest level of stability.
Responsibilities You’ll have product ownership and classic SRE responsibilities such as system tuning, performance analysis, defining and following availability targets (SLA’s, SLO’s, SLI’s), and immediate access to other experts designing Bloomberg‑specific components, APIs, and methods supporting the disaster recovery infrastructure. You’ll gain insight into how Bloomberg applications interact and the runtime environments, enabling in‑depth troubleshooting and enhancements to stability, reliability, performance, and feature set.
Qualifications
4+ years of experience in Python and/or TypeScript
A degree in Computer Science, Engineering, or similar field of study or equivalent work experience
5+ years experience with Unix, Unix tools, and shell scripting
Experience designing stable, long‑lasting APIs
Deep understanding of TCP/IP networking and the OSI model
Experience designing and automating repeatable processes in a client/server modeled environment
Ability to build and maintain highly sophisticated, available, performant and scalable systems
Experience building monitors and alarms for system performance, status and stability
Experience with CI/CD systems and writing robust unit and system tests
Nice to Have
Basic knowledge of Rapid framework
Experience analyzing existing systems and identifying shortcomings with proven methods for improvement
Experience with Chaos Engineering
Experience with Splunk/Humio and Grafana or other metric‑based reporting tools
Experience with GitHub and JIRA
Passion for product ownership
Benefits Salary Range: $160,000 - $240,000 USD Annually + Benefits + Bonus. The referenced salary range is based on the Company’s good faith belief at the time of posting. Actual compensation may vary based on factors such as geographic location, work experience, market conditions, education/training and skill level.
We offer a comprehensive benefits program that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short and long‑term disability benefits, 401(k) + matching, life insurance, and various wellness programs. The Company does not provide benefits directly to contingent workers/contractors and interns.
Referral Program Referrals increase your chances of interviewing at Bloomberg by 2x.
#J-18808-Ljbffr
Senior Software Engineer/SRE - Automated Disaster Recovery
role at
Bloomberg .
Base pay range $160,000.00/yr - $240,000.00/yr
Location New York
Role Description The Platform Database Services Disaster Recovery as a Service (DRaaS) SRE team administers end‑to‑end testing of Bloomberg’s datacenters for disaster recovery scenarios across numerous services that support Bloomberg’s products. The team is involved in inventing, engineering, developing, building, coding, troubleshooting, and maintaining a wide range of tools, monitors, frameworks, interfaces, protocols, solutions, and best practices around Disaster Recovery. These components stitch together a robust suite of automated and self‑healing systems that manage the services the Platform Database Services SRE team provides to the rest of the firm.
What’s In It For You You will help meet company and regulatory defined Disaster Testing standards. Manage and develop solutions for disaster recovery tools, integrating the services they provide into Bloomberg’s operational environment and products. These in‑house tools are required to test our clusters and managed services in an automated, scalable, self‑driven fashion with comprehensive metrics and transparency tools for internal and external clients. Tooling is expected to be written with end‑to‑end unit testing and continuous integration to provide the highest level of stability.
Responsibilities You’ll have product ownership and classic SRE responsibilities such as system tuning, performance analysis, defining and following availability targets (SLA’s, SLO’s, SLI’s), and immediate access to other experts designing Bloomberg‑specific components, APIs, and methods supporting the disaster recovery infrastructure. You’ll gain insight into how Bloomberg applications interact and the runtime environments, enabling in‑depth troubleshooting and enhancements to stability, reliability, performance, and feature set.
Qualifications
4+ years of experience in Python and/or TypeScript
A degree in Computer Science, Engineering, or similar field of study or equivalent work experience
5+ years experience with Unix, Unix tools, and shell scripting
Experience designing stable, long‑lasting APIs
Deep understanding of TCP/IP networking and the OSI model
Experience designing and automating repeatable processes in a client/server modeled environment
Ability to build and maintain highly sophisticated, available, performant and scalable systems
Experience building monitors and alarms for system performance, status and stability
Experience with CI/CD systems and writing robust unit and system tests
Nice to Have
Basic knowledge of Rapid framework
Experience analyzing existing systems and identifying shortcomings with proven methods for improvement
Experience with Chaos Engineering
Experience with Splunk/Humio and Grafana or other metric‑based reporting tools
Experience with GitHub and JIRA
Passion for product ownership
Benefits Salary Range: $160,000 - $240,000 USD Annually + Benefits + Bonus. The referenced salary range is based on the Company’s good faith belief at the time of posting. Actual compensation may vary based on factors such as geographic location, work experience, market conditions, education/training and skill level.
We offer a comprehensive benefits program that may include merit increases, incentive compensation (exempt roles only), paid holidays, paid time off, medical, dental, vision, short and long‑term disability benefits, 401(k) + matching, life insurance, and various wellness programs. The Company does not provide benefits directly to contingent workers/contractors and interns.
Referral Program Referrals increase your chances of interviewing at Bloomberg by 2x.
#J-18808-Ljbffr