Logo
Itlearn360

Site Reliability Engineer (SRE) - NYC, NY at Lorven technologies New York, NY

Itlearn360, New York, New York, us, 10261

Save Job

Site Reliability Engineer (SRE) - NYC, NY job at Lorven technologies. New York, NY. Our client seeks an Site Reliability Engineer (SRE) for a

Full time

project in

New York City, NY

.

Below is the detailed requirement Title: Site Reliability Engineer (SRE) Location : New York City, NY Duration: Full Time Job Description: Bachelor's degree preferably in Computer Science, Information technology, Computer Engineering, or related IT discipline or equivalent experience with 12+ Minimum Experience Programming - Experience with languages like Python, Java, C/C++, or Ruby can be beneficial along with IaC languages (Ansible, Terraform, and Cloud Native). Cloud Platforms - Knowledge of cloud platforms like AWS, Azure, or GCP is highly valued. Containerization - Familiarity with container technologies like Docker and Kubernetes is essential. Networking and System Administration - Strong understanding of networking and system administration principles is crucial. CI/CD - Experience with CI/CD tools like Jenkins, Harness, or Spinnaker is valuable Automation - automate tasks (scripts and triggers and workflow automations) for deployment, monitoring, and incident response (improve efficiency and reduce manual effort) Monitoring and Observability design instrumentation and identify KPIS/Metrics and identify Events/ing to track system health and identify potential issues proactively. Incident Response - responsible for responding to and resolving incidents that have exceeded L1/L2 thresholds. Work with L3 teams to ensure minimal downtime and a quick return to normal operations as well as identifying and following up on problem backlogs and shift left initiatives. Infrastructure as Code (IaC) - Use tools like Terraform or Ansible to manage infrastructure as code, enabling repeatable and scalable deployments. Collaboration - Work closely with architecture, development, QA and Testing, and Operations teams to understand system requirements and contribute to the overall resilience of the software/platform.

#J-18808-Ljbffr