Back

Distinguished Site Reliability Engineer – Cloud

Worldwide Salaried Open

Job Description:

  • Lead, design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus on performance at scale, real time monitoring, logging and alerting
  • Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation and refinement
  • Support services before they go live through activities such as system design consulting, developing software tools, platforms and frameworks, capacity management and launch reviews
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems
  • Be part of an on call rotation to support production systems

Requirements:

  • BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent experience
  • 16+ years of experience with Infrastructure automation, distributed systems design, experience with design, develop tools for running large scale private or public cloud system in Production
  • Experience in one or more of the following: Python, Go, Perl or Ruby
  • In depth knowledge on Linux, Networking and Containers

Benefits:

  • equity
  • benefits

Apply tot his job Apply To this Job

More jobs

Site Reliability Engineer, IDaaS Data Platform

Worldwide Salaried

Site Reliability/Platform Engineer (Linux/ Kubernetes / Python) - 180-190K

Worldwide Salaried

Site Reliability Engineering Manager

Worldwide Salaried

Site Reliability Engineer – SkillBridge Intern

Worldwide Salaried

DevOps Engineer - Kubernetes, AWS & Docker Skills Required (Fully Remote )

Worldwide Salaried

FSO Audit LABS - Kubernetes DevOps Engineer - Senior - Bay Area

Worldwide Salaried

Team Lead, Site Reliability Engineering - Storage Layer Service

Worldwide Salaried

Site Reliability Engineer-SkillBridge Intern

Worldwide Salaried

SRE Architect + Strong Dynatrace exp

Worldwide Salaried

Software Engineer – Java, Spring Boot, Kubernetes, AWS

Worldwide Salaried

Experienced Remote Chat Support Agent – Flexible Hours, Competitive Pay, and Career Growth Opportunities at arenaflex

Worldwide Salaried

Experienced Customer Service Representative – Hybrid Work from Home Opportunity with Arenaflex

Worldwide Salaried

Experienced Customer Alarm Monitoring Agent – 2nd/3rd Shifts with arenaflex

Worldwide Salaried

Experienced Full Stack Product Manager – Customer Service Innovation and Experience

Worldwide Salaried

Senior Talent Aquisition Partner (Polish speaker) - Immediate Start

Worldwide Salaried

Mid Senior AI Cinematic Video Editor

Worldwide Salaried

Experienced Customer Practice Manager, Media & Entertainment, US - Telco, Media, Entertainment, Games, and Sports - arenaflex ProServe

Worldwide Salaried

Experienced Remote Data Entry Specialist – Logistics and Shipping Operations

Worldwide Salaried

Remote Claims Processing Clerk

Worldwide Salaried

Experienced Customer Support Agent – Remote Team at arenaflex

Worldwide Salaried