Senior Software Engineer - Reliability Engineering (Remote)

Worldwide Salaried Open

With a career at The Home Depot, you can be yourself and also be part of something bigger.

Position Purpose: The Senior Software Engineer for Site Reliability drives the platform's stability, scalability, and performance. This role enhances product reliability by engineering automated solutions for complex infrastructure and operational challenges, including leveraging AI-assisted tooling and prompt engineering to accelerate incident diagnosis, automate remediation workflows, and generate actionable insights from operational data.

Key responsibilities include championing application availability and efficiency through proactive monitoring, performance tuning, and strategic improvements. The engineer will lead post-mortems, create automation to reduce operational toil (applying AI agents and large language models where they deliver measurable efficiency gains), and partner with product owners and developers to enable the deployment of reliable, high-performing services. This position participates in tool selection, assists with capacity planning, and builds the monitoring and alerting to meet business-defined Service Level Objectives (SLOs). Within a collaborative team, the role also involves mentoring less experienced engineers to foster a culture of operational excellence and practical AI fluency.

Key Responsibilities:

50% Delivery and Execution - Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide; Takes on new opportunities and tough challenges with a sense of urgency, high energy, and enthusiasm; Consistently achieves results, even under tough circumstances; Develops test suites (functional, destructive, etc.) to enable successful, rapid deployment of code to production; Takes a broad view when approaching issues, using a global lens.

20% Learns and Grows - Learns through successful and failed experiments when tackling new problems; Actively seeks ways to grow and be challenged using both formal and informal development channels.

20% Plans and Aligns - Collaborates with other team members in agile processes; Creates new and better ways for the organization to be successful; Works with the Product Team to ensure user stories are valuable, developer ready, easy to understand, and testable; Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences; Adapts approach and demeanor in real time to match the shifting demands of different situations; Relates openly and comfortably with diverse groups of people.

10% Supports and Enables - Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions.

Direct Manager/Direct Reports: This position typically reports to Software Engineer Manager or Sr. Manager. This position has 0 Direct Reports.

Travel Requirements: No travel required.

Physical Requirements: Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.

Working Conditions: Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.

Minimum Qualifications: Must be eighteen years of age or older.
Must be legally permitted to work in the United States.

Preferred Qualifications:
GCP Cloud Infrastructure: BigQuery analytics, ADC auth, cloud-native services
Observability: Grafana, Prometheus, Kibana/Elasticsearch (WES logs), OCP Health Dashboards
Terraform Enterprise: Infrastructure as Code
GitHub: SCM
GH Copilot + AI Agents: AI-accelerated incident analysis, automated remediation workflows, prompt-engineered operational tooling
SRE Practices: Production Readiness Review, Capacity Planning, Change Validation, Prod Support, Post-Mortems, SLO Definition & Tracking
ServiceNow: Incident, Problem, and Change management; trend analysis; RCA grouping
BigQuery: Incident analytics, problem candidate identification, operational reporting
PagerDuty: On-call scheduling, escalation paths, push-button paging
Rundeck: Self-heal automation, push-button remediation jobs
Atlassian (Jira/Confluence): RCA documentation, runbooks, architecture diagrams, onboarding
CyberArk: Privileged access for WMS/DFC log pulls and node access
Manhattan WMS: Warehouse Management System operations, RF/UI/LM node support
Python Automation: Operational scripting, BQ pipelines, alert correlation, report generation

Minimum Education: The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.
Preferred Education: No additional education
Minimum Years of Work Experience: 3
Preferred Years of Work Experience: No additional years of experience
Minimum Leadership Experience: None
Preferred Leadership Experience: None
Certifications: None

Competencies: Global Perspective, Manages Ambiguity, Nimble Learning, Self-Development, Collaborates, Cultivates Innovation, Situational Adaptability, Communicates Effectively, Drives Results, Interpersonal Savvy

For California, Colorado, Connecticut, Rhode Island, Nevada, New York City, Ithaca (NY), Westchester County (NY), and Washington residents: The pay range for this position is between $90,000.00 and $180,000.00.
