Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo
Contact sales

We’d love to see how we can streamline your hiring together.

Request a demo

Site Reliability / Observability Engineer

Category :

Information Technology

Employment type :

Contract To Hire

Reference :

BH-393621

Site Reliability / Observability Engineer
Plano Hybrid (3 days)
12+ month Contract to Hire
 
MUST HAVES:
  • Python automation/scripting
  • AWS (ECS)
  • Splunk (3 years)
  • New Relic (3 years)
  • Observe (1-2 yrs) – Observe, Inc AI Observability platform
 
Software Development: Writing scripts (Python) for network tools, protocols, automation scripts (Ansible), and managing APIs.
Network Design & Implementation: ensuring optimal performance and security.
Automation: Creating self-service tools and scripts to simplify network management and reduce manual intervention.
Troubleshooting: Diagnosing complex issues at the intersection of software and hardware.
 
Position Overview
We are seeking a highly analytical and automation-focused Site Reliability / Observability Engineer to join a team. This role is responsible for ensuring the stability, reliability, and availability of production systems across multiple regions.
The ideal candidate will have deep experience in observability platforms (New Relic, Splunk, Observe), strong Python automation skills, and hands-on AWS experience. This is a daytime position supporting a 24x7 operational environment.
 
Key Responsibilities
Observability & Incident Triage (75%)
  • Monitor and manage alerts across New Relic, Splunk, and Observe (AI Observability platform).
  • Review logs, correlate data across systems, and “connect the dots” to identify root causes.
  • Analyze patterns and trends in system behavior and make data-driven recommendations.
  • Triage low-severity incidents and support escalation workflows.
  • Improve alert quality by reducing noise and refining monitoring strategies.
  • Partner with engineering teams to enhance visibility into system performance and reliability.
 
Automation & Reliability Engineering (25%)
  • Develop and maintain Python-based automation scripts to improve operational efficiency.
  • Build automation for failovers across different regions in AWS.
  • Create self-service tools to reduce manual intervention and streamline support processes.
  • Use tools such as Ansible and APIs to automate infrastructure and operational workflows.
  • Improve system resilience through automation-first design principles.
 
Infrastructure & Network Support
  • Support AWS environments, particularly ECS-based deployments.
  • Assist with network design and implementation to ensure performance, scalability, and security.
  • Troubleshoot complex issues at the intersection of software, infrastructure, and networking.
  • Contribute to continuous improvement initiatives focused on reliability engineering best practices.
 
Required Qualifications
  • 3+ years of Splunk experience
  • 3+ years of New Relic experience
  • 1–2 years of Observe (Observe, Inc.) experience
  • Strong Python scripting and automation skills
  • Hands-on experience with AWS (ECS preferred)
  • Experience building automation using Ansible and APIs
  • Proven experience reviewing logs, correlating data, and performing root cause analysis
  • Experience supporting production systems in a 24x7 environment
 


Estimated Min Rate: $52.50
Estimated Max Rate: $75.00


What’s In It for You?
We welcome you to be a part of the largest and legendary global staffing companies to meet your career aspirations. Yoh’s network of client companies has been employing professionals like you for over 65 years in the U.S., UK and Canada. Join Yoh’s extensive talent community that will provide you with access to Yoh’s vast network of opportunities and gain access to this exclusive opportunity available to you. Benefit eligibility is in accordance with applicable laws and client requirements. Benefits include:

  • Medical, Prescription, Dental & Vision Benefits (for employees working 20+ hours per week)
  • Health Savings Account (HSA) (for employees working 20+ hours per week)
  • Life & Disability Insurance (for employees working 20+ hours per week)
  • MetLife Voluntary Benefits
  • Employee Assistance Program (EAP)
  • 401K Retirement Savings Plan
  • Direct Deposit & weekly epayroll
  • Referral Bonus Programs
  • Certification and training opportunities

Note: Any pay ranges displayed are estimations. Actual pay is determined by an applicant's experience, technical expertise, and other qualifications as listed in the job description. All qualified applicants are welcome to apply.

Yoh, a Day & Zimmermann company, is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Visit https://www.yoh.com/applicants-with-disabilities to contact us if you are an individual with a disability and require accommodation in the application process.

For California applicants, qualified applicants with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. All of the material job duties described in this posting are job duties for which a criminal history may have a direct, adverse, and negative relationship potentially resulting in the withdrawal of a conditional offer of employment.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

By applying and submitting your resume, you authorize Yoh to review and reformat your resume to meet Yoh’s hiring clients’ preferences. To learn more about Yoh’s privacy practices, please see our Candidate Privacy Notice:  https://www.yoh.com/privacy-notice

03-04-2026

Site Reliability / Observability Engineer

Information Technology

Apply Now
Create As Alert

Share this Job

Interested in this job?
Save Job
SCHEMA MARKUP ( This text will only show on the editor. )