Site Reliability Engineer required to instrument observability for on-premises and cloud-based infrastructure (onsite focus) for an insurance client

Job Type: Contract
Positions to fill: 1
Start Date: Sep 04, 2023
Job End Date: May 31, 2024
Pay Rate: Hourly: Negotiable
Job ID: 131847
Location: Toronto


Environment: In office 2 days per week – Toronto, London or Winnipeg

Contract: Ends May 2024; extension is probable


Responsibilities:

Site Reliability Engineers are required to instrument observability for on-premises and cloud-based infrastructure (on-premises is the immediate need) and to provide interpretive analysis, troubleshooting, and continuous improvement.


Must Haves:

  • Strong infrastructure background (network, compute, storage, mainframe)
  • Strong experience with Splunk
  • Deep OS knowledge / troubleshooting skills (Windows, Unix)
  • Document and maintain runbooks and procedures (Automation)
  • Mentor and train SMEs in proactive reliability decision-making and planning
  • Liaise with infrastructure SMEs to parse relevant data / KPIs and instrument them in collaboration with the monitoring team


Nice to Have:

  • Experience with monitoring tools (Dynatrace, AppD, SolarWinds)