Site Reliability Engineer required to instrument observability for on-premises and cloud-based infrastructure (on-site focus) for an insurance client
Job Type: Contract
Positions to fill: 1
Start Date: Sep 04, 2023
Job End Date: May 31, 2024
Pay Rate: Hourly: Negotiable
Job ID: 131847
Location: Toronto
Environment: In office 2 days per week – Toronto, London or Winnipeg
Contract: Ends May 2024; extension is probable
Responsibilities:
The Site Reliability Engineer will instrument observability for on-premises and cloud-based infrastructure (on-premises is the immediate need) and provide interpretive analysis, troubleshooting, and continuous improvement.
Must Haves:
- Strong infrastructure background (network, compute, storage, mainframe)
- Strong experience with Splunk
- Deep OS knowledge and troubleshooting skills (Windows, Unix)
- Document and maintain runbooks and procedures (Automation)
- Mentor and train SMEs in proactive reliability decision-making and planning
- Liaise with infrastructure SMEs to parse relevant data and KPIs, and instrument them in collaboration with the monitoring team
- Nice to have: Monitoring capabilities (Dynatrace, AppDynamics, SolarWinds)