Login | Register

REMOTE - Senior Application / Infrastructure Monitoring Specialist to architect, manage the Monitoring platform critical applications and infrastructure - 4016

Job Type: Contract
Positions to fill: 1
Start Date: Jan 24, 2022
Job End Date: Jan 31, 2023
Pay Rate: Hourly: Negotiable
Job ID: 113907
Location: Calgary, Edmonton, Halifax, Montreal, Ottawa, Regina, Toronto, Vancouver, Victoria, Winnipeg
Apply
S.i. systems public sector client embraces a DevOps culture, fully adheres to scaled agile framework for enterprise (SAFe) processes and is deeply invested in modern Microsoft technologies such as Azure.

They are seeking a Senior Application / Infrastructure Monitoring Specialist to to architect, manage the Monitoring platform critical applications and infrastructure.

JOB DUTIES:
  • Evaluating, architecting, deploying, and managing our Monitoring & Observability platform
  • Defining, configuring, maintaining, and maximizing the platform that makes critical applications and infrastructure visible and investigable
  • Leading define methodically to build an observability-driven development practice
  • Working closely with development teams to implement monitoring & observability instrumentation within their platforms.
  • Driving adoption of best practices in monitoring, alerting, and performance
  • Defining best practices around making our systems and services measurable and working with our various teams to get those best practices applied
  • Collaborating with our Engineering & Platform teams to ensure our services, platforms and infrastructure are emitting the right metrics
  • Collecting, aggregating, and visualizing the collected metrics to provide actionable insight
  • Contributing to our evolving “data-driven” and “cloud first” culture through continuous learning
MUST HAVE SKILLS
  • 6+ years of experience in software engineering, SRE, or DevOps
  • Experience with one or more monitoring/logging/alerting tools at scale, such as:
    • Zabbix
    • Logstash
    • Kibana
    • Prometheus
  • Prior experience with instrumenting mission-critical services on distributed level, using cloud providers like Azure, AWS or GCP
  • Containerization experience
  • Experience with one or more of:
    • Elastic search
    • Cloud Watch
    • Prometheus
    •  Data Dog
    • Splunk
  • some Big Data technology experience such as:
    • Cassandra
    • Elastic search
    • Hadoop
  • Solid configuration management skills (Puppet, Salt)
  • Experience with scripting and system automation (Python, Perl etc.)
  • Able to comply with WorkSafeBC’s Employee and Contractor Mandatory Vaccine Policy
NICE TO HAVE SKILLS
  • Prior experience with AIOps is a plus