Site Reliability Engineer Job at Infinity Solutions, Ontario, CA

dUJMYS9MTjlkUk9QSy9nRnNBWmtSdVpRaGc9PQ==
  • Infinity Solutions
  • Ontario, CA

Job Description

About the Role and Job:



Position : Site Reliability Engineer (SRE)
Location
: Downtown, Canada
Mode of Work : Hybrid (3-4 days a week)

Job Summary

We are seeking a skilled Site Reliability Engineer (SRE) to enhance the reliability, scalability, and performance of our systems and applications. The ideal candidate will have strong experience in automation, cloud platforms, observability, incident management, and DevOps practices. This role involves working closely with cross functional teams to ensure high availability, continuous improvement, and efficient service delivery.



Key Responsibilities

  • Design, build, and maintain automation for infrastructure provisioning and configuration management.
  • Implement and manage monitoring, observability, and alerting systems to ensure service reliability.
  • Collaborate with development and operations teams to enhance CI/CD pipelines and deployment automation.
  • Lead incident response, root cause analysis, and continuous improvement initiatives.
  • Manage cloud infrastructure, container orchestration platforms, and distributed systems at scale.
  • Ensure security, compliance, and governance across systems and processes.
  • Optimize application performance and conduct capacity planning and load testing.
  • Maintain documentation, runbooks, SLOs/SLAs, and operational processes.

Required Skills & Experience

1. Automation & Configuration Management

  • Ansible: Writing playbooks, roles, and modules.
  • Python: Scripting for automation, monitoring, API integration.
  • PowerShell: Automation for Windows, AD, and cloud resources.

2. Monitoring & Observability

  • Dynatrace: Synthetic & real user monitoring, alerting, performance analysis.
  • Moogsoft: Event correlation, alert management, incident orchestration.
  • Elasticsearch Stack: Log aggregation & querying; familiarity with Kibana/Logstash.

3. Incident & Service Management

  • ServiceNow: Ticket lifecycle, CMDB, workflow automation.

4. Infrastructure & Platforms

  • Cloud: AWS, Azure, or GCP (compute, storage, serverless, networking).
  • Containers: Kubernetes/OpenShift, Docker, Helm.

5. Database & Storage

  • SQL Server: Query tuning, replication, HA/DR setups.
  • Distributed DBs: Cassandra, Redis, NoSQL systems.
  • Backup & disaster recovery planning.

6. Security & Compliance

  • IAM, encryption, secrets management (e.g., HashiCorp Vault).
  • Vulnerability scanning and compliance frameworks (e.g., SOC 2).

7. CI/CD & DevOps

  • CI/CD tools: Jenkins, GitHub Actions, UrbanCode Deploy (UCD).
  • Git workflows and branching strategies.
  • Artifact management: Artifactory, Nexus.

8. Performance Engineering

  • Load testing using JMeter.
  • Capacity planning & performance optimization.
  • Defining and measuring SLIs, SLOs, SLAs.

Job Tags

3 days per week,

Similar Jobs

WakeMed Health & Hospitals

Ambulatory Care Nurse I Job at WakeMed Health & Hospitals

Overview:The Ambulatory Care Nurse (ACN) is a Registered Nurse responsible for providing nursing care management services to the assigned...  ...:Registered Nurse RequiredEducation:Diploma Nursing Or Associate's Degree Nursing RequiredExperience:No Experience Required Required

Public Employees Retirement System

Investment Analyst Job at Public Employees Retirement System

 ...includesindependent review for sourcing and underwriting new investment opportunities, managing assigned portfolios, market/sector research and strategic planning and drive our CalPERS mission.Duties include but are not limited to: Perform asset management duties... 

Freshpoint

Non-CDL Local Delivery Truck Driver Job at Freshpoint

 ...during the week. Must be willing to work Saturdays. Early morning delivery start times. Start time 2am-5am. 8-12 hoursJOB SUMMARYAll drivers run daily routes with frequent stops and are required to load packages onto hand trucks and unload product at each stop on the... 

Texas Health Resources

Project Manager I Job at Texas Health Resources

25012445 IT Project Manager I IT Project Management Office Bring your passion to Texas Health so we are Better + Together Work...  ...Clinical field, Business, or related field. 4 Years Related experience in lieu of a degree. (Preferred) 2 Years Project management... 

CS Logistics

Van Delivery Driver Job at CS Logistics

Van Delivery Driver Location Madison, WI : We are seeking a Delivery Driver to join our team! You will drive your own vehicle to deliver and...  ...this opportunity is for someone who owns their own minivan, cargo van or Transit/Sprinter-type vehicle to use for deliveries....