01 Feb
Web.com Canada
Ontario
We are seeking a skilled Senior Observability Engineer to design, implement, and optimize observability solutions across cloud platforms and hybrid environments. The ideal candidate will have strong experience in cloud infrastructure (preferably OCI or other platforms), automation tools, observability stacks, and container orchestration. The role involves building scalable, resilient monitoring systems that ensure infrastructure and application performance, security, and availability.
Key Responsibilities:
Architecture & Design
- Design and implement end-to-end observability solutions leveraging tools like
Grafana, Prometheus, Zabbix, Nagios, Loki, Elastic Stack, or Open Telemetry.
- Architect scalable and fault-tolerant infrastructure monitoring for OCI cloud environment.
- Build robust observability stacks to enable application performance monitoring (APM), infrastructure metrics, and log aggregation.
Infrastructure as Code (IaC)
- Use Terraform to automate and manage infrastructure deployments and monitoring configurations.
- Collaborate with DevOps teams to maintain IaC standards and CI/CD workflows.
- Prior experience with ansible and puppet will be a plus point
Observability & Monitoring
- Deploy and configure Prometheus for metrics collection and alerting.
- Build custom dashboards and visualizations in Grafana to monitor system health and performance.
- Set up Osquery for endpoint visibility and security monitoring.
- Develop monitoring frameworks for Docker containers and Kubernetes clusters.
CI/CD Pipeline
- Hands-on experience with deploying infrastructure using Jenkins as the CI/CD tool.
- Knowledge with any other CI/CD environments will be favorable
Containerization & Orchestration
- Develop basic-to-medium level Docker configurations to containerize monitoring solutions.
- Configure and optimize Kubernetes clusters for observability, logging, and monitoring.
Collaboration & Leadership
- Work with cross-functional teams (DevOps, Cloud Engineering, Application Development) to align monitoring objectives.
- Provide technical guidance to junior team members on best practices for monitoring and observability.
- Partner with security teams to ensure compliance and security in monitoring solutions.
Key Qualifications:
Technical Skills:
- Proficient in cloud infrastructure (preferably OCI, AWS, GCP, or Azure).
- Strong knowledge of Grafana, Prometheus, Zabbix and Nagios.
- Experience with Terraform and CI/CD pipelines.
- Working knowledge of container platforms (Docker, Kubernetes).
- Expertise in setting up observability stacks, including logging, metrics, and tracing.
- Understanding of Linux/Windows system internals and basic networking concepts.
- Expertise in scripting languages like Go-Lang, python, perl, etc.
Soft Skills:
- Strong problem-solving and analytical skills.
- Effective communication with technical and non-technical teams.
- Team player with leadership abilities to drive monitoring best practices.
Preferred Experience
- 8+ years in cloud infrastructure or monitoring roles.
- Hands-on experience in implementing observability tools and stacks.
- Proven track record of improving infrastructure reliability through monitoring automation.
Why you’ll love us.
- We’ve evolved; we provide three work environment scenarios. You can feel like a Newfolder in a work-from-home, hybrid or work-from-the-office environment.
- Work-life balance. Our work is thrilling and meaningful,
but we know balance is key to living well.
- We celebrate one another’s differences. We’re proud of our culture of diversity and inclusion. We foster a culture of belonging. Our company and customers benefit when employees bring their authentic selves to work. We have programs that bring us together on important issues and provide learning and development opportunities for all employees. We have 20 + affinity groups where you can network and connect with Newfolders globally.
- Where can we take you? We’re fans of helping our employees learn different aspects of the business, be challenged with new tasks, be mentored, and grow their careers. Unfold new possibilities with #teamnewfold!
#LI-SM1
This Job Description includes the essential job functions required to perform the job described above,
as well as additional duties and responsibilities. This Job Description is not an exhaustive list of all functions that the employee performing this job may be required to perform. The Company reserves the right to revise the Job Description at any time, and to require the employee to perform functions in addition to those listed above.
Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.