Sr Tech Ops Engineer - (Q949)

Sr Tech Ops Engineer - (Q949)

03 Jan
|
EQ Bank | Equitable Bank
|
Toronto

03 Jan

EQ Bank | Equitable Bank

Toronto

Purpose of JobOne of our goals is to fail small and learn fast. The Sr Tech Ops Engineer will focus on minimizing the impacts of production incidents while ensuring the organization learns quickly to become more resilient.The Sr. Tech Ops Engineer is accountable for tracking and managing the restoration and communication of incidents with pace. The role is responsible for managing the processes, guidelines and tools related to Major and other high priority Incidents, performing Root Cause Analysis, and proactively enhancing related tools and processes for monitoring and alerting. The role is also responsible for managing the lifecycle of Problems, with the aim of preventing incidents from happening or reoccurring.



The Sr Tech Ops Engineer will establish service level objectives, indicators and agreements (SLOs, SLIs and SLAs) for critical systems and channels, including championing automated monitoring and data aggregating to streamline reporting of metrics.Success in this role will be demonstrated by ensuring that incidents with business or customer impact are dealt with effectively and with minimum disruption to live service and SLAs. This role will record and manage problems through to resolution by performing root cause analysis, coordinating third party analysis, and communicating with stakeholders to ensure workaround and solution suitability. This role will be supporting EQ Bank, Equitable Bank, the direct-to-consumer digital bank channels and core banking applications, as well as any other Information Technology incidents and problems with a high and critical impact on business.Main Activities:Determine the severity of an incident that has been raised by the business (per risk definition)Ensure successful and timely restoration services of major incidentsIdentity application related issues and determine the trends to avoid replicationDevelop and drive SWOT opportunities for reoccurring incidentsEstablish a proper issue detecting mechanism and build useful dashboardsWork in partnership with senior IT Subject matter experts and business counterparts to identify opportunities that drive business value and improve effectivenessEstablish and maintain timely communications with all stakeholders to ensure they are informed of service statusManage stakeholder expectations and update people on progress or slippagesHost both technical and managerial conference calls, and facilitate effective incident management throughout the incident lifecycleSpin-up command centres (virtual/physical) as needed based on priority of incidents, including call-tree escalations to Level 3 experts as neededOwn the creation and distribution of succinct communications on command centre progress and activities, for both front-line technical staff and executive leadersCo-ordinate / facilitate escalations through internal and supplier teams as needed to heighten visibility and engagementAccountable for gathering required information from business and technology to enable effective Root Cause AnalysisProduce and maintain Executive level reporting of incidents, including number of customers impacted and restoration timingMinimize and mitigate business-impacting incidents through controlled problem identification and responsive / proactive service improvement plansIdentify the underlying root cause and the triggers of a Problem and initiate the most appropriate and economical Problem solution or temporary workaroundEnsure that - after a successful Problem solution - the Problem Record contains a full historical description, and that related Known Error Records are updatedUndertake a review of the resolution of a Problem to prevent reoccurrence and learn any lessons for the futureVerify if the Problems marked as closed have been fully resolvedProvide extended and off-hour support as needed to manage command centres and communicationsManage processes to ensure on-call team schedules, systematic call-trees, and distribution lists are accurate and up to dateKnowledge/Skill Requirements:University degree in technology-related field required, or equivalent work experienceKnowledge in Java software delivery or software development including multiple frameworks such as Hibernate, Spring MVC, Spring Security, SAML, OAuth, OICExperience in API management tools, e.g., MuleSoft API Manager (preferably)Experience in industry standard APM toolsIncident and Problem management, Major Incident Response experience preferable6+ years of experience in a corporate SDLC function (developer, support, architect, etc.) in technology4 years of experience with technical incident resolution or product defect resolutionStrong written and verbal communication, with an ability to articulate issues simply, concisely,



accurately and clearly for both technical working level teams and executivesAn ability to work efficiently, effectively and collaborativelyKnowledge of ITIL 4 practices is a mustAnalytical mind capable of managing numerous information sources and providing meaningful data analysis reports to senior managementOrganized, structured, logical thinking and detail oriented analytic skillsMulti-tasking – can manage several concurrent initiatives with pace and prioritise demands whilst maintaining composureExperience with tracking incidents and providing dashboard views of issues/tickets across timeRespectable knowledge of Jira Service Desk, Confluence and Jira is requiredStrong attention to detail, keen problem-solving skills, with experience performing impact and root cause analyses, recommending solutions,



and supporting resolution effortsAbility to approach problems logically and with good judgment to ensure the appropriate outcomeCapacity to adapt to and manage change in a fast-paced, dynamic environmentPersonal maturity, personable, credible, professional presence, Coordinative authority, Calm under pressureAbility to work professionally with other teams and individuals to get things done quickly

The original job offer can be found in Kit Job:
https://www.kitjob.ca/job/89776992/tech-ops-engineer-q949-toronto/?utm_source=html

Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: sr tech ops engineer - (q949)

Reply to this offer

Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.

Subscribe to this job alert:
Enter Your E-mail address to receive the latest job offers for: sr tech ops engineer - (q949)