Manages the Enterprise Monitoring platform to ensure that the systems and network infrastructure for business-critical services are being rigorously and effectively monitored.
Redesign and implement a more progressive monitoring strategy that meets the evolving requirements in IT as we shift towards more modernized architectures, processes, and technologies.
Manage resource and project prioritization for Enterprise Monitoring.
Track and provide status reporting of work assignments, projects, and programs
Create and drive strategy and standardization of monitoring across the enterprise.
Ensure we collect the right metrics at the right frequency and the data is readily available for alerting, reporting, and analysis
Collaborate with cross-functional teams to understand complex application architectures in order to design an effective top-down monitoring strategy for holistic service visibility.
Design alerting and incident-based collaboration across the enterprise.
Build and grow the scope and capabilities of the Enterprise Monitoring team with a top-down, service-driven focus.
Ensure methodologies keep pace with the shifts & transformations taking place within IT.
Collaborate with NOC and Performance teams to design and implement a cohesive and consistent visibility strategy and incident response process across the enterprise.
Ensure monitoring team increases use of automation and adopts a DevOps mentality
8 plus years of experience in critical infrastructure monitoring leadership role or similar position.
Understand and implement Event Management for monitoring.
Design, install , configure Zabbix in different topologies
Experience with designing and engineering solutions to monitor critical systems and network infrastructure across a wide array of technologies and platforms.
In-depth experience managing at least 3 out of following monitoring tools such as Zabbix, Splunk, Elastic etc
Experience with Windows and Linux operating system management and administration.
Familiarity with LAN/WAN technologies and clear understanding of basic network concepts / services
Strong understanding of workflow management and ITSM tools
Ability to work with a team with participants from various technical disciplines and geographies.
Ability to work independently with a high degree of initiative and determination.