Cloud Operations Engineer - Cato Networks
- חברה: Cato Networks
- מיקום: Tel Aviv District, Israel
- טכנולוגיות: Bash, Grafana, Python
תיאור המשרה
Implement and improve network monitoring and alerting systems to proactively detect service-impacting issues.
Lead escalations of critical network and service incidents, coordinating with Engineering, Support, Security, and Operations teams.
Investigate and resolve complex network issues beyond standard NOC and Support tiers.
Contribute to RCAs by providing technical analysis, impact assessment, mitigation actions, and prevention recommendations.
Work closely with Engineering on new versions, features, and network capabilities, including design feedback, production readiness, rollout validation, and post-release monitoring.
Participate in CHNG management processes, service-component deployments, release pipelines, and production rollouts.
Create runbooks, workflows, and investigation procedures to improve troubleshooting efficiency across NOC team.
Identify recurring issues and operational gaps to drive improvements in network stability, monitoring, alerting, and automation.
Minimum of 3 years as Network engineer / Production engineer / T3 support engineer or similar role.
Strong understanding of network protocols (i.e: BGP, OSPF, DNS, TCP/IP).
Proficiency in tools like Grafana, Sensu, Zabbix, or similar platforms.
Advanced troubleshooting and problem-solving skills to diagnose and resolve network issues.
Ability to lead efforts during service incidents.
Experience with scripting languages (Python, Bash) to automate tasks and streamline workflows (Advantage).
Networking related certificates are an advantage.
תחומי אחריות
Implement and improve network monitoring and alerting systems to proactively detect service-impacting issues.
Lead escalations of critical network and service incidents, coordinating with Engineering, Support, Security, and Operations teams.
Investigate and resolve complex network issues beyond standard NOC and Support tiers.
Contribute to RCAs by providing technical analysis, impact assessment, mitigation actions, and prevention recommendations.
Work closely with Engineering on new versions, features, and network capabilities, including design feedback, production readiness, rollout validation, and post-release monitoring.
Participate in CHNG management processes, service-component deployments, release pipelines, and production rollouts.
Create runbooks, workflows, and investigation procedures to improve troubleshooting efficiency across NOC team.
Identify recurring issues and operational gaps to drive improvements in network stability, monitoring, alerting, and automation.
Required Skills & Experience: Minimum of 3 years as Network engineer / Production engineer / T3 support engineer or similar role. Strong understanding of network protocols (i.e: BGP, OSPF, DNS, TCP/IP). Proficiency in tools like Grafana, Sensu, Zabbix, or similar platforms. Advanced troubleshooting and problem-solving skills to diagnose and resolve network issues. Ability to lead efforts during service incidents. Experience with scripting languages (Python, Bash) to automate tasks and streamline workflows (Advantage). Networking related certificates are an advantage.
דרישות
Implement and improve network monitoring and alerting systems to proactively detect service-impacting issues.
Lead escalations of critical network and service incidents, coordinating with Engineering, Support, Security, and Operations teams.
Investigate and resolve complex network issues beyond standard NOC and Support tiers.
Contribute to RCAs by providing technical analysis, impact assessment, mitigation actions, and prevention recommendations.
Work closely with Engineering on new versions, features, and network capabilities, including design feedback, production readiness, rollout validation, and post-release monitoring.
Participate in CHNG management processes, service-component deployments, release pipelines, and production rollouts.
Create runbooks, workflows, and investigation procedures to improve troubleshooting efficiency across NOC team.
Identify recurring issues and operational gaps to drive improvements in network stability, monitoring, alerting, and automation.
Minimum of 3 years as Network engineer / Production engineer / T3 support engineer or similar role.
Strong understanding of network protocols (i.e: BGP, OSPF, DNS, TCP/IP).
Proficiency in tools like Grafana, Sensu, Zabbix, or similar platforms.
Advanced troubleshooting and problem-solving skills to diagnose and resolve network issues.
Ability to lead efforts during service incidents.
Experience with scripting languages (Python, Bash) to automate tasks and streamline workflows (Advantage).
Networking related certificates are an advantage.