Site Reliability Engineer - Algorithmic Trading - Dr. Weng
- חברה: Dr. Weng
- מיקום: תל אביב - יפו
- טכנולוגיות: Bash, Python
תיאור המשרה
Work with software engineers and traders to provide 24/7 first line support of our trading, testing, and research environment.
Proactively monitor and identify issues before they impact trading. Define and automate observability around application SLAs and build anomaly detectors that surface problems early.
Oversee and maintain our test trading environment. Keep the environment running smoothly and pursue automation of routine interventions required to maintain a stable test platform for development.
Standardize CI/CD and operational processes across development teams to reduce friction and speed up delivery. Improve deployment scripts and release verification processes.
Perform stress tests and verify SLAs under load. Work with application and devops engineers to design and improve support for this type of testing.
Work closely with traders, engineers, system administrators, data centers, networking, and other shared services.
Bachelor's degree in computer science or equivalent practical background.
3+ years of hands-on software development or SRE Experience
Demonstrated experience with observability tools - logging, metrics and tracing
Hands on development experience in scripting languages like Bash, Python, or Ruby
An automation-first approach to problem solving - you understand that manual processes don’t scale, and your instinct is to eliminate them before they become a bottleneck.
Demonstrated knowledge of network communications including comprehensive understanding of the Linux TCP-IP stack, use of multicast networking, and network protocol interactions.
Solid diagnostic capabilities from the application layer through the network and low-level hardware.
Experience with network capture and time synchronization.
High level of ownership and accountability, reliability, and strong follow through.
Positive AI mentality and experience with AI agents to accelerate SRE tasks.
תחומי אחריות
Work with software engineers and traders to provide 24/7 first line support of our trading, testing, and research environment.
Proactively monitor and identify issues before they impact trading. Define and automate observability around application SLAs and build anomaly detectors that surface problems early.
Oversee and maintain our test trading environment. Keep the environment running smoothly and pursue automation of routine interventions required to maintain a stable test platform for development.
Standardize CI/CD and operational processes across development teams to reduce friction and speed up delivery. Improve deployment scripts and release verification processes.
Perform stress tests and verify SLAs under load. Work with application and devops engineers to design and improve support for this type of testing.
Work closely with traders, engineers, system administrators, data centers, networking, and other shared services.
Bachelor's degree in computer science or equivalent practical background.
3+ years of hands-on software development or SRE Experience
Demonstrated experience with observability tools - logging, metrics and tracing
Hands on development experience in scripting languages like Bash, Python, or Ruby
An automation-first approach to problem solving - you understand that manual processes don’t scale, and your instinct is to eliminate them before they become a bottleneck.
Demonstrated knowledge of network communications including comprehensive understanding of the Linux TCP-IP stack, use of multicast networking, and network protocol interactions.
Solid diagnostic capabilities from the application layer through the network and low-level hardware.
Experience with network capture and time synchronization.
High level of ownership and accountability, reliability, and strong follow through.
Positive AI mentality and experience with AI agents to accelerate SRE tasks.
דרישות
Work with software engineers and traders to provide 24/7 first line support of our trading, testing, and research environment.
Proactively monitor and identify issues before they impact trading. Define and automate observability around application SLAs and build anomaly detectors that surface problems early.
Oversee and maintain our test trading environment. Keep the environment running smoothly and pursue automation of routine interventions required to maintain a stable test platform for development.
Standardize CI/CD and operational processes across development teams to reduce friction and speed up delivery. Improve deployment scripts and release verification processes.
Perform stress tests and verify SLAs under load. Work with application and devops engineers to design and improve support for this type of testing.
Work closely with traders, engineers, system administrators, data centers, networking, and other shared services.
Bachelor's degree in computer science or equivalent practical background.
3+ years of hands-on software development or SRE Experience
Demonstrated experience with observability tools - logging, metrics and tracing
Hands on development experience in scripting languages like Bash, Python, or Ruby
An automation-first approach to problem solving - you understand that manual processes don’t scale, and your instinct is to eliminate them before they become a bottleneck.
Demonstrated knowledge of network communications including comprehensive understanding of the Linux TCP-IP stack, use of multicast networking, and network protocol interactions.
Solid diagnostic capabilities from the application layer through the network and low-level hardware.
Experience with network capture and time synchronization.
High level of ownership and accountability, reliability, and strong follow through.
Positive AI mentality and experience with AI agents to accelerate SRE tasks.