HiTakeJob

Data Center Operations Engineer - Final

  • Company: Final
  • Location: Ramat Hasharon
  • Job type: Hybrid
  • Technologies: Linux knowledge for troubleshooting server and hardware-related issues

Job Description

Final is looking for an experienced Data Center Operations Engineer to join our IT department. In this role, you will maintain and support a large-scale global server and GPU infrastructure, monitor and troubleshoot hardware failures, and collaborate with senior system specialists to resolve complex issues across production and research environments. 

We offer a dynamic, innovative production and research environment and expect a strong commitment to maintaining high service availability, performance, and reliability. 


Responsibilities 

  • Deploy (rack and cable) new servers, GPUs, and infrastructure hardware. 
  • Monitor and troubleshoot IT infrastructure hardware issues across all company data centers. 
  • Coordinate vendor engineers’ site visits. 
  • Perform hardware break/fix activities including replacement of drives, RAM, CPUs, GPUs, power supplies, and other server components. 
  • Keep firmware versions up to date.
  • Manage company data center inventory. 
  • Monitor and manage DC environment parameters including power usage, airflow, cooling, and rack capacity. 
  • Support and maintain hardware in high-performance compute environments used for compute-intensive workloads and research activities.

Requirements

  • 2+ years of experience working in a production data center environment. 
  • Experience supporting GPU-based servers or high-performance computing (HPC) environments – advantage. 
  • Familiarity with GPU technologies, hardware architectures, and accelerated computing platforms – advantage. 
  • Willingness to travel abroad 4–5 times a year for weekend maintenance work.
  • Excellent analytical and problem-solving skills with strong attention to detail. 
  • Fluent written and spoken English at a professional level. 
  • Good Linux knowledge for troubleshooting server and hardware-related issues. 
  • Hands-on experience with IT infrastructure hardware support (e.g., servers, HDDs/SSDs, hardware replacement).
  • Experience with DC monitoring and infrastructure management tools – advantage. 
  • Comfortable working in hands-on environments, including lifting and racking equipment.