HiTakeJobHiTakeJob

Senior Backend Engineer - AI Framework (Python) - Gong.io

  • חברה: Gong.io
  • מיקום: Tel Aviv District, Israel
  • טכנולוגיות: Python, Java, LangChain

תיאור המשרה

Agentic Framework Architecture : Designing and building Gong’s internal agentic framework, leveraging and integrating industry-standard tools such as LangChain, LangSmith, ADK, and similar ecosystems. Evaluation and Quality Systems : Building evaluation frameworks and workflows for AI agents, including offline and online evaluations, quality metrics, regression detection, and experimentation infrastructure. Observability, Monitoring, and Guardrails : Providing the organization with robust observability capabilities for AI agents, including tracing, logging, monitoring, cost tracking, and safety guardrails to ensure reliable and responsible usage. Developer Enablement Platforms : Creating APIs, SDKs, and abstractions that enable product teams to easily build, test, and operate agents while adhering to platform standards. Cross-Language Integrations : Designing integrations and tooling across Python and Java to enable seamless adoption of the AI framework within Gong’s broader backend ecosystem. Agent Runtime and Execution Engine: Building the runtime responsible for orchestrating agent execution, managing tool calls, maintaining state and memory, and ensuring reliable execution across distributed systems. Agent Lifecycle and Orchestration Complexity : Managing agent execution, tool usage, memory, workflows, and failure modes in production-grade systems. AI System Reliability at Scale : Ensuring agents remain observable, debuggable, and safe as usage scales across teams and products. Evaluation and Drift Challenges : Detecting quality regressions, model behavior changes, and unintended agent behaviors through robust evaluation and monitoring systems. Platform Adoption Friction : Balancing flexibility with guardrails so teams can innovate quickly without compromising reliability, security, or cost controls. Company-Wide AI Enablement : Empowering every engineering team at Gong to build agent-based solutions faster, with higher quality and confidence. Foundational AI Infrastructure : Establishing the core frameworks, evaluations, and observability standards that all AI agents at Gong will rely on. AI Safety and Quality Bar : Raising the bar for how AI systems are evaluated, monitored, and governed across the company. 7+ years of backend engineering experience, with strong system design and platform-building expertise. Strong analytical and problem-solving skills, with the ability to debug and resolve complex technical issues efficiently. Hands-on experience designing and building agentic systems or agent frameworks in production, including orchestration, tool usage, memory management, and multi-step workflows. Experience extending or building frameworks on top of tools like LangChain, LangGraph, ADK, or similar agent orchestration frameworks. Experience designing, implementing, and operating production-grade AI agents , including handling failure modes, retries, observability, and real user traffic. Strong understanding of AI evaluation methodologies, including agent evaluations, prompt evaluation, regression testing, and quality monitoring. High proficiency in Python for building production-grade AI frameworks and services. Familiarity with Java and experience integrating backend platforms or tooling into Java-based systems. Experience building observability, monitoring, or platform tooling for distributed systems. Strong analytical skills and the ability to reason about complex, evolving AI-driven systems. Experience with cloud platforms and scalable microservices architectures. Excellent communication skills and a strong platform mindset, with experience enabling multiple teams.

תחומי אחריות

Agentic Framework Architecture : Designing and building Gong’s internal agentic framework, leveraging and integrating industry-standard tools such as LangChain, LangSmith, ADK, and similar ecosystems. Evaluation and Quality Systems : Building evaluation frameworks and workflows for AI agents, including offline and online evaluations, quality metrics, regression detection, and experimentation infrastructure. Observability, Monitoring, and Guardrails : Providing the organization with robust observability capabilities for AI agents, including tracing, logging, monitoring, cost tracking, and safety guardrails to ensure reliable and responsible usage. Developer Enablement Platforms : Creating APIs, SDKs, and abstractions that enable product teams to easily build, test, and operate agents while adhering to platform standards. Cross-Language Integrations : Designing integrations and tooling across Python and Java to enable seamless adoption of the AI framework within Gong’s broader backend ecosystem. Agent Runtime and Execution Engine: Building the runtime responsible for orchestrating agent execution, managing tool calls, maintaining state and memory, and ensuring reliable execution across distributed systems. Agent Lifecycle and Orchestration Complexity : Managing agent execution, tool usage, memory, workflows, and failure modes in production-grade systems. AI System Reliability at Scale : Ensuring agents remain observable, debuggable, and safe as usage scales across teams and products. Evaluation and Drift Challenges : Detecting quality regressions, model behavior changes, and unintended agent behaviors through robust evaluation and monitoring systems. Platform Adoption Friction : Balancing flexibility with guardrails so teams can innovate quickly without compromising reliability, security, or cost controls. Company-Wide AI Enablement : Empowering every engineering team at Gong to build agent-based solutions faster, with higher quality and confidence. Foundational AI Infrastructure : Establishing the core frameworks, evaluations, and observability standards that all AI agents at Gong will rely on. AI Safety and Quality Bar : Raising the bar for how AI systems are evaluated, monitored, and governed across the company. 7+ years of backend engineering experience, with strong system design and platform-building expertise. Strong analytical and problem-solving skills, with the ability to debug and resolve complex technical issues efficiently. Hands-on experience designing and building agentic systems or agent frameworks in production, including orchestration, tool usage, memory management, and multi-step workflows. Experience extending or building frameworks on top of tools like LangChain, LangGraph, ADK, or similar agent orchestration frameworks. Experience designing, implementing, and operating production-grade AI agents , including handling failure modes, retries, observability, and real user traffic. Strong understanding of AI evaluation methodologies, including agent evaluations, prompt evaluation, regression testing, and quality monitoring. High proficiency in Python for building production-grade AI frameworks and services. Familiarity with Java and experience integrating backend platforms or tooling into Java-based systems. Experience building observability, monitoring, or platform tooling for distributed systems. Strong analytical skills and the ability to reason about complex, evolving AI-driven systems. Experience with cloud platforms and scalable microservices architectures. Excellent communication skills and a strong platform mindset, with experience enabling multiple teams.

דרישות

Agentic Framework Architecture : Designing and building Gong’s internal agentic framework, leveraging and integrating industry-standard tools such as LangChain, LangSmith, ADK, and similar ecosystems. Evaluation and Quality Systems : Building evaluation frameworks and workflows for AI agents, including offline and online evaluations, quality metrics, regression detection, and experimentation infrastructure. Observability, Monitoring, and Guardrails : Providing the organization with robust observability capabilities for AI agents, including tracing, logging, monitoring, cost tracking, and safety guardrails to ensure reliable and responsible usage. Developer Enablement Platforms : Creating APIs, SDKs, and abstractions that enable product teams to easily build, test, and operate agents while adhering to platform standards. Cross-Language Integrations : Designing integrations and tooling across Python and Java to enable seamless adoption of the AI framework within Gong’s broader backend ecosystem. Agent Runtime and Execution Engine: Building the runtime responsible for orchestrating agent execution, managing tool calls, maintaining state and memory, and ensuring reliable execution across distributed systems. Agent Lifecycle and Orchestration Complexity : Managing agent execution, tool usage, memory, workflows, and failure modes in production-grade systems. AI System Reliability at Scale : Ensuring agents remain observable, debuggable, and safe as usage scales across teams and products. Evaluation and Drift Challenges : Detecting quality regressions, model behavior changes, and unintended agent behaviors through robust evaluation and monitoring systems. Platform Adoption Friction : Balancing flexibility with guardrails so teams can innovate quickly without compromising reliability, security, or cost controls. Company-Wide AI Enablement : Empowering every engineering team at Gong to build agent-based solutions faster, with higher quality and confidence. Foundational AI Infrastructure : Establishing the core frameworks, evaluations, and observability standards that all AI agents at Gong will rely on. AI Safety and Quality Bar : Raising the bar for how AI systems are evaluated, monitored, and governed across the company. 7+ years of backend engineering experience, with strong system design and platform-building expertise. Strong analytical and problem-solving skills, with the ability to debug and resolve complex technical issues efficiently. Hands-on experience designing and building agentic systems or agent frameworks in production, including orchestration, tool usage, memory management, and multi-step workflows. Experience extending or building frameworks on top of tools like LangChain, LangGraph, ADK, or similar agent orchestration frameworks. Experience designing, implementing, and operating production-grade AI agents , including handling failure modes, retries, observability, and real user traffic. Strong understanding of AI evaluation methodologies, including agent evaluations, prompt evaluation, regression testing, and quality monitoring. High proficiency in Python for building production-grade AI frameworks and services. Familiarity with Java and experience integrating backend platforms or tooling into Java-based systems. Experience building observability, monitoring, or platform tooling for distributed systems. Strong analytical skills and the ability to reason about complex, evolving AI-driven systems. Experience with cloud platforms and scalable microservices architectures. Excellent communication skills and a strong platform mindset, with experience enabling multiple teams.