Summary
We are seeking an experienced and visionary Product Owner to lead the development of AIOps capabilities for our next-generation hybrid cloud platform. The ideal candidate will have a strong background in observability, intelligent automation, and large-scale cloud operations.
Responsibilities
- Own the product roadmap and delivery strategy for the AIOps domain, focusing on building intelligent and automated operations for complex cloud environments.
- Define and prioritize features across six core AIOps product modules:
  - Monitoring & Alerting: unified observability through metrics, logs, and traces.
  - ITSM: integration and orchestration of incident, change, and problem management workflows.
  - CMDB: automated discovery and dynamic topology mapping of infrastructure and services.
  - Operations Copilot: AI-powered agents to assist with diagnostics, RCA, and recommendations.
  - Automation Platform: closed-loop automation for remediation and routine tasks.
  - Smart Dashboard: real-time visual intelligence for operations center and executive reporting.
- Collaborate with engineering, design, DevOps, and architecture teams to ensure successful implementation.
- Translate customer pain points and operational needs into actionable user stories and functional requirements.
- Drive cross-team alignment and facilitate agile delivery processes (backlog grooming, sprint planning, etc.).
- Engage with customers and internal stakeholders to validate product value and prioritize enhancements.
- Monitor competitive AIOps solutions and emerging technologies to ensure long-term differentiation.
Qualifications & Experience
- 5+ years of experience as a Product Owner or Technical Product Manager in AIOps, observability, or enterprise cloud operations.
- Familiarity with tools and frameworks such as Prometheus, Grafana, OpenTelemetry, ServiceNow, or runbook automation platforms.
- Hands-on experience with automation frameworks (e.g., Ansible, Terraform, Rundeck) and AI agent technologies (e.g., LangChain, other agent frameworks, or custom LLM-based operational agents).
- Solid understanding of SRE/DevOps principles and ITIL processes.
- Proficiency with product management and collaboration tools such as Figma, Jira, Confluence, Notion, or similar platforms.
- Ability to define and evaluate intelligent workflows for diagnostics, remediation, and infrastructure optimization.
- Strong analytical and communication skills; able to translate technical complexity into business value.
- Experience working in fast-paced, global, and cross-functional environments.