Job Summary:
We are seeking an experienced and certified Splunk Engineer to join our IT operations and observability team. The ideal candidate will have deep expertise in Splunk Enterprise, Splunk Observability Suite, IT Service Intelligence (ITSI), Application Performance Monitoring (APM), Real User Monitoring (RUM), and Synthetic Monitoring. This role will play a key part in building and optimizing our monitoring, alerting, and visualization strategies across applications, infrastructure, and services.
Key Responsibilities:
- Design, implement, and maintain Splunk-based monitoring and observability solutions across the enterprise.
- Configure and optimize Splunk Enterprise, ITSI, APM, RUM, and Synthetic Monitoring to ensure accurate and actionable visibility into applications and infrastructure.
- Develop and maintain custom dashboards, alerts, reports, and service health scores in ITSI for stakeholders including DevOps, SREs, and business units.
- Integrate logs, metrics, traces, and real user data from a variety of platforms including cloud, on-prem, and hybrid environments.
- Assist in the onboarding of data sources and develop efficient indexing and data retention strategies.
- Collaborate with application, network, and infrastructure teams to define monitoring requirements and improve system performance and reliability.
- Proactively identify system anomalies and performance bottlenecks using APM, RUM, and synthetic tests.
- Develop automation scripts for alerting and response using Splunk SOAR or other automation tools (if applicable).
- Stay up to date with the latest Splunk features and best practices and mentor junior team members.
- Support troubleshooting, RCA, and incident response efforts using Splunk-based insights.
Required Qualifications:
- 3+ years of hands-on experience with Splunk Enterprise architecture, configuration, and administration.
- 2+ years of experience in Splunk ITSI, including KPI creation, service design, and correlation searches.
- Proven experience in Splunk Observability, including:
- Splunk APM (Application Performance Monitoring)
- Real User Monitoring (RUM)
- Synthetic Monitoring
- Strong understanding of monitoring best practices, SRE principles, and DevOps workflows.
- Experience with distributed systems, microservices, and monitoring in cloud environments (AWS, Azure, GCP).
- Proficient in search processing language (SPL) and dashboard development.
- Familiarity with data onboarding techniques (via UF, HF, or APIs).
- Excellent analytical and problem-solving skills.
Certifications (Required):
- Splunk Certified Enterprise Admin
- Splunk Certified Observability Cloud Engineer
- Splunk Certified ITSI Analyst or Admin