1. 24/7 Production Database Operations
- Provide L2/L3 support for MSSQL and MongoDB in Production and DR environments.
- Participate in standby/on-call roster for critical incidents.
- Troubleshoot and resolve:
  - Database outages
  - Always On failover issues
  - Blocking / deadlocks
  - Performance bottlenecks
  - Storage / IOPS constraints
- Perform root cause analysis (RCA) and implement preventive measures.
2. High Availability & Reliability Management
Microsoft SQL Server (MSSQL)
- Manage Always On Availability Groups.
- Monitor replication and failover health.
- Perform database patching and cumulative updates.
- Maintain backup and restore strategy (Full, Differential, Log backups).
- Conduct periodic DR drills and restoration validation.
- Optimize indexing strategies and execution plans.
MongoDB
- Manage Replica Sets and automatic failover.
- Monitor memory, disk, and index performance.
- Manage backup/restore and periodic recovery validation.
- Maintain cluster health and sharding (if applicable).
3. Performance & Capacity Management
- Conduct proactive health checks.
- Analyze query execution plans (MSSQL).
- Analyze slow queries and indexing (MongoDB).
- Perform database tuning and optimization.
- Forecast capacity growth and scaling requirements.
4. Change & Release Support
- Review database scripts prior to production deployment.
- Support data migration and release windows.
- Validate database stability post-deployment.
- Ensure proper rollback and recovery procedures are in place.
5. Security & Compliance (Government Environment)
- Implement database security hardening, including:
  - Role-based access control
  - Auditing and logging
- Support vulnerability assessment (VA) remediation.
- Ensure compliance with government security policies.
- Maintain proper documentation and SOPs for audit readiness.
Technical Requirements
- 5-8+ years of DBA experience.
- Strong hands-on experience with Microsoft SQL Server (Always On, HA, performance tuning) and MongoDB (Replica Sets, backup/restore).
- Experience supporting 24/7 mission-critical systems.
- Experience handling Severity 1 production incidents.
- Experience supporting application deployments (data patching, data seeding, etc.).
- Strong troubleshooting and analytical skills.