
Search by job, company or skills
Job Details
About the Company
. Leading fintech technology company specializingin efx/ CDF and crypto trading solutions
. Mission-critical operations supporting 24x7global trading environments.
. Enterprise-scale distributed systems acrossmultiple international data centers.
. Innovation-focused organization withcutting-edge trading technology solutions.
. Global presence serving major financialinstitutions worldwide.
Position Overview
We are seeking a highly skilled Senior Site ReliabilityEngineer to join our expanding SRE team and ensure the reliability,performance, and availability of mission-critical trading applications. Thisrole focuses on maintaining 99.99% uptime for high-frequency trading systemsthat process millions of transactions daily.
Key Responsibilities
. Ensure high availability of trading applicationsthrough proactive monitoring and automation
. Develop and maintain real-time monitoring,alerting, and logging systems for early issue detection
. Automate critical operations includingdeployment, configuration, scaling, and disaster recovery
. Support 24x7 operations across multiple globaldata centers and trading sessions
. Collaborate with engineering teams to integratereliability best practices into development lifecycle
. Conduct root cause analysis and implementpreventive measures for recurring issues
. Participate in on-call rotations for criticalapplication issues and outages
. Maintain CI/CD pipelines ensuring fast andreliable application releases
. Enhance system security through SSL certificatemanagement, encryption, and authentication
. Drive continuous improvement by evaluating newtools and methodologies
Required Qualifications
. Bachelor's degree in Computer Science,Engineering, or equivalent experience
. 5+ years of SRE, DevOps, or similarreliability-focused experience
. Strong expertise in Linux and Windows systemadministration
. Proficiency in scripting languages (Python,Shell, Perl, JavaScript)
. Experience with container technologies (Docker,Kubernetes)
. CI/CD tools expertise (Jenkins, deploymentautomation frameworks)
. Monitoring and observability tools (Prometheus,Grafana, ELK Stack, New Relic, Datadog)
. Networking fundamentals (TCP/IP, DNS, loadbalancing, firewalls)
. Configuration management tools (Ansible, Salt,Puppet)
. Strong debugging skills across application,database, and infrastructure layers
. Experience working in high-pressure environmentswith multiple priorities
. Excellent communication and collaboration skills
Preferred Skills
. Financial services or trading industryexperience highly valued
. Cloud platform knowledge (AWS, GCP, Azure)
. Security best practices and compliance standards
. Incident management frameworks (ITIL, SREmethodologies)
. Experience with distributed computing andhigh-frequency systems
. Performance optimization for latency-sensitiveapplications
Job ID: 146551251