My client, a leading global trading firm, is seeking a talented Site Reliability Engineer (SRE) to join their growing and dynamic team. As an SRE, you will play a crucial role in ensuring the stability, scalability, and reliability of their trading systems and infrastructure. You will collaborate closely with cross-functional teams to drive continuous improvement, automate processes, and optimize our technology stack for high-performance trading operations.
- System Stability and Reliability: Monitor, analyze, and troubleshoot the performance of their trading systems, identifying and resolving issues to ensure maximum uptime and minimal disruption to trading activities. Implement proactive measures to prevent future incidents and improve system resilience.
- Scalability and Performance Optimization: Collaborate with software engineers and infrastructure teams to design, implement, and optimize highly scalable and resilient trading infrastructure. Conduct capacity planning, performance tuning, and load testing to ensure the system can handle high transaction volumes and market demands.
- Automation and Tooling: Develop and maintain automation tools and frameworks to streamline operational processes, including deployment, configuration management, monitoring, and incident response. Implement and enhance monitoring and alerting systems to proactively identify and address potential problems.
- Incident Management: Participate in incident response and post-incident analysis, contributing to the development of incident management processes and best practices. Work closely with development teams to ensure timely resolution of critical issues and drive improvements to prevent recurrence.
- Collaboration and Communication: Collaborate with cross-functional teams, including traders, developers, and infrastructure teams, to understand their requirements and provide technical guidance. Communicate effectively with stakeholders to provide updates, share insights, and drive alignment on system reliability improvements.
- Continuous Improvement: Identify areas for improvement in system architecture, processes, and tooling. Stay up-to-date with the latest industry trends, technologies, and best practices related to SRE and trading systems. Drive initiatives to enhance system performance, reliability, and efficiency.
What you offer:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Proven experience as an SRE or in a similar role within the financial services industry, preferably within a global trading firm or high-frequency trading environment.
- Strong understanding of trading systems and financial markets, including order management, market data, electronic trading protocols, and low-latency technologies.
- Proficiency in programming/scripting languages such as Python
- Experience with infrastructure automation tools (e.g., Ansible, Chef, Puppet) and containerization technologies (e.g., Docker, Kubernetes) is a plus.
- Solid knowledge of Linux/Unix systems, networking fundamentals, and cloud platforms (e.g., AWS, Azure, GCP).
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack) and incident management processes.
- Strong problem-solving and analytical skills, with the ability to identify and resolve complex technical issues in a fast-paced trading environment.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams and stakeholders.
- Self-motivated and detail-oriented, with the ability to work independently and manage multiple priorities in a dynamic and high-pressure environment.
- Opportunity to work for a trading firm that offers global visibility and internal mobility
- Competitive base salary along with an average of 6-12 months of bonus
- Excellent benefits (annual leaves, medical etc)
- Hands on exposure with latest technologies on the market
If this opportunity interests you, please send your CV (word format) to firstname.lastname@example.org