Job Description
The company is seeking a Linux and DevOps Engineer to design and deploy a large-scale solution for systematic trading teams. The candidate will be responsible for ensuring a consistent environment across trading team environments, building and maintaining external package management tools, and designing a framework for production deployment. The role involves production monitoring and alerting, QA deployment, and implementing continuous integration frameworks.
Responsibilities include:
- Designing and maintaining a consistent environment across trading team environments.
- Building and maintaining external package management and tools.
- Designing and maintaining a framework for production deployment.
- Implementing production monitoring and alerting.
- Designing and implementing continuous integration frameworks.
- Managing backups, data archiving, and data organization.
- Database administration and configuration.
- Maintaining and updating the platform for stability and security.
- Designing and implementing cluster/cloud computing infrastructure.
- Troubleshooting and resolving system-related issues.
- Evaluating new compute and GPU hardware platforms and management.
Requirements:
- Strong working knowledge with Linux operating systems (RHEL8, RHEL9).
- Experience troubleshooting complex OS and kernel-related issues.
- Knowledge of Linux OS Builds and Patch management.
- Good understanding of LDAP/AD auth using Kerberos.
- Experience with Package Management (e.g., DNF, Yum, Satellite, Foreman).
- System performance tuning experience.
- Experience with Storage and protocols (e.g., NFS, CIFS, SMB, S3, XFS, ZFS, LVM, RAID).
- Experience with Configuration management (e.g., Ansible, Puppet/Chef).
- Experience with Monitoring, logging, reporting tools (e.g., Grafana, Datadog, ELK, MongoDB, Redis).
- Experience with Setup and management of Compute Grid (HTCondor, Rafay, Slurm).
- 3+ years of experience with Python and Bash scripting.
- Experience with Build and release automation.
- Experience with Database admin, configuration, and maintenance.
- Experience with Cloud compute deployment (e.g., AWS, Google).
- Experience with PaaS and containerization (Kubernetes, Docker, OCP).
- Experience with Automation and configuration management using Puppet, Chef, or Ansible.
- Ability to resolve problems effectively.
- Excellent communication skills.
The role offers:
- Opportunity to work with cutting-edge technologies.
- Chance to contribute to a large-scale, critical trading platform.
- Exposure to various aspects of DevOps and system administration.