This is a Site Reliability Engineer role with one of the leading companies in AU right now IMC Trading with an amazing team. They are continuing to grow rapidly. This is the chance to join right as the takes off. More About the Role at IMC Trading The Platform Engineering organization at IMC Trading is focused on accelerating tech team workflows through providing self-service tools, services, documentation, and support. Platform Engineering is responsible for designing, building, and maintaining the underlying runtime platforms that IMC's software applications depend on. The mission is to streamline development processes, establish a consistent technical foundation across regions, and empower teams with the necessary resources to innovate efficiently. Platform Engineering is a global team and acts as a bridge between the technical requirements of application development and the practical aspects of deploying and maintaining those applications in a production environment, minimizing friction, and ensuring that tech teams can operate seamlessly and drive progress forward. The Platform Engineering team is seeking a versatile and passionate Site Reliability Engineer to play a critical role in enhancing and optimizing our developer services infrastructure. You will join a highly experienced team that supports a wide array of critical systems, including source control, continuous integration pipelines, and observability tools, all of which are vital to the stability and performance of our trading platforms. Your Core Responsibilities: - Optimize and enhance the reliability, scalability, and performance of our development services infrastructure. - Administer and oversee source control systems, continuous integration services, artifact repositories, metrics, logging, observability tools, and related systems, both in on-premises environments and leveraging AWS cloud services. - Integrate and deploy new Cloud SaaS solutions, collaborating cross-functionally to deliver innovative tools and functionalities to our teams. - Partner with development teams to align system capabilities with their evolving needs, ensuring seamless operations and support, boosting organizational efficiency - Diagnose and resolve production incidents swiftly, supporting Global Operations in a 'follow the sun' model, working with regional peers to guarantee continuous 24/7 system availability. - Identify and eliminate system performance bottlenecks and drive uptime improvements through proactive automation, monitoring, and performance tuning. - Lead and mentor junior team members, fostering their growth and technical expertise. Your Skills and Experience: - 3 years of work experience in a relevant role - Bachelor's Degree in Computer Engineering, Computer Science or equivalent - Strong programming experience in Python or Go; Shell scripting is a plus. - Strong knowledge of Linux/Unix Systems - Experience in operating and enhancing services across various environments, designing and implementing disaster recovery plans, high availability (HA) configurations, and failover strategies. - Hands-on experience with containerization and orchestration Tools, including Kubernetes and Docker. - Experience with CI/CD tools such as Jenkins, GitLab CI, or TeamCity, including the automation of build, test, and deployment processes. - Understanding of security principles and best practices, including experience managing secrets and compliance frameworks If you don’t think you're a perfect fit, you should still sign up to Hatch and create a profile, we'll match you to other roles that suit your profile. Hatch exists to level the playing field for people as they discover a career that’s right for them. We model this in our hiring process for our partners like IMC Trading. ✅ Applying here is the first step in the hiring process for this role at IMC Trading. We do not discriminate on the basis of gender identity, sexual orientation, cultural identity, disability, age, or any other non-merit factors. To put it simply, Hatch is for everyone.