Site Reliability Engineer — The Rocks, Sydney
Expired

The Platform Engineering organization at IMC Trading is focused on accelerating tech team workflows through providing self-service tools, services, documentation, and support. Platform Engineering is responsible for designing, building, and maintaining the underlying runtime platforms that IMC's software applications depend on. The mission is to streamline development processes, establish a consistent technical foundation across regions, and empower teams with the necessary resources to innovate efficiently. Platform Engineering is a global team and acts as a bridge between the technical requirements of application development and the practical aspects of deploying and maintaining those applications in a production environment, minimizing friction, and ensuring that tech teams can operate seamlessly and drive progress forward. The Platform Engineering team is seeking a versatile and passionate Site Reliability Engineer to play a critical role in enhancing and optimizing our developer services infrastructure. You will join a highly experienced team that supports a wide array of critical systems, including source control, continuous integration pipelines, and observability tools, all of which are vital to the stability and performance of our trading platforms. Your Core Responsibilities:Optimize and enhance the reliability, scalability, and performance of our development services infrastructure.Administer and oversee source control systems, continuous integration services, artifact repositories, metrics, logging, observability tools, and related systems, both in on-premises environments and leveraging AWS cloud services.Integrate and deploy new Cloud SaaS solutions, collaborating cross-functionally to deliver innovative tools and functionalities to our teams.Partner with development teams to align system capabilities with their evolving needs, ensuring seamless operations and support, boosting organizational efficiencyDiagnose and resolve production incidents swiftly, supporting Global Operations in a 'follow the sun' model, working with regional peers to guarantee continuous 24/7 system availability.Identify and eliminate system performance bottlenecks and drive uptime improvements through proactive automation, monitoring, and performance tuning.Lead and mentor junior team members, fostering their growth and technical expertise. Your Skills and Experience:3 years of work experience in a relevant roleBachelor's Degree in Computer Engineering, Computer Science or equivalentStrong programming experience in Python or Go; Shell scripting is a plus.Strong knowledge of Linux/Unix SystemsExperience in operating and enhancing services across various environments, designing and implementing disaster recovery plans, high availability (HA) configurations, and failover strategies.Hands-on experience with containerization and orchestration Tools, including Kubernetes and Docker.Experience with CI/CD tools such as Jenkins, GitLab CI, or TeamCity, including the automation of build, test, and deployment processes.Understanding of security principles and best practices, including experience managing secrets and compliance frameworks

Applications close Sunday, 9 February 2025
Take me to the job
Find more jobs nearby: Sydney, Woolloomooloo, Barangaroo, The Rocks, Haymarket.