Senior Software Engineer — The Rocks, Sydney
Expired

This is a Senior Software Engineer role with one of the leading companies in AU right now SafetyCulture with an amazing team. They are continuing to grow rapidly. This is the chance to join right as the takes off. More About the Role at SafetyCulture The Role As a Software Engineer in the Site Reliability Engineering team at SafetyCulture you’ll help to design, build and run resilient systems. You live and die by Murphy’s Law, knowing that anything that can go wrong will go wrong at the worst possible moment. You will help to foster a culture of designing for, and expecting failure in production systems - a culture where learning and knowledge-sharing is expected. You love to solve sticky cross-service and cross-domain problems, and have a passion to identify root causes in complex scenarios. You understand how important it is for the teams to analyse past incidents and learn from them. Most importantly you are a team-player, are excited about the prospect of working in a fast-paced demanding environment and get that learning happens at the edge of the comfort zone. How you can have an impact As one of a core team of experienced SREs, you will shape and mature the culture, define the processes that the development teams will follow, and allow the business to scale to millions of users. You will be a key driver of our observability culture, enabling teams to diagnose cross domain issues and building a unified experience of metrics, logs and traces. You’ll coach and educate your engineering colleagues on systems reliability and fault-tolerance best practice, identify gaps in existing systems and come up with remediation plans. You’ll improve metrics such as MTTR and MTBF, and promote a culture of sustainable incident response and blameless post-mortem. We encourage involvement in the community, open source work, attending talks and events, and experimenting with new technologies. How you will spend your time: - Engaging with teams across Engineering on reliability and performance issues - Building out core capabilities such as load testing, observability improvements and advanced deployment mechanisms - Write and maintain Go modules providing fundamental capabilities to our applications (e.g observability instrumentation) - Evolving our Incident Management processes and engaging in post incident reviews, driving our learning culture - Educating and driving the SRE mandate across the organisation What you'll need - Fluency in at least one modern programming language - A solid education in SRE concepts like SLOs - Experience with distributed systems and Unix/Linux systems internals (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN). - A good understanding of monitoring, logging, tracing, and observability instrumentation - Excellent human-handling-skills with an ability to build and maintain healthy cross-team relationships - You balance your love of systems-engineering with a product-mindset and build empathy with your customers and your product-engineering colleagues If you don’t think you're a perfect fit, you should still sign up to Hatch and create a profile, we'll match you to other roles that suit your profile. Hatch exists to level the playing field for people as they discover a career that’s right for them. We model this in our hiring process for our partners like SafetyCulture. ✅ Applying here is the first step in the hiring process for this role at SafetyCulture. We do not discriminate on the basis of gender identity, sexual orientation, cultural identity, disability, age, or any other non-merit factors. To put it simply, Hatch is for everyone.

Applications close Sunday, 24 November 2024
Take me to the job
Find more jobs nearby: Sydney, Woolloomooloo, Barangaroo, The Rocks, Haymarket.