As a Site Reliability Engineer at YouGov, you will join our talented individuals in being responsible for the delivery, optimization, resilience, and availability of high-value and high-transaction-rate services trusted and used by both the general public and some of the largest brands in the world. Site Reliability Engineering is a discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE’s ensure that YouGov’s internally critical and externally visible systems maintain the appropriate service levels (availability, latency, and reliability) to serve our customers’ needs, and reduce the friction for managing change, while being strategic about capacity, and constantly managing performance. SRE is a mindset and a set of engineering approaches focusing on delivery of the appropriate architecture, building infrastructure, optimizing existing systems, and eliminating toil through automation.
SREs have the acumen and experience to provide direct technical contributions to major projects both in code, and in building and optimizing the production environment. You will identify and solve critical problems and build automation to prevent their recurrence. You align with your peers across engineering, deliver subject matter expertise for the infrastructure within your product area, and draw on your strong communication skills to collaborate with your peers in other geographies. Your perspectives help foster and support successful delivery of reliability engineering, and you influence by way of metrics, data, and automation.
Any additional info:
This position is 100% remote, therefore having experience within a remote environment would be ideal.
Tagged as: kubernetes[...]