SRE - Site Reliability Engineer
Senior Site Reliability Engineer (Observability)Location: London/UK (Remote)Contract: 12 Months InitialDay rate : £55 Per Hour - £62 Per Hour Inside IR35Job OverviewWe are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services.ResponsibilitiesDesign, deploy and scale observability platformsManage and scale Prometheus monitoring systemsDeploy and maintain large Elasticsearch clustersBuild and maintain data pipelines using KafkaDevelop alerting and monitoring frameworksAutomate infrastructure using Terraform and AnsibleDevelop tools and scripts using Python, Go, Ruby or BashWork with Linux systems (Debian/Ubuntu)Participate in on-call rotationImprove system reliability, performance and scalabilityRequired Skills5+ years experience in Site Reliability Engineering / DevOpsStrong Linux systems experienceObservability and Monitoring tools experiencePrometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana)KafkaTerraform / Infrastructure as CodeAnsible / Configuration ManagementProgramming experience (Python, Go, Ruby or Bash)Distributed systems and cloud infrastructure experienceThis is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please ..... full job details .....
Other jobs of interest...
Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!