AWS Site Reliability Engineer

scavern

Well-Known Member
Joined
Oct 2, 2004
Messages
210
Reaction score
10
Location
.
Job Description
Role Overview Help us build and operate secure, scalable, and resilient cloud platforms on AWS, ensuring 99.99% uptime for critical services. You'll automate infrastructure and mentor a growing SRE team.

Salary
Up to R100,000/month

##Non Negotiables
On-site 5 days in the Southern Suburbs.

What We're Looking For - 5+ years of deep AWS experience (EC2, EKS, Lambda, RDS, IAM) - Proven IaC skills with Terraform or AWS CDK - Expertise in observability tools (CloudWatch, Grafana, ELK) - Strong coding in Python or Go - Production experience with Kubernetes/EKS - AWS Professional certifications (preferred)
What You'll Do - Design and automate highly available AWS infrastructure using IaC - Reduce manual operations through automated workflows and CI/CD - Implement proactive alerting and define SLIs/SLOs - Mentor engineers on SRE best practices and AWS operations
Why You'll Love It - Solve complex reliability challenges at scale

Required Skills
AWS
Site Reliability Engineering
Python
Go
Kubernetes
EKS
Terraform
AWS CDK
IaC
CI/CD
CloudWatch
Grafana
ELK
Observability
SLO
SLI
 
I'm missing kubernetes, grafana, go. Why are we doing office work for cloud engineering jobs?
 


3a6d132bc9ce7398bbce0406369373ad9e59d5a3.jpg
 
Job Description
Role Overview Help us build and operate secure, scalable, and resilient cloud platforms on AWS, ensuring 99.99% uptime for critical services. You'll automate infrastructure and mentor a growing SRE team.

Salary
Up to R100,000/month

##Non Negotiables
On-site 5 days in the Southern Suburbs.

What We're Looking For - 5+ years of deep AWS experience (EC2, EKS, Lambda, RDS, IAM) - Proven IaC skills with Terraform or AWS CDK - Expertise in observability tools (CloudWatch, Grafana, ELK) - Strong coding in Python or Go - Production experience with Kubernetes/EKS - AWS Professional certifications (preferred)
What You'll Do - Design and automate highly available AWS infrastructure using IaC - Reduce manual operations through automated workflows and CI/CD - Implement proactive alerting and define SLIs/SLOs - Mentor engineers on SRE best practices and AWS operations
Why You'll Love It - Solve complex reliability challenges at scale

Required Skills
AWS
Site Reliability Engineering
Python
Go
Kubernetes
EKS
Terraform
AWS CDK
IaC
CI/CD
CloudWatch
Grafana
ELK
Observability
SLO
SLI


I'm interested. In addition to those though my techstack is:
Pickachu
Squirtle
OBE
Clairvoyance
Green/Gold

Would that be a good fit?
 
Top
Sign up to the MyBroadband newsletter
X