Site Reliability Engineer (sre)

Rosebank, GP, ZA, South Africa

Job Description

We're a world-leading smart mobility SaaS tech company with over 2,000,000 active users. Our teams are collaborative, vibrant and fast-growing, and all team members are empowered with the freedom to influence our products and technology.

Are you curious, innovative and passionate? Do you take ownership, embrace challenges, and love problem-solving?

We're looking for a Site Reliability Engineer (SRE) who will enable us to build industry disruptive tech products and revolutionize the way our customers use technology.



The Site Reliability Engineer (SRE) will be responsible for ensuring the reliability, performance, and scalability of Cartrack' Linux-based systems and services. This role combines software engineering with operations, focusing on automation, monitoring, and incident response. The position requires working in shifts and rotations to support 24/7 operations.



You want to



Maintain and improve the reliability, scalability, and performance of Cartrack' infrastructure and applications. Implement automation for deployments, monitoring, and system management. Troubleshoot production issues, perform root cause analysis, and implement permanent fixes. Develop and manage monitoring, alerting, and incident response processes. Work with development teams to design resilient and scalable systems. Participate in on-call shifts and rotation schedules to manage incidents and ensure uptime. Optimize system efficiency and cost-effectiveness in an open-source environment.

You have



Strong background in Linux/Unix system administration (open-source stack). Familiarity with monitoring and logging tools (Prometheus, Grafana etc.). Knowledge of networking, load balancing, and system security best practices. Strong problem-solving and debugging skills in a production environment. Proven experience in automation and scripting (Python, Bash, Go, or similar). Ability to design and maintain automation frameworks for deployments, monitoring, and system recovery. Hands-on experience with CI/CD pipelines and configuration management tools (e.g., GitLab CI, Ansible, Puppet, Terraform). Experience building self-healing and auto-remediation solutions for production environments.

Nice to Have



Experience with containerization and orchestration (Docker, Kubernetes). Exposure to microservices and service mesh environments. Knowledge of database reliability and performance tuning (PostgreSQL).

Qualifications



Bachelor's degree in Computer Science, Information Systems, or equivalent practical experience. 3+ years of experience in SRE, DevOps, or related infrastructure/operations roles. Ability to work flexible hours, including shift rotations and on-call duties.
Job Type: Full-time

Ability to commute/relocate:

Rosebank, Gauteng: Reliably commute or planning to relocate before starting work (Preferred)
Experience:

Linux: 4 years (Preferred) SRE: 3 years (Preferred) Network monitoring: 3 years (Preferred)
Work Location: In person

Beware of fraud agents! do not pay money to get a job

MNCJobs.co.za will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD1531576
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Rosebank, GP, ZA, South Africa
  • Education
    Not mentioned