Software Engineer (advanced) 1254

South Africa, South Africa

Job Description

SUMMARY:
Join our elite team of Site Reliability Engineers and take ownership of large-scale, fault-tolerant systems that never sleep. This role combines software engineering excellence with operational mastery, where your expertise directly impacts global business continuity and system resilience.
POSITION INFO:
Key Requirements:

  • Excellent experience PIC processes (Problem, Incident and Change management)
  • Excellent experience in operation of production-critical software
  • Excellent experience in cooperation with internal and external teams (including Provider management experience)
  • Infrastructure Management (Cloud and on-Prem)
  • Experience with Unix/Linux/Windows operating systems internals and administration or in-depth knowledge of the Unix networking stack
  • Experience in BASH, Python or PowerShell scripting
  • Experience with containerized Middleware (Docker, Docker Swarm, Kubernetes)
  • Solid understanding of monitoring and alerting practices (tools e.g Grafana, Prometheus, Elasticsearch) be able to develop new application metrics.
  • Experience with public cloud and hosting (AWS, Azure.)
  • Solid understanding of infrastructure as code principles and practical experience with Terraform or similar tools.
  • Agile methodology
  • Excellent technical understanding of IT systems
  • In-depth network know-how: Subnetting, Routing, Firewalling, DNS, (reverse)-proxies & understanding of OSI layers
  • Experience with Unix/Linux operating systems internals and administration or in-depth knowledge of the Unix networking stack.
  • Experience in developing software with C#
Key Responsibilities:
  • You will be a team member of a larger product team that focusses on the development, operate and support of a several mission-critical components together with external partners. You will be working on large-scale fault-tolerant systems and thrive to always improve the resiliency of our systems.
  • You will work closely with the product owner and are responsible for the planning and co-ordination of all Run activities.
  • You will strive to ensure continuous improvement for mission-critical systems.
  • As a Site Reliability Engineer with deep understanding of the underlaying systems, you take part in 24/7 on-call rotations with teams around the world and can restore systems in efficient manner.
For application! Apply here -

Beware of fraud agents! do not pay money to get a job

MNCJobs.co.za will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD1487715
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    South Africa, South Africa
  • Education
    Not mentioned