Site Reliability Engineer Merchant Division

Johannesburg, GP, ZA, South Africa

Job Description

Purpose of the position:




The primary focus areas of the Site Reliability Engineer (SRE) will be system performance, availability, latency, efficiency, monitoring, capacity planning and emergency response.


The SRE will form a bridge between the software development teams building features and capabilities and our traditional IT operations teams. They will apply a software engineering mindset to system administration topics. They will brief the development teams on non-functional requirements, take the lead on the operationalisation of systems and take a hands-on role in Cloud management.




They will identify and automate inefficient and time-consuming manual tasks. They will train and equip the IT operations teams on Cloud and SRE methods and practices.


The SRE will form part of the Technology Leadership Team and contribute to leading the approach and implementation of the extensive re-platforming initiative.

Duties and Responsibilities (including but not limited to):



Communication:



* Bad news travels fast. In instances like this, notify your direct report as quickly as possible. We need to get bad news out before questions are asked about something we have not yet communicated.
* Please respond to all communication directed to you in a timely manner.
* Respond to messages or questions even if you are not done with a task; bad news is better than no news.
* Manage expectations through communication.

Compliance checks:





Ensure that all assigned daily, weekly or monthly checks are completed with evidence so that we do not put our compliance at risk. This includes, but is not limited to, network device config backups and Wi-Fi scans in the data centres, office and cloud environments.

Network and workload Management:



* Implement proactive network maintenance, including updating licenses and firewall software to ensure system uptime.
* Ensure that all systems are updated regularly and secured to ensure PCI compliance.
* Proactively respond to network outages and inform vendors and third parties of any external issues that need to be resolved.
* Review and make recommendations on the NI for any initiatives relating to the network and cloud workloads.
* Make recommendations on all hardware and software for networking and on subsequent upgrades.
* Keep track of resource usage on network equipment in the data centres and in the office.
* Ensure that the process for granting VPN access to the systems is followed.
* Maximise network performance by monitoring performance, troubleshooting network problems and outages, scheduling upgrades and managing network
* Configure and install software, routers and other network devices.
* Monitor network performance and integrity.
* Identify potential areas for automation, implement automated tasks and monitor their effectiveness.
* Optimise and automate network and workload deployments.

General:



* Work as a team and help where possible.
* Take ownership of the areas assigned to you and ensure you manage them properly.

Security:



* Design specifications for establishing network and cloud security to protect vulnerable systems.
* Secure network systems and cloud workloads by establishing and enforcing policies and defining and monitoring access.
* Ensure timeous escalation of security and client issues.
* Notify the Security Manager of any suspected incidents in a timely manner and assist in the investigation of any incidents as required.
* Ensure adherence to company policies around network security.
* Set authentication and authorisation to restrict users from accessing vital information as necessary.
* Ensure users get correct VPN profiles to access only the required system/device.

Infrastructure Management:



* Manage all cloud environments in line with DevOps and SRE practices.
* Manage all network WAN and LAN equipment and network monitoring, including backups and patching.
* Manage and maintain network infrastructure in the data centres, office and cloud.
* Contain and optimise cloud costs.
* Upgrade data network equipment to the latest stable firmware releases.
* Maintain the office Wi-Fi network and ensure all end users are able to connect.

Maintaining Documentation:



Maintain the following documentation:

* Operational process documentation on all relevant systems
* All relevant documentation for annual PCI review
* Change request documentation (all system/network level change requests)
* Network procedures
* Network diagrams for all integrations
* Network diagrams for each data centre and cloud environment
* Network diagram for the office
* Overview network diagram of connectivity between data centres, the office and 3rd party clients

CRM/Ticketing System:



* Ensure all tasks are recorded in the CRM/Ticketing System.
* Complete tickets within SLA.
* Log all changes and follow the Change Control process.

Skills and competencies:



* Strong Communication Skills - Ability to clearly explain technical solutions to
* Collaborative Approach - Works well with internal teams to ensure smooth client
* Sales and Negotiation Ability - Identifies and capitalises on cross-sell
* Attention to Detail - Ensures data accuracy and record
* Proactive Mindset - Willingness to conduct site visits and provide remote
* CRM Proficiency - Ability to manage and update client data on internal platforms effectively

Qualifications and Experience:



* Diploma in IT in Software (NQF Level 6)
* AWS Solutions Architect qualified
* Experience in building and operating cloud environments
* 3 years in a Cloud Operations technical role
* AWS cloud practitioner
* Development/Programming

Reporting to:



DevOps Manager

Department:



Technology

Location:



* Gauteng



Job Detail

  • Job Id
    JD1416987
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Johannesburg, GP, ZA, South Africa
  • Education
    Not mentioned