You will take responsibility for managing our master data set, developing reports, and troubleshooting data issues. To do well in this role you need a very fine eye for detail, experience as a data analyst, and deep understanding of the popular data analysis tools and databases:
Full job description available for short-listed candidates.
Requirements
Experience/Skills:
5+ years of experience in a Data Engineer role
Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
Experience building and optimizing xe2x80x98big dataxe2x80x99 data pipelines, architectures, and data sets.
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
Strong analytic skills related to working with unstructured datasets.
Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
Working knowledge of message queuing, stream processing, and highly scalable xe2x80x98big dataxe2x80x99 data stores.
Strong project management and organizational skills.
Ability to work with stakeholders to assess potential risks.
Ability to analyze existing tools and databases and provide software solution recommendations.
Ability to translate business requirements into non-technical, lay terms.
High-level experience in methodologies and processes for managing large-scale databases.
Demonstrated experience in handling large data sets and relational databases.
Understanding of addressing and metadata standards.
High-level written and verbal communication skills.
Experience supporting and working with cross-functional teams in a dynamic environment.
Experience using the following software/tools:
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.
Experience with data pipeline and workflow management tools: Airflow, GCP Dataflow etc.
Experience with AWS cloud services: EC2, EMR, RDS, Redshift
Experience with stream-processing systems: Storm, Spark-Streaming, etc.
Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
Education and Qualifications:
Matric essential
Degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field.
Benefits
Medical Aid, Provident Fund, Group Life
MNCJobs.co.za will not be responsible for any payment made to a third-party. All Terms of Use are applicable.