What will you need?
To be successful in this role you will need to have excellent analytic skills related to working with structured, semi structured and unstructured datasets, Expert SQL knowledge and experience with relational as well as no-SQL database systems.You must have expert level Programming skills in Python / PySpark, and it would be beneficial if you have also experience in Java / Scala programming.Furthermore, we would prefer you to have experience with Data Warehousing for large complex data sets – defining, building and optimizing data models based on use case requirements.
- University Degree in Computer Science, Information Systems, Statistics, or related field;
- Minimum 5 years of experience in the IT;
- Minimum 3 years of experience in the Data Engineer role;
- Expertise with Data Lake / Big Data Projects implementation in Cloud (preferably MS Azure) and / or On-premise platforms : o Cloud – Azure technology stack : ADLS Gen2, Databricks (proven experience is a big plus), EventHub, Stream Analytics, Synapse Analytics, AKS, Key Vault;o On Premise : Spark, HDFS, Hive, Hadoop distributions (Cloudera or MapR), Kafka, Airflow (or any other scheduler)
- Experience with designing and building lakehouse architectures in Parquet / Delta and Synapse Serverless or Databricks SQL (knowledge of Unity Catalogue is big plus)
- Ability to develop, maintain and distribute the code in modularized fashion
- Working experience with DevOps framework
- Very good understanding of Software Development Lifecycle, source code management, code reviews, etc.
- Experience in performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement;
- Experience in building processes supporting data transformation, data structures, metadata, dependency and workload managementYou should have :
- Ability to collaborate across different teams / geographies / stakeholders / levels of seniority
- Energetic, enthusiastic and results-oriented personality
- Customer focus with an eye on continuous improvement
- Motivation and ability to perform as a consultant in data engineering projects
- Ability to work independently but also within a Team - you must be a team player
- To coach junior data engineers
- Strong will to overcome the complexities involved in developing and supporting data pipelines
- Agile mindsetLanguage requirements :
- English – Fluent spoken and written
- Spanish / Portuguese – Fluent in one and the other one nice to have