Contexte du recrutement et définition de poste
Position Overview:
CIH BANK is seeking a Head of Data Engineering to lead the design, automation, and optimization of its data infrastructure. This role involves overseeing data architecture, orchestrating pipelines, ensuring data quality and security, and defining engineering standards. Working closely with Data Science, Analytics, and Data Management teams, the ideal candidate will bring deep expertise in big data technologies (Hadoop, PySpark), DevOps (CI/CD, Docker, Ansible), and team leadership, while staying current with industry best practices to drive a modern, efficient, and reliable data ecosystem.
Key Responsibilities:
Architecture & Industrialization:
- Design and continuously improve the architecture of the Data stack.
- Oversee the implementation and maintenance of the data platform.
- Automate and orchestrate data processing workflows.
- Optimize Data pipelines when required.
- Implement unit and functional tests for all critical data applications.
Collaboration & Methodology:
- Work closely with the Data Science, Data Analytics, and Data Management teams to understand needs and provide appropriate technical solutions.
- Define and enforce a methodology for the Data Engineering team.
- Ensure systematic documentation of all engineering work and solutions.
Data Quality & Security:
- Implement access control rules and data security recommendations.
- Contribute to defining technical data quality metrics.
- Deploy and monitor data quality indicators across the platform.
Benchmarking and state-of-the-art monitoring: Capacity to benchmark data architectures and pipelines and stay up-to-date with the latest practices.
Norming: Define and maintain the standards and norms for data pipelines, code and Operations.
Profil recherché
Qualifications:
Bachelor's degree in Big Data, Cloud computing, Data Engineering, Computer Science, or a related field.
Strong knowledge in Data modeling and SQL Querying.
Strong knowledge in Hadoop based technologies.
Strong knowledge in Python and PySpark.
Strong knowledge in DevOps technologies: CI/CD (Jenkins), Containers (Docker) and Infra as Code (Ansible);
Strong analytical and problem-solving skills.
Ability to lead a team and work in a group.
Bonus:
GitLabFLow framework;
Terraform