You will be part of an agile team and play a vital role in the design and development of a cloud-based data platform.
Build and manage the DataHub, which includes:
A front-end data catalog
DataHub users and data science projects management
Data subscriptions
An AWS s3 based Data Lake
etc.
Develop ways to improve self-service data consumption and data publishing:
Build and manage ETL pipelines in Airflow, which are responsible of ingesting the data and making the data available to users
Develop standard ways to deliver data in the DataHub
Develop CI/CD pipelines for data consuming teams to let them develop their products
etc.
You will be responsible for producing quality code and reusable components.
Using containerization, CI/CD and other automation technologies, you will be responsible for creating a backend for high availability and scalability, while at the same time being easily deployable, manageable and secure.
Together with the rest of the team you will be involved in the full product development process, from design, implementation, to testing, documentation and automated deployment.
Respond to and resolve operational incidents, performing root cause analysis and managing changes required to prevent future occurrences.
In this team you will have a wide range of responsibilities and should be willing to adapt to many different challenges.
Discuss with the users of the platform requirements and future improvements, but also come with proposals for our users on how to use the platform.