Artem Yudovin
Profitero
Start of main content
Day 2
RU
Profitero team faced the following problem: there was one large job ETL, which consists of many iterations, where each iteration is any methodology. Suppose we want to apply the changes to iteration i, in this case, it will affect iteration i+1, because it is calculated based on the results of iteration i and so on.
The following questions arise:
Stack of technologies: Apache Spark, Apache Airflow, Jupyter, Apache Zeppelin, Docker Swarm, LakeFS.
Audience: it will be interesting for those who are faced with the problem of conducting experiments in pipelines.
Profitero
Profitero