Large Scale Data Processing Course tasks l1 Linux - bash, ssh, scp, tmux, htop, kill, killall, pipe operator, ls, sed, vim, cat Docker - Dockerfile, docker-compose, containers in general Python - pip, virtualenv, requirements, tox Parallelize computation in Python l2 Docker - Dockerfile, docker-compose, containers in general Python - pip, requirements Celery Task queue l3 Text embedding Data persistency (MongoDB) Data analysis (Redash) l4 pySpark Linear regression Binary classification Multi-class classification l5 Kubernetes K3s Helm Docker Application deployment