Dockerfiles for the tstrohmeier/spark-development images and example setup
Docker Hub: https://hub.docker.com/r/tstrohmeier/spark-development/
Base Images:
- tstrohmeier/spark-development:2.2.0
- tstrohmeier/spark-development:2.4.5
Derived Images:
- tstrohmeier/spark-development:2.4.5-gradle-6.5
- tstrohmeier/spark-development:2.4.5-pip3
.
├── ...
├── docker                      # contains all the Dockerfiles
│   ├── gradle                  # extends the spark-development image with gradle
│   │   └── ...
│   ├── pip3                    # extends the spark-development image with python3 & pip3
│   │   └── ...
│   ├── spark                   # the spark-development base image
│   │   └── ...
│   └── ...
├── docker-compose-templates    # docker-compose template files for new projects
│   ├── gradle                  # Gradle template
│   │   └── ...
│   ├── sbt                     # SBT template
│   │   └── ...
│   └── ...
├── examples                    # example projects
│   ├── gradle                  # Gradle example project
│   │   └── ...
│   ├── pyspark                 # PySpark example project
│   │   └── ...
│   ├── sbt                     # SBT example project
│   │   └── ...
│   └── ...
├── scripts                     # CI / CD build scripts
└── ...
- spark
- hadoop
- sbt
Copy the appropriate docker-compose template file into your project root folder, then configure the location of your source code and the location of your dependency management tool's cache.
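The copy step can be sketched as a small shell helper. This is only an illustration under stated assumptions: the function name `copy_compose_template` is hypothetical, and the template file name `docker-compose.yml` is assumed, so check the chosen folder under docker-compose-templates for the actual file name.

```shell
# Hypothetical helper: copy a docker-compose template into a project root.
# ASSUMPTION: the template file is named docker-compose.yml.
copy_compose_template() {
  template_dir="$1"   # e.g. docker-compose-templates/gradle
  project_root="$2"   # root folder of your own project
  target="$project_root/docker-compose.yml"

  # Refuse to overwrite a compose file that already exists in the project.
  if [ -e "$target" ]; then
    echo "refusing to overwrite existing $target" >&2
    return 1
  fi

  cp "$template_dir/docker-compose.yml" "$target"
}

# Example call (paths are placeholders):
#   copy_compose_template docker-compose-templates/gradle /path/to/my-project
```

After copying, the source and dependency cache locations in the copied file still have to be edited by hand to match your project.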
The examples folder contains example projects for different programming languages and dependency management tools. Copy the folder you need and modify it to suit your project.
# version 2.4.5
docker pull tstrohmeier/spark-development:2.4.5
Any improvement submitted as a pull request is welcome!
Thanks go to these wonderful people:
- Thomas
- Fibs7000
This project follows the all-contributors specification. Contributions of any kind welcome!