EntreeETLPipeline_spark_streaming To run the program 1. Run the zookeeper zkServer start 2. Run the kafka kafka-server-start <... server.properties> 3. Run the producer script python3 EntreeKafkaProd.py 4. Run the pipeline script spark-submit --jars <jars...> EntreePipeline.py