type "bash run.sh" to run the code. It needs Java 8 installed in a Linux enviroment
The code is based in Lucene. Lucene is a java-based document indexer.
Instead of keeping the documents in a file I'm using RAMDirectory to keep them all in memory. By simply changing the directory it's possible to work with bigger datasets.