- commons
- crawler
- documents
- preprocessor
- query-processor
- indexer
- spring-boot-ui
CORPORA_DIR
This environment variable names the directory in which the corpora are saved.
The values below are relative paths, so set the variable and run the commands from the directory that contains the corpora directory.
Windows: C:\> SET CORPORA_DIR=corpora\
macOS/Linux: $ export CORPORA_DIR=corpora/
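Because the value above is a relative path, it only resolves when the programs are started from the directory that contains corpora. The following is a minimal sketch for macOS/Linux that builds an absolute value instead. The helper name corpora_dir_for is hypothetical, and the sketch assumes the applications accept an absolute path in CORPORA_DIR, which this README does not confirm.

```shell
# Hypothetical helper: prints the absolute path of the corpora directory
# located under the given parent directory, with a trailing slash to match
# the values shown above.
# Assumption: the applications accept an absolute path in CORPORA_DIR.
corpora_dir_for() {
    # Resolve the parent directory to an absolute path; fail if it is missing.
    parent=$(cd "$1" && pwd) || return 1
    printf '%s/corpora/\n' "${parent%/}"
}
```

From the directory that contains corpora, the variable could then be set with: export CORPORA_DIR="$(corpora_dir_for .)"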
Go to the project home directory and run:
$ mvn clean package
├── commons-1.0-SNAPSHOT.jar
├── crawler-1.0-SNAPSHOT-jar-with-dependencies.jar
├── crawler-1.0-SNAPSHOT.jar
├── documents-1.0-SNAPSHOT.jar
├── indexer-1.0-SNAPSHOT.jar
├── preprocessor-1.0-SNAPSHOT.jar
├── query-processor-1.0-SNAPSHOT.jar
├── spring-boot-ui-1.0.jar
- The CORPORA_DIR environment variable must be set.
- The directory $CORPORA_DIR/bbc must exist.
$ java -jar crawler-1.0-SNAPSHOT-jar-with-dependencies.jar
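The crawler preconditions listed above can be checked before launching. This is a minimal sketch for macOS/Linux shells. The function name check_crawler_preconditions and its return codes are hypothetical and not part of the project.

```shell
# Hypothetical pre-flight check for the crawler (not part of the project).
# Returns 0 when both preconditions hold, 1 or 2 otherwise.
check_crawler_preconditions() {
    # CORPORA_DIR must be set and non-empty.
    if [ -z "${CORPORA_DIR:-}" ]; then
        echo "CORPORA_DIR is not set" >&2
        return 1
    fi
    # The bbc directory must already exist under CORPORA_DIR.
    # ${CORPORA_DIR%/} strips one trailing slash so the path is not doubled.
    if [ ! -d "${CORPORA_DIR%/}/bbc" ]; then
        echo "Missing directory: ${CORPORA_DIR%/}/bbc" >&2
        return 2
    fi
    return 0
}
```

One way to use it is to chain the launch behind the check: check_crawler_preconditions && java -jar crawler-1.0-SNAPSHOT-jar-with-dependencies.jar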
- The CORPORA_DIR environment variable must be set.
- The bbc corpus documents must be in $CORPORA_DIR/bbc.
- Port 8282 must be free.
$ java -jar spring-boot-ui-1.0.jar
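The three UI preconditions listed above can also be checked before launching. This is a minimal sketch, not part of the project. The function names port_is_free and check_ui_preconditions are hypothetical. The port test relies on bash's /dev/tcp redirection, so it assumes bash rather than plain sh, and it only probes 127.0.0.1.

```shell
# Hypothetical pre-flight check for the UI (not part of the project).

# Succeeds when nothing accepts TCP connections on the given local port.
# Assumption: run under bash, whose /dev/tcp redirection attempts a connection.
port_is_free() {
    ! (exec 3<>"/dev/tcp/127.0.0.1/$1") 2>/dev/null
}

# Returns 0 when all preconditions hold, 1, 2 or 3 otherwise.
check_ui_preconditions() {
    # CORPORA_DIR must be set and non-empty.
    if [ -z "${CORPORA_DIR:-}" ]; then
        echo "CORPORA_DIR is not set" >&2
        return 1
    fi
    # The bbc directory must exist and contain at least one entry.
    bbc_dir="${CORPORA_DIR%/}/bbc"
    if [ -z "$(ls -A "$bbc_dir" 2>/dev/null)" ]; then
        echo "No bbc corpus documents found in $bbc_dir" >&2
        return 2
    fi
    # Port 8282 must not already be in use.
    if ! port_is_free 8282; then
        echo "Port 8282 is already in use" >&2
        return 3
    fi
    return 0
}
```

One way to use it is to chain the launch behind the check: check_ui_preconditions && java -jar spring-boot-ui-1.0.jar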
Using a web browser such as Chrome, browse to
http://localhost:8282