- Oracle/Open JDK 1.8 (must be pre-installed).
- Maven 3.3+. (must be pre-installed).
- Elasticsearch 2.4.5 (must be pre-installed).
- Jest rest client for ElasticSearch.
- Jcommander command line parameter parsing.
- Log4j 2 for logging.
- JUnit for unit testing.
- Mockito for mocked testing.
This project uses the Maven build system, you can build the project locally.
Go to the ./code/enron_indexer
and just typing the following command in the console:
mvn clean package
Go the ./code/enron_indexer/target
and typing the following command in the console:
java -jar enron_indexer.jar -i "file.path"
Usage: <main class> [options]
Options:
--elasticsearchHost, -e
Elasticsearch host
Default: http://127.0.0.1:9200
--elasticsearchMaxConnection, -c
Max HTTP connections to Elasticsearch
Default: 25
* --input, -i
File to input
--verbose, -v
Verbose
Default: false
Import process log is available at ./logs/import.log