Skip to content

Latest commit

 

History

History
54 lines (39 loc) · 1.33 KB

README.md

File metadata and controls

54 lines (39 loc) · 1.33 KB

Enron Data Import Project

Technologies used in the project

  • Oracle/Open JDK 1.8 (must be pre-installed).
  • Maven 3.3+. (must be pre-installed).
  • Elasticsearch 2.4.5 (must be pre-installed).
  • Jest rest client for ElasticSearch.
  • Jcommander command line parameter parsing.
  • Log4j 2 for logging.
  • JUnit for unit testing.
  • Mockito for mocked testing.

How to Build the Project

This project uses the Maven build system, you can build the project locally. Go to the ./code/enron_indexer and just typing the following command in the console:


mvn clean package

How to Run the Project

Go the ./code/enron_indexer/target and typing the following command in the console:

java -jar enron_indexer.jar -i "file.path"

Usage Options:

Usage: <main class> [options]
  Options:
    --elasticsearchHost, -e
      Elasticsearch host
      Default: http://127.0.0.1:9200
    --elasticsearchMaxConnection, -c
      Max HTTP connections to Elasticsearch
      Default: 25
  * --input, -i
      File to input
    --verbose, -v
      Verbose
      Default: false

Logs:

Import process log is available at ./logs/import.log