v0.6.0

ato released this 08 Nov 06:09

New features:

  • Crawls can now be deposited by uploading a set of WARC files using the 'Upload' button on the crawl series page.
  • Additional WARC files can be uploaded into an existing crawl using the 'Upload' button on the crawl's WARC list page.

Changes:

  • Bamboo now uses the Spring Boot web framework instead of the Spark microframework.
    • This aligns Bamboo with other NLA applications and will make it easier for other developers familiar with Spring to work on the code.
    • It also lets us adopt Spring Security in the near future to implement a permissions system, so that functionality such as deposit and reporting can be exposed to partners.
    • A moderately comprehensive set of integration tests for the webapp was added using MockMvc.
  • Several dependencies were updated (JDK 11, jsoup 1.12.1, h2 1.4.200, oauth2-oidc-sdk 6.18, tika-parsers 1.22, webarchive-commons 1.1.9, guava 23.0, doss 1.6.3).

Known issues:

  • The download link for crawl artifacts in a subdirectory (e.g. 20190904085415/logs/crawl.log) returns a 404 error. This regression is caused by differences in how Spark and Spring handle request routing and will be fixed in the next release.
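
The note above does not say which routing difference is responsible. One common way this kind of regression can arise is a route pattern that matches only a single path segment where the previous framework's wildcard matched across slashes. The sketch below illustrates that general idea with plain java.util.regex; it uses neither Spark nor Spring, and the /crawls/1/artifacts/ URL prefix, the class name, and both patterns are hypothetical rather than Bamboo's actual routes.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only: these patterns are NOT Bamboo's real routes.
class ArtifactRouteSketch {
    // Matches exactly one path segment after /artifacts/ (no further slashes).
    static final Pattern SINGLE_SEGMENT =
            Pattern.compile("/crawls/\\d+/artifacts/[^/]+");

    // Matches one or more characters after /artifacts/, including slashes,
    // so artifacts in subdirectories are matched too.
    static final Pattern ANY_DEPTH =
            Pattern.compile("/crawls/\\d+/artifacts/.+");

    static boolean matchesSingleSegment(String path) {
        return SINGLE_SEGMENT.matcher(path).matches();
    }

    static boolean matchesAnyDepth(String path) {
        return ANY_DEPTH.matcher(path).matches();
    }

    public static void main(String[] args) {
        // Assumed URL layout; only the nested artifact path comes from the note.
        String topLevel = "/crawls/1/artifacts/crawl.log";
        String nested = "/crawls/1/artifacts/20190904085415/logs/crawl.log";

        System.out.println(matchesSingleSegment(topLevel)); // true
        System.out.println(matchesSingleSegment(nested));   // false (would surface as a 404)
        System.out.println(matchesAnyDepth(topLevel));      // true
        System.out.println(matchesAnyDepth(nested));        // true
    }
}
```

Under this assumption, a top-level artifact still resolves while a nested one falls through to "no matching route", which is consistent with the symptom described, though the actual cause may differ.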