
Add log persistence for Make Data Count #156

Open
poikilotherm opened this issue Jan 24, 2020 · 3 comments
Labels
enhancement (New feature or request) · integration (Everything regarding a Dataverse integration)

Comments

@poikilotherm
Member

Since upstream release 4.18, you can simply switch on logging for Make Data Count. We should persist those log files somehow, so we can decide how to process them later.

Maybe use a sidecar to pick up the logs and store them somewhere safe instead of storing them on a volume?

@poikilotherm poikilotherm added enhancement New feature or request integration Everything regarding a Dataverse integration labels Jan 24, 2020
@poikilotherm poikilotherm added this to the v4.19 milestone Jan 24, 2020
@qqmyers
Member

qqmyers commented Jan 28, 2020

What's the reason for persistence (beyond a volume)? Once these logs are processed, the results are back in Dataverse tables. Is the intent to allow reprocessing in the future?

@pdurbin
Member

pdurbin commented Jan 29, 2020

This is a guess but perhaps @poikilotherm is thinking about multiple Glassfish instances. There's a note about Make Data Count at http://guides.dataverse.org/en/4.19/installation/advanced.html#multiple-glassfish-servers

@poikilotherm
Member Author

@qqmyers and @pdurbin thanks for asking and getting in touch.

My idea behind shipping those logs away from the containers is indeed about scaling, but also about avoiding too much persistence within the Dataverse app. IMHO these logfiles are similar to access logs, which shouldn't be part of the application's persistence (that makes things overly complex, with too many volumes to handle) but should become part of a log stack ASAP.

IMHO it makes more sense to handle such logs the same way you handle access logs etc. nowadays: use something like the ELK stack for ingest, then query the index later to retrieve the data. We might even think about pushing things into a separate Solr core, as Solr is already present in every Dataverse installation.

Feeding the index from log files written to disk/memory is really easy with sidecar containers, using tools like Logstash/Beats or Fluentd.
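To illustrate the sidecar pattern described above, here is a minimal Kubernetes Pod sketch: Dataverse writes its Make Data Count logs to a shared `emptyDir` volume, and a Fluentd sidecar tails them and forwards to a log stack. The image names, mount paths, and the referenced Fluentd ConfigMap are illustrative assumptions, not the project's actual configuration.

```yaml
# Hypothetical sketch only: names, images, and paths are assumptions.
apiVersion: v1
kind: Pod
metadata:
  name: dataverse
spec:
  containers:
    - name: dataverse
      image: iqss/dataverse-k8s          # placeholder image name
      volumeMounts:
        - name: mdc-logs
          mountPath: /opt/payara/counter-logs   # assumed MDC log directory
    - name: log-shipper
      image: fluent/fluentd:v1.16-1
      volumeMounts:
        - name: mdc-logs
          mountPath: /var/log/mdc
          readOnly: true                 # sidecar only reads the logs
        - name: fluentd-config
          mountPath: /fluentd/etc
  volumes:
    - name: mdc-logs
      emptyDir: {}                       # ephemeral; logs leave the pod via the shipper
    - name: fluentd-config
      configMap:
        name: fluentd-mdc-config         # hypothetical: tails counter_*.log, outputs to ELK/Solr
```

With this layout the application pod stays stateless: the counter logs never need a persistent volume, because the sidecar streams them into the log stack as they are written.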

@poikilotherm poikilotherm added this to In Focus (go production) in Forschungszentrum Jülich Feb 3, 2020
@poikilotherm poikilotherm moved this from In Focus (go production) to I'll be back! in Forschungszentrum Jülich Apr 3, 2020
@poikilotherm poikilotherm removed this from the v4.19 milestone Apr 17, 2020
@poikilotherm poikilotherm removed this from I'll be back! in Forschungszentrum Jülich Sep 3, 2020