
Question regarding the predicted variable #48

Open
nagsubhadeep opened this issue Sep 3, 2020 · 7 comments

@nagsubhadeep

Yifan,

Source: LogKeyModel_predict.py

In the code below, can you please explain the difference between the output and predicted variables? Is output the same as predicted, except that it is sorted? Also, shouldn't the value of the predicted variable be binary, so that we can determine whether the predicted outcome is anomalous or not?

output = model(seq)
predicted = torch.argsort(output, 1)[0][-num_candidates:]

Thanks,
Deep

@wuyifan18
Owner

Deep,
The output is a probability distribution over the log keys, giving the probability of each key appearing as the next log key given the history.
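
To make the distinction concrete, here is a minimal sketch, not the repository's exact code: the score tensor is a toy stand-in for output = model(seq), and the shape [1, num_classes] is assumed from how LogKeyModel_predict.py indexes it.

import torch

num_candidates = 3  # the "top g"
# toy stand-in for output = model(seq): one row of scores, one per log key
output = torch.tensor([[0.1, 2.5, 0.3, 1.7, 0.2, 3.0]])
# argsort sorts ascending, so the last num_candidates indices are the
# num_candidates most likely next log keys
predicted = torch.argsort(output, 1)[0][-num_candidates:]
print(predicted)  # tensor([3, 1, 5])

So predicted is not a binary flag; it is the set of candidate next keys, and the binary anomaly decision comes from checking whether the key that actually appeared is in that set (see the comments below).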

@nagsubhadeep
Author

nagsubhadeep commented Sep 3, 2020

Shouldn't the value of the predicted variable be something binary so that we can determine whether the predicted outcome is anomalous or not? I am getting a one-dimensional array instead.

@wuyifan18
Owner

Sort the possible log keys based on their probabilities and treat a key value as normal if it is among the top g candidates. Otherwise, the log key is flagged as coming from an abnormal execution.

You can read the paper for details.
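
A minimal sketch of that rule, with self-contained toy values rather than the repository's exact variables; label stands for the log key that actually appeared next:

import torch

num_candidates = 3  # the "top g"
output = torch.tensor([[0.1, 2.5, 0.3, 1.7, 0.2, 3.0]])   # toy scores, one per log key
predicted = torch.argsort(output, 1)[0][-num_candidates:]  # top-g candidates: tensor([3, 1, 5])

label = 4  # the log key that actually appeared next (toy value)
if label in predicted:
    print('normal: the observed key is among the top-g candidates')
else:
    print('anomaly: the observed key is outside the top-g candidates')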

@Rufaida94

@wuyifan18 where can I modify top g in your code?

@wuyifan18
Owner

@Rufaida94 here

parser.add_argument('-num_candidates', default=9, type=int)
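
For example, a hypothetical invocation with a different value (assuming this parser lives in LogKeyModel_predict.py and the script's other arguments keep their defaults):

python LogKeyModel_predict.py -num_candidates 15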

@Rufaida94

Thank you @wuyifan18. I know that num_candidates is a hyperparameter that is supposed to be adjusted for the dataset. But my question is: if my data has 24297 num_classes (while your HDFS dataset has only 28 num_classes), what would be a reasonable num_candidates? For example, is 1000 too high or too low? I know this is a very vague question, but any pointers are appreciated.

@wuyifan18
Owner

wuyifan18 commented Jul 5, 2021

@Rufaida94 num_candidates is a hyperparameter, which means you should tune it according to your evaluation metrics, such as the F1 measure.
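
A minimal sketch of such a sweep, using scikit-learn's f1_score. The labels and ranks below are synthetic stand-ins; in practice they would come from running the model on a labelled validation set, and the one-prediction-per-session simplification is an assumption, not the repository's exact evaluation.

import numpy as np
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)
num_sessions = 200
num_classes = 24297  # size of the log-key vocabulary in this example

# synthetic stand-ins: ground-truth anomaly labels, and the rank the observed
# next key received in the model's sorted output (0 = most likely)
y_true = rng.integers(0, 2, size=num_sessions)
observed_rank = rng.integers(0, num_classes, size=num_sessions)

for g in (10, 100, 1000, 5000):
    # flag a session as anomalous when its observed key falls outside the top-g candidates
    y_pred = (observed_rank >= g).astype(int)
    print(f'num_candidates={g:5d}  F1={f1_score(y_true, y_pred):.3f}')

Pick the num_candidates value that gives the best F1 on that validation data.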
