Variable grouping of results #95

jacobvjk · 2024-04-12T16:56:59Z

depends on RMI-PACTA/pacta.multi.loanbook.analysis#34
depends on RMI-PACTA/pacta.multi.loanbook.plot#30

updates calls of pacta.multi.loanbook.* functions to adjust to the new variable interface
gains a parameter BY_GROUP that can be used to generate aggregate metrics by any user defined variable, provided the variable exists in the matched_prioritized data set
in the given example, the aggregate alignment metric is calculated for:
- an aggregate loan book across the six fake input loan books, giving a meta loan book view
- grouped by loan book, as indicated by "group_id"
- grouped by another random user-provided variable above the loan book level, as indicated by "foo"
- grouped by a combination of "group_id" and "foo", giving a sub loan book view
at this point, calculating the aggregations at the level of multiple combined dimensions is only done for the calculation of results. Plots are currently limited to one level of aggregation, because a combination across variables is not in all cases straight-forward

NOTE:

calculation of the aggregate metric for corporate benchmarks is temporarily suspended, as it turns out too resource intense to calculate these on standard machines and we have not seen heavy use of the benchmark in the aggregate metric at this point

EXAMPLE OUTPUTS with new grouping functionality

Sankey plot at the aggregate level:

Sankey plot calculated based on the foo split:

Sankey plot calculated based on the group_id split:

Scatter plot alignment by exposure based on foo split:

Scatter plot alignment by exposure based on group_id split:

jdhoffa

Consider renaming by_groups to by_group otherwise LGTM (i think, it's a pretty mammoth PR so I may have missed something)

plot_aggregate_loanbooks.R

MonikaFu · 2024-04-24T15:55:35Z

I run it with test data provided and using the two versions of the supporting packages mentioned. It seems to work correctly in general. For some plots the axis labels overlap with the title but since this is only a demo I am not sure if it is worth it to spend time on it. I guess you would need to play around with figure size when saving. Also - the sankey plot per company is rather unreadable which is to be expected with a big number of companies.

jacobvjk · 2024-04-24T16:04:59Z

I run it with test data provided and using the two versions of the supporting packages mentioned. It seems to work correctly in general. For some plots the axis labels overlap with the title but since this is only a demo I am not sure if it is worth it to spend time on it. I guess you would need to play around with figure size when saving. Also - the sankey plot per company is rather unreadable which is to be expected with a big number of companies.

I agree, especially re sankey plot with companies. In the end we need to decide what we want to maintain there. At the same time, we could just as well show examples using other variables with less categories. In any case, this seems like a topic for discussing the standardized P4S offering

jacobvjk added 5 commits April 11, 2024 19:17

varible grouping

a2df1e3

comment out benchmark, simplify write

f67a8cc

flexible groups in plots part 1

3fc6d57

add comment

3249be9

scripts able to handle flexible grouping

edeaa94

jacobvjk mentioned this pull request Apr 17, 2024

Adjust to variable result grouping RMI-PACTA/pacta.multi.loanbook.plot#30

Merged

jacobvjk added 2 commits April 17, 2024 12:30

clean up

87119a8

adapt to new arg name by_group for scatter alignment exposure plot

79b3814

jacobvjk requested a review from jdhoffa April 17, 2024 11:19

jacobvjk changed the title ~~Variable groups~~ Variable grouping of results Apr 17, 2024

jacobvjk marked this pull request as ready for review April 17, 2024 11:20

jacobvjk added 2 commits April 18, 2024 16:53

adjust arg names

614fce7

adjust data_level from bank to group_var

d5c114c

jdhoffa approved these changes Apr 19, 2024

View reviewed changes

plot_aggregate_loanbooks.R Outdated Show resolved Hide resolved

plot_aggregate_loanbooks.R Outdated Show resolved Hide resolved

jacobvjk added 5 commits April 19, 2024 10:13

rename parameter for clarity

90bb7c3

adapt check to let by_group = NULL pass

10b679a

rm outdated TODOs

e7f1bce

rm outdated TODO

cf4e67e

streamline by_group and add clear explanation

9b656ee

jacobvjk merged commit b652a3e into main Apr 24, 2024

jacobvjk deleted the variable-groups branch April 24, 2024 16:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Variable grouping of results #95

Variable grouping of results #95

jacobvjk commented Apr 12, 2024 •

edited

Loading

jdhoffa left a comment

MonikaFu commented Apr 24, 2024

jacobvjk commented Apr 24, 2024

Variable grouping of results #95

Variable grouping of results #95

Conversation

jacobvjk commented Apr 12, 2024 • edited Loading

jdhoffa left a comment

Choose a reason for hiding this comment

MonikaFu commented Apr 24, 2024

jacobvjk commented Apr 24, 2024

jacobvjk commented Apr 12, 2024 •

edited

Loading