ADR for Scorecard metrics aggregation and stack quality scoring #439

mayaCostantini · 2022-08-01T17:30:45Z

Related Issues and Dependencies

Related to #434 and thoth-station/thamos#1148

This introduces a breaking change

No

This Pull Request implements

ADR Proposal for aggregating OSSF Scorecards data and computing a software stack quality score.

The logic proposed below would imply to approximate properly the budget allocated to aggregate this data from BigQuery.
To estimate this budget, we would need information about the average number of packages updated with a new release on each package-releases-job run and look more closely at BigQuery pricing according to the API calls we will need to make.

goern · 2022-08-02T12:26:42Z

docs/adr/0007-scorecard-metrics.md

+
+The data aggregation and score computation logic would be implemented as follows:
+
+* With each `package-releases-job` run, aggregate data about the latest package release and corresponding Scorecards data retrieved from BigQuery


Can e use the scorecard command line Utility for this?

As we want to aggregate data about packages we do not have yet into the prescriptions database, I don't think this will work unless we modify the scorecards handler logic. Another alternative to this architecture would be to compute prescriptions from new releases aggregated on package-release-job without storing this data in the DB, which will (if I'm not mistaken) greatly increase the number of prescriptions we already have. The global quality metrics could then be derived from prescriptions directly and stored in the DB on a regular job run for example.

goern · 2022-08-08T09:52:51Z

/approve

sesheta · 2022-08-08T09:54:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: goern

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [goern]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

sesheta added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Aug 1, 2022

mayaCostantini requested a review from goern August 1, 2022 17:31

sesheta requested review from harshad16 and KPostOffice August 1, 2022 17:31

ADR for Scorecard metrics aggregation and stack quality scoring

1b1477d

mayaCostantini force-pushed the adr-scorecard-metrics branch from 334c2bf to 1b1477d Compare August 1, 2022 18:40

This was referenced Aug 2, 2022

Aggregate Scorecards metrics on a new package release #440

Open

Create a new table for storing Scorecard metrics thoth-station/storages#2668

Open

goern reviewed Aug 3, 2022

View reviewed changes

sesheta added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 8, 2022

sesheta merged commit daaf5f0 into thoth-station:master Aug 8, 2022

mayaCostantini deleted the adr-scorecard-metrics branch August 16, 2022 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADR for Scorecard metrics aggregation and stack quality scoring #439

ADR for Scorecard metrics aggregation and stack quality scoring #439

mayaCostantini commented Aug 1, 2022

goern Aug 2, 2022

mayaCostantini Aug 3, 2022

goern commented Aug 8, 2022

sesheta commented Aug 8, 2022


		The data aggregation and score computation logic would be implemented as follows:

		* With each `package-releases-job` run, aggregate data about the latest package release and corresponding Scorecards data retrieved from BigQuery

ADR for Scorecard metrics aggregation and stack quality scoring #439

ADR for Scorecard metrics aggregation and stack quality scoring #439

Conversation

mayaCostantini commented Aug 1, 2022

Related Issues and Dependencies

This introduces a breaking change

This Pull Request implements

goern Aug 2, 2022

Choose a reason for hiding this comment

mayaCostantini Aug 3, 2022

Choose a reason for hiding this comment

goern commented Aug 8, 2022

sesheta commented Aug 8, 2022