Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplicate entries in 1993 #217

Open
neagle opened this issue Aug 9, 2021 · 9 comments
Open

Duplicate entries in 1993 #217

neagle opened this issue Aug 9, 2021 · 9 comments
Labels
bug Bugs in our AGAGD system.

Comments

@neagle
Copy link

neagle commented Aug 9, 2021

This problem report was submitted by @gurujeet-ngc via email:

AGAGD shows a big spike of over 3000 games on a day in 1993. This is in error and a result of games from the Go congress being doubly counted. See analysis and recommended action below.

image

The doubling occurred on the reporting for the Go Congress where games were reported as occurring on 8/7/1993. The total games for tournament cong19930807a (1540) equals the sum of the games for cong19930807b (778) and cong19930807c (762).

image

Looking at just one player’s games, Pin_Player_1, it is clear that the 9 games played in ‘a’ reappear in ‘b’ or ‘c’. This is true for all other players as well.

Recommendation: delete tournament cong19930807 and its games from the database as they are erroneous duplicates. Additionally there is a tournament cong19930808 with a much smaller number of games that all appear to duplicate some of the games played in the previous tournaments. It is possible that these were self-pair games but should be investigated further to see if they are duplicates also.

image

@neagle neagle added the bug Bugs in our AGAGD system. label Aug 9, 2021
@vash3g
Copy link
Member

vash3g commented Aug 9, 2021

These should not be deleted directly from the database. they should be removed using ratings tools at best.

@neagle
Copy link
Author

neagle commented Aug 9, 2021

What are the problems / risks associated with deleting them from the database?

What would be your recommendation as to how to remove them with the ratings tools?

@gurujeet-ngc
Copy link

gurujeet-ngc commented Aug 9, 2021 via email

@michaelhiiva
Copy link
Contributor

If we were just to use the ratings tools for this and use the revert function, would that not require the original [tournament_name].in file? This would be based on the current documentation for our aga-ratings tools.

@gurujeet-ngc
Copy link

gurujeet-ngc commented Oct 4, 2021 via email

@vash3g
Copy link
Member

vash3g commented Oct 5, 2021

How do we know this is in error? Is this based on a large spike or historical research?

This should not be done directly though SQL commands. We have an archive of over 4000 old tournaments. At a minimum this should be testes with the ratings tools before any other attempts are made.

@gurujeet-ngc
Copy link

gurujeet-ngc commented Oct 5, 2021 via email

@michaelhiiva
Copy link
Contributor

@neagle
Copy link
Author

neagle commented Oct 12, 2021

Yup, it's very clear from the evidence in the submission that this is an error. We just need to examine what the most appropriate way of fixing this is. @vash3g, you've mentioned that this should be fixed through the ratings tools and not by directly modifying the DB -- that makes intuitive sense, but what are the problems or issues with directly modifying the data, out of curiosity?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Bugs in our AGAGD system.
Projects
None yet
Development

No branches or pull requests

4 participants