-
Notifications
You must be signed in to change notification settings - Fork 0
2. Remove exact metadata duplicates
Ingrid M. Angel Benavides edited this page Dec 29, 2020
·
5 revisions
-
Finds exact metadata duplicates: same latitude, longitude, date (script box_meta_dup.m)
-
Compare contents using SbS algorithm
- > 95% is content duplicate
- > 75% visual decision
- Decision
- If is content duplicate: Delete worst profile (or the second if they are identical). Both the profile content properties (function prof_comppc.m) and the profile origin (qclevel and source, function prof_comppc.m) are compared. The profile with the best content is preferred to the one with best qclevel)
- If is not content duplicate: Delete both. _ The presence of this duplicates probably indicate a mismatch between the profile content and its metadata, as the one that occurred in n error in the preparation of CTD-RDB 2012v01 described in the MOCCA report (see the formated results file) _