2. Remove exact metadata duplicates

Ingrid M. Angel Benavides edited this page Dec 29, 2020 · 5 revisions
  • Find exact metadata duplicates: profiles with the same latitude, longitude, and date (script box_meta_dup.m)

  • Compare the profile contents using the SbS algorithm

  • > 95%: the pair is a content duplicate
  • > 75%: the decision is made visually
  • Decision
  • If it is a content duplicate: delete the worse profile (or the second one if they are identical). Both the profile content properties (function prof_comppc.m) and the profile origin (qclevel and source, function prof_comppc.m) are compared. The profile with the best content is preferred over the one with the best qclevel.
  • If it is not a content duplicate: delete both. The presence of these duplicates probably indicates an error in the profiles, such as the one described in the MOCCA report (see the formatted results file).
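
The steps above can be sketched roughly as follows. This is an illustrative Python sketch, not the code in box_meta_dup.m or prof_comppc.m. The function names, the dictionary keys `lat`, `lon`, and `date`, and the `(content_score, origin_score)` tuples are all hypothetical. It also assumes that the percentages are a similarity value already computed by the SbS comparison, that anything at or below 75% counts as not a content duplicate, and that a higher score means a better profile.

```python
from collections import defaultdict

# Thresholds from the list above, read here as a similarity percentage (assumption).
CONTENT_DUP_PCT = 95.0
VISUAL_CHECK_PCT = 75.0


def group_exact_metadata_duplicates(profiles):
    """Return lists of profile indices sharing exactly the same lat, lon and date."""
    groups = defaultdict(list)
    for i, p in enumerate(profiles):
        groups[(p["lat"], p["lon"], p["date"])].append(i)
    # Keep only the groups that actually contain duplicates.
    return [idx for idx in groups.values() if len(idx) > 1]


def classify_pair(similarity_pct):
    """Map a similarity percentage to one of the three outcomes."""
    if similarity_pct > CONTENT_DUP_PCT:
        return "content_duplicate"
    if similarity_pct > VISUAL_CHECK_PCT:
        return "visual_decision"
    # Assumption: the text does not say what happens at or below 75%.
    return "not_content_duplicate"


def profiles_to_delete(pair, is_content_duplicate, scores):
    """Return the indices to delete for a pair of exact metadata duplicates.

    scores maps index -> (content_score, origin_score), both hypothetical.
    Tuples compare content first, so content outranks origin (qclevel, source).
    """
    first, second = pair
    if not is_content_duplicate:
        # Same metadata but different content: both profiles are suspect.
        return [first, second]
    # Delete the worse profile; on a tie, delete the second one.
    return [first] if scores[first] < scores[second] else [second]
```

For a pair classified as `visual_decision`, the value of `is_content_duplicate` would come from the person inspecting the profiles rather than from the threshold.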