Skip to content

chrissyrissmeyer/ld4-community-recon

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

LD4 Community Working Group on Reconciliation

About

Linked Data for Libraries, Linked Data for Production, and Linked Data for Libraries Labs are a series of grant-funded projects working broadly on implementing Linked Data in library workflows across functional areas and domains. The umbrella effort is referred to as 'LD4'.

LD4 Community groups are a way to work with interested community members on a work area or topic that intersects with Linked Data and library work needs. They are open to participation from anyone and aim for the highest level of transparency in efforts, work, & outputs to help support a growing community of Linked Data implementers in libraries and other GLAM, cultural heritage, and memory institutions.

Conveners:

Arwen Hutt / ahutt@ucsd.edu ; @arwenhutt Christina Harlow / cmharlow@stanford.edu ; @cmh2166

Scope

(the following will be confirmed with participants at our kick-off call)

This LD4 Community group is an effort to:

  • generate a shared understanding of what reconciliation work entails, including use cases and user stories;
  • build collaboration for existing or emerging reconciliation tool development;
  • and outline pragmatic solutions including matching algorithms, workflows, tools, and functional requirements for Cultural Heritage/GLAM+ resource metadata reconciliation.

NB: Metadata here is used in an expansive way to cover not only traditional bibliographic metadata, but also descriptions for emerging resource types & metadata domains (rights, administrative, structural, etc.) as applicable to the group’s efforts.

NB: Reconciliation here is meant broadly and includes:

  • Reconciliation (in specific sense): entity/URI to entity/URI
  • Entity resolution: string to entity/URI (what most people are thinking about, it appears)
  • Lexicalization: entity/URI to string (i.e. getting a label)

The above is generated from LD4 Community Meeting Feedback, which included the following points:

  • “There were so many cool presentations and conversations today around reconciliation tools, I would love to see conversations around that continue”
  • “I have a lot of work queued up around reconciliation. lets work together? most is queued up in GH”
  • “We’ve been trying to compare reconciliation tools and would like to expand this with this group’s input”
  • Largest gap in tooling from the Tools breakout: Reconciliation listed

Participants

To be added as firms up.

The group is open to anyone who wants to contribute. Get in touch the conveners if you're uncertain where to start or how to jump in.

Working Areas

Proposed Outputs

Some ideas mentioned listed below, to be further developed at the kick off call:

  • List of existing reconciliation tools & a comparison of them
  • Identify common reconciliation use cases
  • Develop flexible (modular perhaps?) abstract workflow models for use cases
  • Create functional requirements for shared workflow components
  • Set of leveled (i.e. simplest to complex) reconciliation and matching algorithms for common reconciliation cases & data types (i.e. going from label look-ups based off various string matching algorithms to contextual data matching based off of string and other data type matching algorithms for multiple fields attached to a resource)
  • Work on light-weight tools for certain needs listed above to make proof of concepts or support intermediate work & review
  • (Meta) Model for LD4 / Other blackbox LODLAM efforts in community engagement & work

General Timeframe

(the following will be confirmed with participants at our kick-off call and continuously updated)

For group outputs: Focus on smaller deliverables with shorter timeframes (< 3 months), as well as parallel work to cover a few different areas of work at the same time. We will start planning these at the kick-off meeting.

For larger group planning: Plan on an assessment at 1 year (or earlier if things seem to not be working). The 1 year mark is a time to rotate conveners, choose new meeting times, refresh the goals, disband, morph, etc.

Workspace & Communication Channels

Primary Workspace (Notes, Outputs):

This GitHub repository https://github.com/ld4p/ld4-community-recon/ (Christina @cmh2166, cmharlow@stanford.edu can add you to the repository)

Communication:

(the following will be confirmed with participants at our kick-off call and continuously updated)

Questions or Comments?

Use Github Issues on this repository or get in touch with one of the conveners.

About

LD4 Community Working Group on Reconciliation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published