Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 783 Bytes

README.md

File metadata and controls

7 lines (4 loc) · 783 Bytes

Source-free-domain-adaptation

Data sharing restrictions are common in NLP datasets. For example, Twitter policies do not allow sharing of tweet text, though tweet IDs may be shared. The situation is even more common in clinical NLP, where patient health information must be protected, and annotations over health text, when released at all, often require the signing of complex data use agreements

The goal is to develop an accurate system for a target domain when annotations exist for a related domain but cannot be distributed. Instead of annotated training data, participants are given a model trained on the annotations.

Currently baseline models and data are provided inline with https://github.com/Machine-Learning-for-Medical-Language/source-free-domain-adaptation.git