Report Matcher(paper under review)

/src: has the source code for the Supervised models

Supervised models

Each model has a corresponding folder containing source code to train and validate them.

SVM and Logistic Regression

contains training and inference pipeline for SVM and LR
Configuration template embedded in the workbook needs to be modified to be suitable to the model to train and validate

LLaMa

should run preparedata.py to create the datasets for training and validation
finetuning script: train.sh, inference script: inference.py
Prompts used/tested for the experiments: prompts.py
Meta-Llama-3-8B-Instruct was used for finetuning.
Needs alignment-handbook 0.4.0.dev0 and trl 0.9.6 The changes in the respective folders must be integrated to run the finetuning and inference.

Longformer

LF_crossval_withtestset.py is the script for finetuning and validation
Threshold_tuning folder has a script and notebook to analyze the best threshold by the fold and visualize the results

In-context learning using GPT-4

Baseline Prompt

Question:

You are a board-certified radiologist.

You will compare Report A and Report B. 
Your goal is to check whether if Report B is a proper follow-up for Report A.

At the end of your answer, you should include "True" or "False".
Your answer should be no longer than 5 sentences.

Answer:

Advanced Prompt

Question:

You are a board-certified radiologist.

You will compare Report A and Report B mostly focusing on information
from the sentence in Report A which explicitly suggests a follow-up examination.

Your goal is to check whether Report B is a proper follow-up of Report A.

While a proper follow-up does not always have to use the same imaging test,
same day evaluations are not considered as a correct follow-up.

Note: Modality types do not need to match the recommended imaging test,
if it can still qualify as a substitute.

After analyzing both reports, return a python list of two elements where you will
determine True or False for the following two issues respectively:

1) Reasonable timeframe (ignore the recommended timeframe and make the decision
based on your clinical expertise)
2) Provide updates for the recommendation from Report A.

At the end of your answer, you should include the following output: [True, False].
Your answer should be no longer than 5 sentences.

Answer: Let’s think step by step.

Significance tests

The first step is to consolidate the results in a single table
Run the rm_sigtest.py by passing the consolidated results table
When running the significance test code, create the bootstrap samples by enabling create_bootstrapsamples in the rm_sigtest.py code

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Report Matcher(paper under review)

Supervised models

SVM and Logistic Regression

LLaMa

Longformer

In-context learning using GPT-4

Baseline Prompt

Advanced Prompt

Significance tests

About

Releases

Packages

Contributors 2

Languages

uw-bionlp/ReportMatcher

Folders and files

Latest commit

History

Repository files navigation

Report Matcher(paper under review)

Supervised models

SVM and Logistic Regression

LLaMa

Longformer

In-context learning using GPT-4

Baseline Prompt

Advanced Prompt

Significance tests

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages