janetlauyeung

🦦

Janet Liu janetlauyeung

🦦

postdoc @mainlp

14 followers · 11 following

LMU Munich
Munich, Germany
janetlauyeung.github.io
@janetlauyeung

Achievements

Highlights

Stars

jl908069 / gum_sum_salience

Python 1 Updated Jan 17, 2025

hannamw / GP-mechanisms

Python 4 1 Updated Dec 22, 2024

huggingface / evaluation-guidebook

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 970 59 Updated Jan 7, 2025

linatal / rhetorical_UNSC

Dataset, amendment for RST annotation guidelines, and code for analysis experiments for the paper "Rhetorical Strategies in the UN Security Council: Rhetorical Structure Theory and Conflicts".

Jupyter Notebook 3 Updated Jan 6, 2025

kanishkamisra / minicons

Utility for behavioral and representational analyses of Language Models

Python 127 32 Updated Jan 23, 2025

DA-southampton / NLP_ability

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识，包括面试题，各种基础知识，工程能力等等，提升核心竞争力

Python 7,043 1,192 Updated Aug 24, 2022

disrpt / latest

Latest data for the multilingual DISRPT discourse benchmark

Python 1 Updated Jul 29, 2024

EducationalTestingService / rstfinder

Fast Discourse Parser to find latent Rhetorical STructure (RST) in text.

Python 124 24 Updated Oct 30, 2023

aryamanarora / causalgym

CausalGym: Benchmarking causal interpretability methods on linguistic tasks

Python 40 5 Updated Nov 30, 2024

google-research / bleurt

BLEURT is a metric for Natural Language Generation based on transfer learning.

Python 712 86 Updated Aug 4, 2023

parrt / random-forest-importances

Code to compute permutation and drop-column importances in Python scikit-learn models

Jupyter Notebook 606 131 Updated Sep 29, 2024

gucorpling / gentle

Repository for the GENTLE corpus

Python 4 Updated Jan 7, 2025

carriex / lfqa_eval

ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"

HTML 20 1 Updated Mar 22, 2024

gucorpling / DisCoDisCo

GUCorpling's DISRPT 2021 shared task submission

Python 6 1 Updated Mar 6, 2024

XuhuiZhou / cobra-frames

The official code repo of paper: COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements(https://arxiv.org/abs/2306.01985)

HTML 6 1 Updated Jul 20, 2023

facebookresearch / anli

Adversarial Natural Language Inference Benchmark

Python 393 45 Updated May 12, 2022

jennhu / lm-pragmatics

Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"

Jupyter Notebook 11 7 Updated Dec 14, 2022

qinyiwei / T5Score

Python 3 1 Updated Feb 18, 2023

google-research-datasets / seahorse

Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 quality dimensions: comprehensibility, repetition, grammar, a…

86 13 Updated Feb 27, 2024