ThuCCSLab / Awesome-LM-SSP Public

Notifications You must be signed in to change notification settings
Fork 70
Star 1.1k

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

github.com/thuccslab/awesome-lm-ssp

Apache-2.0 license

1.1k stars 70 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 422 Commits
collection		collection
figure		figure
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Repository files navigation

Awesome-LM-SSP

Introduction

The resources related to the trustworthiness of large models (LMs) across multiple dimensions (e.g., safety, security, and privacy), with a special focus on multi-modal LMs (e.g., vision-language models and diffusion models).

This repo is in progress 🌱 (manually collected).
Badges:
- Model:
- Comment: ...
- Venue: ...
🌻 Welcome to recommend resources to us via pulling requests or opening issues with the following format:

Title	Link	Code	Venue	Classification	Model	Comment
aa	arxiv	github	bb'23	A1. Jailbreak	LLM	Agent

News

[2024.08.17] We collected 34 related papers from ACL'24!
[2024.05.13] We collected 7 related papers from S&P'24!
[2024.04.27] We adjusted the categories.
[2024.01.20] We collected 3 related papers from NDSS'24!
[2024.01.17] We collected 108 related papers from ICLR'24!
[2024.01.09] 🚀 LM-SSP is released!

Collections

Book (2)
Competition (5)
Leaderboard (3)
Toolkit (10)
Survey (33)
Paper (1297)
- A. Safety (716)
  - A0. General (17)
  - A1. Jailbreak (287)
  - A2. Alignment (75)
  - A3. Deepfake (58)
  - A4. Ethics (5)
  - A5. Fairness (54)
  - A6. Hallucination (109)
  - A7. Prompt Injection (43)
  - A8. Toxicity (68)
- B. Security (200)
  - B0. General (7)
  - B1. Adversarial Examples (83)
  - B2. Poison & Backdoor (96)
  - B3. System (14)
- C. Privacy (381)
  - C0. General (28)
  - C1. Contamination (13)
  - C2. Copyright (132)
  - C3. Data Reconstruction (44)
  - C4. Membership Inference Attacks (34)
  - C5. Model Extraction (10)
  - C6. Privacy-Preserving Computation (72)
  - C7. Property Inference Attacks (3)
  - C8. Unlearning (45)

Star History

Acknowledgement

Organizers: Tianshuo Cong (丛天硕), Xinlei He (何新磊), Zhengyu Zhao (赵正宇), Yugeng Liu (刘禹更), Delong Ran (冉德龙)
This project is inspired by LLM Security, Awesome LLM Security, LLM Security & Privacy, UR2-LLMs, PLMpapers, EvaluationPapers4ChatGPT

About

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

github.com/ThuCCSLab/Awesome-LM-SSP

nlp security privacy jailbreak safety awesome-list language-model vlm adversarial-attacks diffusion-models llm

Apache-2.0 license

Report repository

Contributors 9