A collection of resources on the trustworthiness of large models (LMs) across multiple dimensions (e.g., safety, security, and privacy), with a special focus on multi-modal LMs (e.g., vision-language models and diffusion models).
This repo is a work in progress 🌱 (manually collected).
🔥🔥🔥 Help us update the list! 🔥🔥🔥
- First, check papers through our database: Metadata of LM-SSP.
- If you want to update the information for a paper (e.g., an arXiv paper has been accepted by a venue), search for the paper title in our metadata table and leave a message in the corresponding cell.
- If you would like to add a paper, please fill in the following table through an ISSUE:
| Title | Link | Code | Venue | Classification | Model | Comment |
|---|---|---|---|---|---|---|
| This is a title | paper.com | github | bb'23 | A1. Jailbreak | LLM | Agent |
- [2025.01.09] 🎂 Happy 1st Birthday to Awesome-LM-SSP! Keep Going! 💪
- [2024.01.09] 🚀 LM-SSP is released!
- Book (3)
- Competition (5)
- Leaderboard (5)
- Toolkit (13)
- Survey (40)
- Paper (2327)
- A. Safety (1175)
- A0. General (30)
- A1. Jailbreak (528)
- A2. Alignment (145)
- A3. Deepfake (92)
- A4. Ethics (8)
- A5. Fairness (60)
- A6. Hallucination (116)
- A7. Prompt Injection (110)
- A8. Toxicity (86)
- B. Security (451)
- B0. General (16)
- B1. Adversarial Examples (105)
- B2. Agent (130)
- B3. Poison & Backdoor (175)
- B4. Side-Channel (1)
- B5. System (24)
- C. Privacy (701)
- C0. General (54)
- C1. Contamination (17)
- C2. Data Reconstruction (63)
- C3. Membership Inference Attacks (65)
- C4. Model Extraction (14)
- C5. Privacy-Preserving Computation (128)
- C6. Property Inference Attacks (7)
- C7. Side-Channel (10)
- C8. Unlearning (68)
- C9. Watermark & Copyright (275)
Organizers: Tianshuo Cong (丛天硕), Xinlei He (何新磊), Zhengyu Zhao (赵正宇), Yugeng Liu (刘禹更), Delong Ran (冉德龙)
This project is inspired by LLM Security, Awesome LLM Security, LLM Security & Privacy, UR2-LLMs, PLMpapers, and EvaluationPapers4ChatGPT.

