[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
An open-source Python toolbox for backdoor attacks and defenses.
Open-source framework for uncertainty and deep learning models in PyTorch 🌱
[ICML 2022 Long Talk] Official PyTorch implementation of "To Smooth or Not? When Label Smoothing Meets Noisy Labels"
Neural Network Verification Software Tool
A project that adds scalable, state-of-the-art out-of-distribution detection (open-set recognition) support by changing two lines of code. Inference stays efficient (no increase in inference time), and detection comes without a drop in classification accuracy, hyperparameter tuning, or collecting additional data.
[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory
Papers and online resources related to machine learning fairness
PyTorch package to train and audit ML models for Individual Fairness
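Individual fairness asks that similar individuals receive similar predictions. As a hedged illustration of what auditing that property can look like (a generic sketch, not this package's API; `model`, `x`, and `x_similar` are hypothetical placeholders):

```python
# Generic sketch of an individual-fairness spot check; NOT this package's
# API. Premise: similar individuals should receive similar predictions.
import torch

@torch.no_grad()
def fairness_gap(model: torch.nn.Module,
                 x: torch.Tensor,
                 x_similar: torch.Tensor) -> float:
    """Largest output change between paired 'similar' inputs.

    Row i of x_similar is assumed to match row i of x except in attributes
    that should not matter (e.g., protected ones). A large gap flags a
    potential individual-fairness violation.
    """
    return (model(x) - model(x_similar)).abs().max().item()
```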
A project that improves out-of-distribution detection (open-set recognition) and uncertainty estimation by changing a few lines of code in your project. Inference stays efficient (no increase in inference time), with no repetitive model training, hyperparameter tuning, or additional data collection.
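This entry and the two-lines-of-code entry above both describe post-hoc OOD scores computed from a single, unmodified forward pass. As a hedged sketch of the general idea only (the classic maximum-softmax-probability baseline of Hendrycks and Gimpel, not either repo's actual method or API):

```python
# Illustrative baseline: maximum-softmax-probability OOD scoring.
# NOT the API of the repos above; model and x are placeholders.
import torch
import torch.nn.functional as F

@torch.no_grad()
def msp_ood_score(model: torch.nn.Module, x: torch.Tensor) -> torch.Tensor:
    """Per-example OOD score from one ordinary forward pass."""
    logits = model(x)                      # the pass you already run
    probs = F.softmax(logits, dim=-1)      # class probabilities
    return 1.0 - probs.max(dim=-1).values  # higher = more likely OOD
```

Because the score reuses the logits the classifier already computes, inference time and the argmax prediction are unchanged; a detection threshold is typically chosen on held-out in-distribution data.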
SyReNN: Symbolic Representations for Neural Networks
Privacy-Preserving Machine Learning (PPML) Tutorial
Framework for Adversarial Malware Evaluation.
A list of research papers of explainable machine learning.
A tool for comparing the predictions of arbitrary text classifiers.
A trustworthy AI method based on Dempster-Shafer theory, applied to fetal brain 3D T2w MRI segmentation.
Morphence: an implementation of a moving target defense against adversarial example attacks, demonstrated on image classifiers trained on MNIST and CIFAR-10.
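A minimal sketch of the moving-target idea, assuming a pre-built pool of diversified models (Morphence also renews its pool over time; this is an illustration, not its actual code):

```python
# Illustration of a moving target defense: answer each query with a model
# drawn at random from a pool, so no single fixed model can be attacked.
# NOT Morphence's actual implementation.
import random
import torch

class MovingTargetPool:
    def __init__(self, models: list[torch.nn.Module]):
        self.models = models  # assumed pre-trained, diversified pool

    @torch.no_grad()
    def predict(self, x: torch.Tensor) -> torch.Tensor:
        model = random.choice(self.models)  # fresh pick per query
        return model(x).argmax(dim=-1)
```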
[Findings of EMNLP 2022] Holistic Sentence Embeddings for Better Out-of-Distribution Detection
MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how the behaviour of two machine learning models differs.
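The raw material for such an explainer is the set of inputs on which the two models disagree. A hedged sketch of collecting those disagreements (scikit-learn-style `.predict()` assumed; this is not MERLIN's method, which builds global contrastive explanations on top of such data):

```python
# Hypothetical helper: gather inputs on which two text classifiers differ.
# NOT MERLIN itself; assumes scikit-learn-style .predict() on both models.
from typing import Any, Iterable, List, Tuple

def disagreements(clf_a: Any, clf_b: Any,
                  texts: Iterable[str]) -> List[Tuple[str, Any, Any]]:
    """Return (text, label_a, label_b) where the two models disagree."""
    diffs = []
    for text in texts:
        label_a = clf_a.predict([text])[0]
        label_b = clf_b.predict([text])[0]
        if label_a != label_b:
            diffs.append((text, label_a, label_b))
    return diffs
```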
"Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" by Chongyu Fan*, Jiancheng Liu*, Licong Lin*, Jinghan Jia, Ruiqi Zhang, Song Mei, Sijia Liu