Label distribution learning (LDL) and label enhancement (LE) toolkit implemented in Python, including:

- LDL algorithms:
  - (Geng, Yin, and Zhou 2013) [TPAMI]: `CPNN`$^1$.
  - (Geng and Hou 2015) [IJCAI]: `LDSVR`.
  - ⭐(Geng 2016) [TKDE]: `SA_BFGS`, `SA_IIS`, `AA_KNN`, `AA_BP`, `PT_Bayes`, and `PT_SVM`.
  - (Yang, Sun, and Sun 2017) [AAAI]: `BCPNN` and `ACPNN`.
  - (Xu and Zhou 2017) [IJCAI]: `IncomLDL`$^2$.
  - (Shen et al. 2017) [NeurIPS]: `LDLF`.
  - (Jia et al. 2018) [AAAI]: `LDLLC`$^\dagger$.
  - (Ren et al. 2019a) [IJCAI]: `LDLSF`.
  - (Ren et al. 2019b) [IJCAI]: `LDL_LCLR`$^\dagger$.
  - (Wang and Geng 2019) [IJCAI]: `LDL4C`$^{3\dagger}$.
  - (Shen et al. 2020) [Journal of Nanjing University of Science and Technology (Chinese)]: `AdaBoostLDL`.
  - (González et al. 2021a) [Inf. Sci.]: `SSG_LDL`$^4$.
  - (González et al. 2021b) [Inf. Fusion]: `DF_LDL`.
  - (Wang and Geng 2021a) [IJCAI]: `LDL_HR`$^{3\dagger}$.
  - (Wang and Geng 2021b) [ICML]: `LDLM`$^{3\dagger}$.
  - (Jia et al. 2021) [TKDE]: `LDL_SCL`.
  - (Jia et al. 2023a) [TKDE]: `LDL_LRR`$^\dagger$.
  - (Jia et al. 2023b) [TNNLS]: `LDL_DPA`$^\dagger$.
  - (Wen et al. 2023) [ICCV]: `CAD`$^1$, `QFD2`$^1$, and `CJS`$^1$.
  - (Li and Chen 2024) [IJCAI]: `WInLDL`$^2$.
  - (Kou et al. 2024) [IJCAI]: `TLRLDL`$^\dagger$ and `TKLRLDL`$^\dagger$.
  - (Wu, Li, and Jia 2024) [TBD]: `LDL_DA`$^5$.
- LE algorithms:
  - (Xu, Liu, and Geng 2019) [TKDE]: `FCM`, `KM`, `LP`, `ML`, and `GLLE`.
  - (Xu et al. 2020) [ICML]: `LEVI`.
  - (Zheng, Zhu, and Tang 2023) [CVPR]: `LIBLE`.
- LDL metrics: `chebyshev`, `clark`, `canberra`, `kl_divergence`, `cosine`, `intersection`, etc.
- Structured LDL datasets: Human_Gene, Movie, Natural_Scene, s-BU_3DFE, s-JAFFE, Yeast, etc.
- LDL applications:
  - Facial emotion recognition (supported datasets: JAFFE and BU-3DFE).
  - (Shirani et al. 2019) [ACL]: Emphasis selection (supported datasets: SemEval2020; pre-trained GloVe embeddings can be downloaded here).
  - (Wu et al. 2019) [ICCV]: Lesion counting (supported datasets: ACNE04).
  - (Chen et al. 2020) [CVPR]: Facial emotion recognition with auxiliary label space graphs (supported datasets: CK+; OpenFace can be downloaded here, and the required models can be downloaded here).
$^1$ Technically, these methods are only suitable for totally ordered labels.
$^2$ These are algorithms for incomplete LDL, so you should use `pyldl.utils.random_missing` to generate the missing label distribution matrix and the corresponding mask matrix in the experiments. Here is a demo on using the incomplete LDL algorithms.
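A minimal sketch of such an experiment is shown below; the exact signatures of `random_missing` and of the model's `fit` are assumptions for illustration, not the documented API:

```python
from pyldl.utils import load_dataset, random_missing
from pyldl.algorithms import IncomLDL

X, y = load_dataset('SJAFFE')

# Assumed signature: mask out a fraction of the entries of the label
# distribution matrix, returning the incomplete matrix and its mask.
y_missing, mask = random_missing(y, missing_rate=.2)

model = IncomLDL()
model.fit(X, y_missing, mask=mask)  # assumed: the mask is passed to fit
y_recovered = model.predict(X)
```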
$^3$ These are LDL classifiers, so you should use `predict_proba` to get label distributions and `predict` to get predicted labels.
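For example (a sketch using `LDL4C` only as a representative classifier, with the train/test split from the quick-start example below):

```python
from pyldl.algorithms import LDL4C

model = LDL4C()
model.fit(X_train, y_train)
distributions = model.predict_proba(X_test)  # label distributions
labels = model.predict(X_test)               # hard label predictions
```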
$^4$ These are oversampling algorithms for LDL, so you should use `fit_transform` to generate synthetic samples.
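For example (a sketch assuming `fit_transform` takes the original `(X, y)` and returns the augmented data):

```python
from pyldl.algorithms import SSG_LDL

# Assumed return value: the original data plus synthetic samples.
X_resampled, y_resampled = SSG_LDL().fit_transform(X, y)
```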
$^5$ To use domain adaptation methods for LDL, you need to provide the source domain data via the parameters `sX` and `sy` of the `fit` method. Here is a demo on domain adaptation for LDL.
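A minimal sketch, where the `sX`/`sy` parameter names come from the note above and everything else (the data variables, the target domain going into the positional arguments) is an assumption:

```python
from pyldl.algorithms import LDL_DA

model = LDL_DA()
# Target-domain data as the positional arguments, source-domain data via sX/sy.
model.fit(X_target, y_target, sX=X_source, sy=y_source)
y_pred = model.predict(X_target)
```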
$^\dagger$ These methods involve imposing constraints on model parameters, like regularization. Therefore, it is recommended to carefully tune the hyperparameters and apply feature preprocessing techniques like `StandardScaler` or `MinMaxScaler` before conducting experiments to achieve the expected performance.
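For example, standardizing features with scikit-learn before fitting (this uses the standard scikit-learn API and is not PyLDL-specific; `model` stands for any PyLDL estimator):

```python
from sklearn.preprocessing import StandardScaler

# Fit the scaler on the training features only, then apply it to both splits.
scaler = StandardScaler().fit(X_train)
model.fit(scaler.transform(X_train), y_train)
y_pred = model.predict(scaler.transform(X_test))
```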
PyLDL is now available on PyPI. Use the following command to install.
```bash
pip install python-ldl
```
To install the newest version, you can clone this repo and run the `setup.py` file:

```bash
python setup.py install
```
Here is an example of using PyLDL:

```python
from sklearn.model_selection import train_test_split

from pyldl.utils import load_dataset
from pyldl.algorithms import SA_BFGS
from pyldl.metrics import score

# Load a built-in dataset and split it into training and test sets.
dataset_name = 'SJAFFE'
X, y = load_dataset(dataset_name)
X_train, X_test, y_train, y_test = train_test_split(X, y)

# Train an LDL model and evaluate the predicted label distributions.
model = SA_BFGS()
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
print(score(y_test, y_pred))
```
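Individual metrics can also be used on their own; a sketch assuming each metric listed above is importable from `pyldl.metrics` and, like `score`, takes `(y_true, y_pred)`:

```python
from pyldl.metrics import chebyshev, kl_divergence, cosine

print(chebyshev(y_test, y_pred))      # distance: lower is better
print(kl_divergence(y_test, y_pred))  # distance: lower is better
print(cosine(y_test, y_pred))         # similarity: higher is better
```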
For those who would like to use the original implementation:

- Install MATLAB.
- Install the MATLAB Engine API for Python.
- Download the LDL Package here.
- Locate the package directory of PyLDL (...\Lib\site-packages\pyldl).
- Place the LDLPackage_v1.2 folder into the matlab_algorithms folder.
Now, you can load the original implementation of the method, e.g.:
```python
from pyldl.matlab_algorithms import SA_IIS
```
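Assuming the MATLAB-backed classes follow the same `fit`/`predict` interface as their Python counterparts (an assumption, not a documented guarantee), usage then looks the same as in the quick-start example:

```python
model = SA_IIS()  # MATLAB-backed implementation
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
```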
You can visualize the performance of any model on the artificial dataset (Geng 2016) with the `pyldl.utils.plot_artificial` function, e.g.:

```python
from pyldl.algorithms import LDSVR, SA_BFGS, SA_IIS, AA_KNN, PT_Bayes, GLLE, LIBLE
from pyldl.utils import plot_artificial

methods = [LDSVR, SA_BFGS, SA_IIS, AA_KNN, PT_Bayes, GLLE, LIBLE]

# Plot the ground-truth distribution, then each model's predictions.
plot_artificial(model=None, figname='GT')
for method in methods:
    plot_artificial(model=method(), figname=method.__name__)
```
The output images show the ground truth ('GT') followed by the predictions of LDSVR, SA_BFGS, SA_IIS, AA_KNN, PT_Bayes, GLLE, and LIBLE.
Enjoy! :)
For each algorithm, ten-fold cross validation is repeated 10 times on the s-JAFFE dataset, and the average metrics are recorded. Therefore, the results do not fully describe the performance of the models.

Our results are as follows.
Algorithm | Cheby.(↓) | Clark(↓) | Can.(↓) | K-L(↓) | Cos.(↑) | Int.(↑) |
---|---|---|---|---|---|---|
SA-BFGS | .092 ± .010 | .361 ± .029 | .735 ± .060 | .051 ± .009 | .954 ± .009 | .878 ± .011 |
SA-IIS | .100 ± .009 | .361 ± .023 | .746 ± .050 | .051 ± .008 | .952 ± .007 | .873 ± .009 |
AA-kNN | .098 ± .011 | .349 ± .029 | .716 ± .062 | .053 ± .010 | .950 ± .009 | .877 ± .011 |
AA-BP | .120 ± .012 | .426 ± .025 | .889 ± .057 | .073 ± .010 | .931 ± .010 | .848 ± .011 |
PT-Bayes | .116 ± .011 | .425 ± .031 | .874 ± .064 | .073 ± .012 | .932 ± .011 | .850 ± .012 |
PT-SVM | .117 ± .012 | .422 ± .027 | .875 ± .057 | .072 ± .011 | .932 ± .011 | .850 ± .011 |
Results of the original MATLAB implementation (Geng 2016) are as follows.
Algorithm | Cheby.(↓) | Clark(↓) | Can.(↓) | K-L(↓) | Cos.(↑) | Int.(↑) |
---|---|---|---|---|---|---|
SA-BFGS | .107 ± .015 | .399 ± .044 | .820 ± .103 | .064 ± .016 | .940 ± .015 | .860 ± .019 |
SA-IIS | .117 ± .015 | .419 ± .034 | .875 ± .086 | .070 ± .012 | .934 ± .012 | .851 ± .016 |
AA-kNN | .114 ± .017 | .410 ± .050 | .843 ± .113 | .071 ± .023 | .934 ± .018 | .855 ± .021 |
AA-BP | .130 ± .017 | .510 ± .054 | 1.05 ± .124 | .113 ± .030 | .908 ± .019 | .824 ± .022 |
PT-Bayes | .121 ± .016 | .430 ± .035 | .904 ± .086 | .074 ± .014 | .930 ± .016 | .846 ± .016 |
PT-SVM | .127 ± .017 | .457 ± .039 | .935 ± .074 | .086 ± .016 | .920 ± .014 | .839 ± .015 |
PyLDL requires the following dependencies:

```
matplotlib>=3.6.1
numpy>=1.22.3
qpsolvers>=4.0.0
quadprog>=0.1.11
scikit-fuzzy>=0.4.2
scikit-learn>=1.0.2
scipy>=1.8.0
tensorflow>=2.8.0
tensorflow-probability>=0.16.0
```