Naive Bayes with KDE (Kernel Density Estimation)

This method has two main features; I'll introduce them non-technically:

  1. It makes no assumptions about the distribution of the data: the class-conditional probabilities are estimated with KDE, so the results should be more reliable. Preliminary tests show that this method outperforms the Gaussian/Multinomial Naive Bayes classifiers provided by scikit-learn (I'll upload the test details later). A rough sketch of the idea follows this list.
  2. It has a memory to 'remember' the things it learns. This allows it to, first, keep learning while it works, and second, forget samples that are too old.
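
To make the first point concrete, here is a minimal sketch of a KDE-based Naive Bayes classifier using scipy's gaussian_kde. This is only an illustration of the idea, not this repository's implementation; the class and attribute names are made up.

import numpy as np
from scipy.stats import gaussian_kde

# Sketch: estimate one density per class with KDE, then predict by
# maximizing log-prior + log-likelihood. Illustrative only, not nb.NB's code.
class KDENaiveBayes:
    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y)
        self.classes_ = np.unique(y)
        self.priors_ = {c: float(np.mean(y == c)) for c in self.classes_}
        # gaussian_kde expects data of shape (n_features, n_samples);
        # each class needs at least two distinct samples.
        self.kdes_ = {c: gaussian_kde(X[y == c].T) for c in self.classes_}
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        preds = []
        for x in X:
            scores = {c: np.log(self.priors_[c])
                         + np.log(self.kdes_[c](x.reshape(-1, 1))[0])
                      for c in self.classes_}
            preds.append(max(scores, key=scores.get))
        return preds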

Quick Start

For example, suppose I have some data about programmers' heights and their programming skill levels, and I want to use height to predict whether someone is a good programmer (I'm just kidding).

>>> import nb
>>> clf = nb.NB()
>>> X = [[169], [172], [185], [182], [162], [160], [190], [192]]
>>> y = ['guru', 'guru', 'beginner', 'beginner', 'ok', 'ok', 'guru', 'guru']
>>> clf.fit(X, y)

This builds the classifier. Then I enter my height:

>>> clf.predict([171])
'guru'

It can keep learning after this point. If you have a new training case, just call fit again:

>>> clf.fit([[200]], ['super'])
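
The 'memory' feature described earlier could look roughly like this, building on the KDENaiveBayes sketch above. The buffer size and the refit-on-every-fit behaviour are assumptions made for illustration, not this repository's actual mechanism.

from collections import deque

class ForgettingKDENB(KDENaiveBayes):
    def __init__(self, max_samples=1000):
        # Bounded buffer: once full, the oldest (x, label) pairs drop out,
        # which is one simple way to 'forget things that are too old'.
        # max_samples is a hypothetical knob, not part of nb.NB's API.
        self.buffer = deque(maxlen=max_samples)

    def fit(self, X, y):
        self.buffer.extend(zip(X, y))           # learn while working
        Xs, ys = zip(*self.buffer)
        return super().fit(list(Xs), list(ys))  # refit on remembered samples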

More details to be added

TODOs

  1. There's no bandwidth selection in the KDE currently. I'll fix it ASAP (one common approach is sketched after this list).
  2. It's slow; more optimization is needed.
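
For the first TODO, one standard option is to choose the bandwidth by cross-validated grid search. Here is a sketch using scikit-learn's KernelDensity on the Quick Start heights; it shows one possible approach, not a commitment to how it will be done here.

import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KernelDensity

heights = np.array([[169], [172], [185], [182], [162], [160], [190], [192]])
# Pick the bandwidth that maximizes held-out log-likelihood.
grid = GridSearchCV(KernelDensity(kernel='gaussian'),
                    {'bandwidth': np.logspace(-1, 1.5, 20)},
                    cv=4)
grid.fit(heights)
print(grid.best_params_['bandwidth'])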