Skip to content

Best practice regarding outliers for dynamic topic modeling #1738

Answered by MaartenGr
serenalotreck asked this question in Q&A
Discussion options

You must be logged in to vote

HDBSCAN can be quite strict in assigning outliers. Therefore, reduce_outliers was introduced to reduce its impact. In practice, you would have to check yourself whether the new assignments make sense by inspecting a subset manually. Do note that this function can also reduce some outliers and it's actually advised not to reduce all of them.

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@lintonye
Comment options

@MaartenGr
Comment options

@lintonye
Comment options

@lintonye
Comment options

@MaartenGr
Comment options

Answer selected by serenalotreck
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants