Skip to content

How to handle inappropriate word segmentation in English data? #1273

Discussion options

You must be logged in to vote

However, I have attached the file containing data related to "r," "ek," and "Ho," which I hope will be helpful. From the file, we can see that "your," "week," and "Houston" are all complete words in my data file.

Sorry for my poor English. But, can you reproduce the problem with the Sample.xlsx?

It would be helpful if you could provide a file that reproduces the problem.

Also, how about cleaning your data? A good place to start is the CLEAN function in Excel. Please try creating a new Excel file with CLEANed text and making a new KH Coder project with that new file.
https://www.educba.com/clean-in-excel/

Replies: 2 comments 7 replies

Comment options

You must be logged in to vote
1 reply
@xmgwzcn
Comment options

Comment options

You must be logged in to vote
6 replies
@xmgwzcn
Comment options

@ko-ichi-h
Comment options

@xmgwzcn
Comment options

@ko-ichi-h
Comment options

@xmgwzcn
Comment options

Answer selected by ko-ichi-h
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
2 participants