1.3.3: Custom Word Tokenizer
Add a new operation: CustomWordTokenizer
This operation tokenizes the input string according to the given user words.
If the substring of the input string matches the user words, it would be chunked as a single token.
Otherwise, the substring would be tokenized as a list of characters.