Improve the SentencePiece default settings for chat:
character_coverage=0.99995
byte_fallback=True
normalization_rule_name="identity"
remove_extra_whitespaces=False
input_sentence_size=2000000000
split_digits=True
split_by_unicode_script=True
split_by_whitespace=True
split_by_number=True
allow_whitespace_only_pieces=True
Reference: https://www.youtube.com/watch?v=zduSFxRajkE&t=4925s
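
A minimal sketch of how the settings proposed in this issue could be passed to the SentencePiece Python trainer. It uses the option spellings the trainer accepts (`remove_extra_whitespaces`, `split_digits`). The helper names, the `vocab_size=32000` and `model_type="bpe"` values, and the file names are assumptions for illustration only; they are not part of this issue.

```python
from pathlib import Path

# Settings proposed in this issue, keyed by SentencePiece trainer option names.
CHAT_TRAINER_DEFAULTS = {
    "character_coverage": 0.99995,
    "byte_fallback": True,                 # unknown characters fall back to UTF-8 byte pieces
    "normalization_rule_name": "identity", # leave the input text unnormalized
    "remove_extra_whitespaces": False,     # keep repeated spaces as they appear
    "input_sentence_size": 2_000_000_000,
    "split_digits": True,
    "split_by_unicode_script": True,
    "split_by_whitespace": True,
    "split_by_number": True,
    "allow_whitespace_only_pieces": True,
}


def build_trainer_kwargs(input_path, model_prefix, vocab_size=32000,
                         model_type="bpe", **overrides):
    """Merge the proposed defaults with per-run values (hypothetical helper).

    vocab_size and model_type are assumed values, not taken from the issue.
    """
    kwargs = dict(CHAT_TRAINER_DEFAULTS)
    kwargs.update(overrides)
    kwargs.update(
        input=str(input_path),
        model_prefix=model_prefix,
        vocab_size=vocab_size,
        model_type=model_type,
    )
    return kwargs


def train_chat_tokenizer(input_path, model_prefix, **overrides):
    """Train a tokenizer with the proposed settings (hypothetical helper)."""
    # Imported lazily so the settings can be inspected without sentencepiece installed.
    import sentencepiece as spm

    spm.SentencePieceTrainer.train(
        **build_trainer_kwargs(input_path, model_prefix, **overrides)
    )
    # The trainer writes <model_prefix>.model and <model_prefix>.vocab.
    return Path(f"{model_prefix}.model")
```

For example, `train_chat_tokenizer("corpus.txt", "chat_tok", vocab_size=8000)` would train on a local `corpus.txt` (a placeholder path) and write `chat_tok.model`.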