A very simple set of utilities to tokenize a string using Hugging Face bindings. The underlying library for the bindings is daulet/tokenizers.
tokenizer -h
Usage of tokenizer:
  -add_special_tokens
        Add special tokens
  -model string
        The path to the model
The CLI tokenizes a string using the Hugging Face bindings. It reads its input from STDIN.
The Lambda function tokenizes a string using the Hugging Face bindings. It is meant to be fronted by an AWS Application Load Balancer.
The HTTP server tokenizes a string using the Hugging Face bindings. It is a standalone server that can be run locally or in a container.