server : add speculative decoding support (ggerganov#10455)
* server : add speculative decoding support

  ggml-ci

* server : add helper function slot.can_speculate()

  ggml-ci