Commit

Update README.md
gitnlp authored Dec 27, 2023
1 parent 183931e commit d51f103
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
@@ -144,6 +144,8 @@ For LongNet models ([Flash Attention](https://github.com/Dao-AILab/flash-attenti
- [Retentive Network: A Successor to Transformer for Large Language Models](https://arxiv.org/abs/2307.08621)
* created by `config = RetNetConfig(vocab_size=64000)` and `retnet = RetNetDecoder(config)`.

- [LongNet: Scaling Transformers to 1,000,000,000 Tokens](https://arxiv.org/abs/2307.02486)

Most of the features above can be used by simply passing the corresponding parameters to the config. For example:

```python
