VyomAI

Transfomer models implementation from scratch using pytorch to make it more accessible for research purposes.

The best way to understand is learning by doing.

Examples

Each example is in one single notebook for readability and understanding

**Each example implemented from scratch using Pytorch 2.0 **

Task	dataset link	Pyotch 2.0	description
`text classification`	clinc_oos	✅	encoder model for text classification
`masked language modeling`	clinc_oos	✅	encoder model pretraining with mlm style
`electra language modeling`	clinc_oos	✅	encoder model pretraining with electra style
`casual language-modeling`	mark-twain-books	✅	decoder model pretraining with gpt style and kv-cache for fast inference
`knowledge distilation`	clinc_oos	✅	initilization of a model from pretrained model
`seq2seq modeling`	Flicker-30k	✅	seq2seq model training for image caption with kv-cache
`adapters`	clinc_oos	✅	Lora and Dora for parameter efficient tunning
`vit`	Scene-classification	✅	visual image transformer for image classification
`detr`	Global-Wheat-Detection	✅	implementation of detr DEtection TRansformer encoder decoder model for object detection
`clip`	Flicker-30k	✅	implementation of contrastive language-image pre-training
`vision language multimodel-I`	COCO	✅	A minimmal vision-language model implementation with image-text fusion to generate image caption with RoPE and kv-cache
`vision language multimodel-II`	COCO	✅	A Multimodel implementation with image-text fusion to generate caption of image with RoPE and kv-cache which can we extended to visual question answering, open vocabulary object detection, optical character recognition
`Paligemma`	Flicker-30k	✅	Scratch implementation of Paligemma a Multimodel from Google-AI
**More to come

Usage

from VyomAI import EncoderModel, EncoderForMaskedLM
from VyomAI import EncoderConfig
config = EncoderConfig()
encoder = EncoderModel(config,pos_embedding_type='rope')
#pos_embedding_type supported: Absolute, sinusoidal, RoPE
#attention_type supported: gqa, Vanila

More About VyomAI

Learn the basics of PyTorch

At a granular level, it support the following components:

Component	Description
Encoder	Text encoder model with Bert like architecture that support absolute, sin,Rope embedding and GQA , Vanila attention
Decoder	Text decoder model with GPT like architecture that support absolute, sin,RoPE embedding and GQA , Vanila attention and KV-Cache for fast inference
Seq2Seq	Model with Bart like architecture that support absolute, sin,RoPE embedding and GQA, Vanila attention and KV-Cache for fast inference encoder can be text or image type
VisionEncoder	Model with Vit like architecture for image encoding **more to come
Multimodel	A Minimal vision-language model **more to come

Improvement and Contribution

We appreciate all contributions. If you want to contribute new features, utility functions, or tutorials please open an issue and discuss the feature with us.

Resources

Some helpfull learning resources

[1] https://www.youtube.com/@stanfordonline
[2] https://d2l.ai/
[3] https://pytorch.org/tutorials/

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
Examples		Examples
VyomAI		VyomAI
tests		tests
.coverage		.coverage
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VyomAI

Examples

Usage

More About VyomAI

Improvement and Contribution

Resources

References

About

Releases

Packages

Languages

License

Ajax0564/VyomAI

Folders and files

Latest commit

History

Repository files navigation

VyomAI

Examples

Usage

More About VyomAI

Improvement and Contribution

Resources

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages