multimodel
Here are 31 public repositories matching this topic...
RMDL: Random Multimodel Deep Learning for Classification
Updated May 16, 2023 - Python
Awesome_Multimodel is a curated GitHub repository that collects resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
Updated Aug 18, 2024
YOLOv5, YOLOv8, segmentation, face, pose, and keypoints on DeepStream
Updated Dec 12, 2023 - Jupyter Notebook
🧘🏻‍♂️ KarmaVLM (相生): A family of highly efficient and powerful visual language models.
Updated Apr 29, 2024 - Python
This is our solution for KDD Cup 2020. We implemented a neat and simple neural ranking model based on Siamese BERT, which ranked first among solo teams and 12th among all teams on the final leaderboard.
Updated Jun 20, 2020 - Jupyter Notebook
OpenVINO+NCS2/NCS+MultiModel(FaceDetection, EmotionRecognition)+MultiStick+MultiProcess+MultiThread+USB Camera/PiCamera. RaspberryPi 3 compatible. Async.
Updated Feb 12, 2023 - Python
Accepted by TMM 2022
Updated Aug 18, 2022 - Python
ArangoGraph is the easiest way to run ArangoDB. Available on AWS, Google Cloud & Azure.
Updated Feb 26, 2024
End-to-End AI Voice Assistant pipeline with Whisper for Speech-to-Text, Hugging Face LLM for response generation, and Edge-TTS for Text-to-Speech. Features include Voice Activity Detection (VAD), tunable parameters for pitch, gender, and speed, and real-time response with latency optimization.
Updated Oct 19, 2024 - Jupyter Notebook
This project is a multimodal system that combines multiple models. It accepts audio, images, and text as inputs and generates corresponding audio, image, and text outputs.
Updated Feb 26, 2024 - Python
Robust particle filter based on dynamic averaging of multiple noise models
Updated Nov 14, 2019 - MATLAB
VyomAI: state-of-the-art NLP, LLM, vision, and multimodal transformer implementations in PyTorch
Updated Aug 24, 2024 - Python
The Pictionary app uses LLaMA 3.1 to generate random drawing prompts and LLaMA 3.2 Vision to predict and judge user drawings based on these prompts. It provides an interactive and fun way to test your drawing skills within a set time limit.
Updated Oct 3, 2024 - Python
📄 SemEval 2024 Task 8: Artificial Intelligence Text Detection System using Natural Language Processing and Neural Network techniques.
Updated Feb 17, 2024 - Jupyter Notebook
Implementation of HUSE: Hierarchical Universal Semantic Embeddings https://arxiv.org/pdf/1911.05978.pdf
Updated Feb 1, 2021 - Jupyter Notebook
Simplify time-consuming coding for the data scientist. Create beautiful charts, pandas transformers, and find the best model with the best parameters for your data.
Updated Aug 20, 2024 - Jupyter Notebook
Updated Jul 24, 2024 - Jupyter Notebook