Change log for Ichigo v0.4:
- Unified Training Pipeline: Consolidated Phase 2 and Phase 3 into a single-phase training approach.
- Training data enhancements:
- Migrated speech noise data and speech multi-turn data from Phase 3 into Phase 2.
- Introduced noise-augmented multi-turn conversations: we synthetic by injecting noise turn in speech and text-only multi-turn datasets.
Performance Improvements vs v0.3:
- Enhanced Intelligence: Improved benchmark scores on MMLU (64.66).
- Extended Context Handling
- Advanced Noise Management: More robust rejection of noisy environmental inputs
- Improving Multi-turn Capabilities.
Model weight: https://huggingface.co/collections/homebrewltd/ichigo-v04-67317bde6dfdfdd55dddbc6e
Live demo at: https://ichigo.homebrew.ltd/