Skip to content
Change the repository type filter

All

    Repositories list

    • Scripts for training large-scale monolingual speech foundation models with 158K hours of Finnish speech
      Jupyter Notebook
      Apache License 2.0
      0410Updated Oct 15, 2024Oct 15, 2024
    • Scripts for training colloquial Finnish wav2vec2 models
      Jupyter Notebook
      0000Updated Oct 15, 2024Oct 15, 2024
    • Scripts for adapting large speech foundation models for Northern Sámi ASR
      Python
      Apache License 2.0
      0100Updated Sep 3, 2024Sep 3, 2024
    • mtkd4ser

      Public
      Multi-Teacher Language-Aware Knowledge Distillation for Speech Emotion Recognition
      Python
      MIT License
      1000Updated Aug 31, 2024Aug 31, 2024
    • MuSe-2024

      Public
      Multimodal Humor Detection and Social Perception Prediction
      Shell
      MIT License
      0000Updated Aug 27, 2024Aug 27, 2024
    • Scripts and jupyter notebooks to process and analyse ITE typing dataset
      Jupyter Notebook
      MIT License
      0000Updated Aug 27, 2024Aug 27, 2024
    • Python
      Apache License 2.0
      0000Updated Aug 19, 2024Aug 19, 2024
    • A test suite to assess the knowledge of morphology in LLMs
      Python
      0000Updated Jul 22, 2024Jul 22, 2024
    • slurpfood

      Public
      Repository containing scripts for ...
      Python
      0100Updated Jul 16, 2024Jul 16, 2024
    • dbca

      Public
      Distribution-based compositionality assessment of natural language corpora
      Python
      1000Updated Jul 11, 2024Jul 11, 2024
    • Aalto ASR preprocessing tool for preparing texts.
      Python
      MIT License
      02010Updated Feb 1, 2024Feb 1, 2024
    • Tools for downloading and processing Finnish parliament data
      Python
      MIT License
      23014Updated Feb 1, 2024Feb 1, 2024
    • Python
      0200Updated Nov 20, 2023Nov 20, 2023
    • Python
      MIT License
      5000Updated Oct 16, 2023Oct 16, 2023
    • Speech Recognition experiments combining Lahjoita Puhetta with Finnish Parliament
      Python
      2100Updated Oct 5, 2023Oct 5, 2023
    • Implementations for the Matched Encoder and Equal Data comparisons of HMM/DNN and Attention-based ASR systems
      0000Updated Sep 19, 2023Sep 19, 2023
    • Custom 🤗 Transformers for training multi-task wav2vec2 models that perform ASR and speech classification tasks simultaneously as described in Getman, Y., Al-Ghezi, R., Grósz, T., Kurimo, M. (2023) Multi-task wav2vec2 Serving as a Pronunciation Training System for Children.
      Python
      Apache License 2.0
      27k200Updated Aug 29, 2023Aug 29, 2023
    • Code repository for the experiments conducted for the ComParE 2023 challenge.
      Python
      0100Updated Jul 11, 2023Jul 11, 2023
    • Python
      2000Updated Jun 13, 2023Jun 13, 2023
    • scripts and images for article "Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model"
      Python
      MIT License
      0820Updated May 30, 2023May 30, 2023
    • Kaldi + SpeechBrain + W2V2 models for Northern Sami
      Python
      1400Updated May 29, 2023May 29, 2023
    • This directory runs NeMo on Puhti
      Shell
      0000Updated Mar 30, 2023Mar 30, 2023
    • How to setup a Kaldi and SpeechBrain environment on CSC Puhti
      Shell
      0000Updated Mar 30, 2023Mar 30, 2023
    • Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
      Python
      MIT License
      15019Updated Mar 28, 2023Mar 28, 2023
    • A notebook that looks at results from HMM and AED ASR systems.
      Jupyter Notebook
      1000Updated Mar 21, 2023Mar 21, 2023
    • Librispeech HMM/DNN and AED SpeechBrain experiments
      Python
      2000Updated Feb 8, 2023Feb 8, 2023
    • SpeechBrain recipes for Finnish Parliament data - HMM/DNN
      Python
      1000Updated Feb 6, 2023Feb 6, 2023
    • AED implementations for Finnish parliament Train20 (Includes Train16 and Train Comb as well)
      Python
      1000Updated Feb 6, 2023Feb 6, 2023
    • Implementation of automatic speech rating systems for second language (L2) learners of Finnish and Finland Swedish
      Jupyter Notebook
      1100Updated Dec 22, 2022Dec 22, 2022
    • A collection of resources related to the Lahjoita puhetta speech corpus.
      0100Updated Sep 22, 2022Sep 22, 2022