Skip to content

v0.3.0

Compare
Choose a tag to compare
@KoljaB KoljaB released this 03 Oct 00:00
· 76 commits to master since this release
a92e433

RealtimeSTT 0.3.0

New Features:

  • Soundcard Compatibility: Automatically adjusts from 48kHz downwards if 16kHz is unsupported, resampling to 16kHz.
  • Early Transcription: Added early_transcription_on_silence parameter to enable transcription during speech pauses, reducing overall latency.
  • Transcription Process Optimizations: Transcription process outsourced into separate class and optimized pipe communication for more stability and faster pipe communication, leading to fewer occurrances of audio chunks getting discarded due to queue size overflows.
  • Immediate Listen State: Fixed issue soi the system immediately returns to the listening state right after stopping the recording, preventing lost chunks.
  • Improved Logging: Always logs debug messages to a file, even if not explicitly configured. Option to disable logging with no_log_file parameter.
  • Transcription Time Display: New print_transcription_time parameter to show model processing time.

Bugfixes:

  • Chunk Handling: Enhanced chunk handling with the new allowed_latency_limit parameter, reducing dropped data during high-latency scenarios.