Skip to content

0.3.0

Latest
Compare
Choose a tag to compare
@soupslurpr soupslurpr released this 05 Jun 06:12
· 23 commits to master since this release
d931284

Transcribro's keyboard now adapts the transcript to the surrounding and selected text, allowing for easier editing without having to go back to other keyboards as much. Check out the description of the PR at #39 for an example of using it and information on how to make use of it properly. Even without reading, its been made seamless to use.

The keyboard also plays audio feedback when recognition gets toggled. Keep in mind that it will only play if your ringer is set to sound rather than vibrate or mute.

Next, the Whisper model being used was switched from base-en-q5_1 to tiny-en-q8_0, which means computing the transcripts is now a lot faster! There doesn't seem to be a noticeable quality loss in my limited testing.

The last major addition is an ACTION_RECOGNIZE_SPEECH activity, which allows other apps to ask Transcribro to launch on top of them, transcribe audio from the microphone, and then return the transcript to them.
Chromium, for example, uses this (by pressing the microphone button at the right of its search bar as of writing) to search, and seems to tell the search engine that speech was used to search. For Google search, a voice tells you a summary or such of the information you requested, acting sort-of like Google assistant, which is pretty cool.

Finally, the privacy policy was updated.

If there are any issues, please don't hesitate to create an issue in GitHub (first make sure to search in case a similar issue already exists), and/or join the Matrix rooms (check https://github.com/soupslurpr/transcribro?tab=readme-ov-file#community).

Enjoy!

Notable Changes

Full Changelog: v0.2.1...v0.3.0