Added encoder output #1136

SinanAkkoyun · 2024-11-13T14:29:08Z

WhisperModel now also outputs encoder embeddings optionally.

Usage:
normal, backwards compatible:

segments, info = model.transcribe(audio_path, beam_size=1)

for encoder outputs:

segments, info, encoder_output = model.transcribe(audio_path, beam_size=1).all()

Usage is fully backwards compatible

MahmoudAshraf97 · 2024-11-13T14:45:14Z

Hello and thanks for your contribution, however, I don't see why would the encoder output be needed after the transcription, so far I'm not in favor of merging this PR as it's a very niche use case

Added encoder output

422bfab

SinanAkkoyun mentioned this pull request Nov 13, 2024

Hidden state / embeddings of encoder #1132

Closed

SinanAkkoyun added 2 commits November 13, 2024 15:43

blank line

b8839a6

Merge branch 'SYSTRAN:master' into encoder_output

8ae9256

MahmoudAshraf97 closed this Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added encoder output #1136

Added encoder output #1136

SinanAkkoyun commented Nov 13, 2024

MahmoudAshraf97 commented Nov 13, 2024

Added encoder output #1136

Added encoder output #1136

Conversation

SinanAkkoyun commented Nov 13, 2024

MahmoudAshraf97 commented Nov 13, 2024