Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Audio Transcription Capability? #6

Open
doxgt opened this issue May 17, 2024 · 1 comment
Open

Add Audio Transcription Capability? #6

doxgt opened this issue May 17, 2024 · 1 comment

Comments

@doxgt
Copy link

doxgt commented May 17, 2024

Greetings.

I have been able to use the OpenAI Python library to send audio recordings to OpenAI for transcription (https://platform.openai.com/docs/guides/speech-to-text/quickstart).

However, I am wondering about WinHTTP based interactions with OpenAI as you demonstrated in your utility. And I'd always prefer AHK to tinkering with Python.

I am wondering if you happen to have any plan to add a module for uploading audio file for transcription.

If not, could you point me to the way on how to interface with the API in terms of uploading audio files? I kind of figured that I would be doing something along the line of ComObject("WinHttp.WinHttpRequest.5.1").SetRequestHeader("Content-Type", "multipart/form-data").

Instead of "https://api.openai.com/v1/chat/completions", the speech API URL is at "https://api.openai.com/v1/audio/transcriptions; and the API Model would be "whisper-1".

Then I am not sure where to go from there.

Many thanks in advance!

@doxgt
Copy link
Author

doxgt commented May 19, 2024

I figured out how to use cURL to send audio files. No further actions needed here. Thanks for taking a look if you did.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant