Quick Whisper is a free and open-source speech-to-copy-edited-text software tool that uses AI to convert spoken audio into a copy-edited transcript, automatically pasting it into your active application.
Here is a video example of how this AI-enabled speech-to-text (STT) looks when running on Windows:
Designed to enhance productivity, it significantly accelerates workflows, allowing quicker responses to emails or messages, as speaking is generally two to three times faster than typing. Quick Whisper automates the entire speech-to-transcription-to-copy-editing process in the background, eliminating the need to switch apps for AI copy editing. This integration into your daily tasks makes using AI faster and easier, saving valuable time on writing-related activities.
Text-to-Mic uses the OpenAI text-to-speech engine, Whisper, which surpasses the standard text-to-speech tools available on Windows and Mac. This app is available to use for free.
- Automatic Speech-to-Text Conversion
Quickly captures spoken ideas and responses, allowing you to communicate faster than with traditional typing. - Built-in AI Copy Editing
Ensures polished, professional output by refining transcriptions for clarity, readability, and coherence, saving time on manual editing. - Auto-Paste Functionality
Instantly pastes the edited text into your active application, allowing a smooth, hands-free workflow without needing to switch apps. - Hotkey-Activated Recording
Reduces interruptions by enabling quick, one-click recording control, making it easy to integrate speech-to-text into your daily tasks. - Customizable AI Models
Provides flexibility in balancing performance and cost, allowing you to select models suited to your specific needs, from budget-friendly options to premium quality. - Adjustable Settings
Offers personalization through customizable auto-paste, auto-copy, and AI preferences, ensuring Quick Whisper seamlessly integrates with your workflow and preferences.
Watch the video above to see the power of the AI-enabled Quick Whisper in action!
Quick Whisper allows you to compose emails swiftly, reply to colleagues, and transcribe spoken text into paragraphs for blog articles or social media posts. It enhances your workflow and improves the quality of your output for any task requiring extensive written content.
The app is free, not use of your API Key when running it:
It's kind of like we've given you a free car, but you need to pay for the petrol to drive it; Although we are providing the app for free, please be aware that using an OpenAI key for transcription and AI copy editing incurs a cost. You should be mindful of these costs and keep track of them if you intend to use this tool. It is the software that we are offering for free, not the use of your API key. This isn't hugely expensive and can be mitigated by changing which model you use, and will vary by use.
Download Quick Whisper STT for Free
Virus scanners on windows can give false positives for this app given how it uses your mic and copy and paste. If you'd like to review and compile the source code yourself then you can access it here on github.
For Windows
- Download v1.8.0 for Windows (72MB ZIP)
- Download v1.8.0 for Windows (72MB EXE)
- Download v1.7.0 for Windows (72MB ZIP)
- Download v1.7.0 for Windows (72MB EXE)
- Download v1.6.1 for Windows (72MB ZIP)
- Download v1.6.1 for Windows (72MB EXE)
- Download v1.5.0 for Windows (72MB ZIP)
- Download v1.5.0 for Windows (73MB EXE)
- Download v1.4.0 for Windows (74MB ZIP)
- Download v1.4.0 for Windows (74MB EXE)
- Download v1.3.0 for Windows (73MB ZIP)
- Download v1.3.0 for Windows (74MB EXE)
- Download v1.2.0 for Windows (69MB ZIP)
- Download v1.2.0 for Windows (70.4MB EXE)
For Mac
- Though written in python, this app has not yet been compiled for Mac. Please let us know if you are a mac user who would like to use this so we can judge demand and consider releasing for Mac too.
If Windows displays a message stating that this is an unsigned application and asks if you want to run it anyway, please be assured this is just because we haven't compiled and exported a certified copy yet. This involves more work, and for now, we're releasing it for free. Perhaps we can release a certified Windows app store version later if there is enough demand.
Getting Started
- Download Quick Whisper
Begin by downloading the Quick Whisper application from the download link above. - Set Up Your OpenAI API Key
Obtain an OpenAI API key (see instructions on how to generate one) and keep it handy for setup. - Enter Your API Key
Open Quick Whisper and enter your OpenAI API key in the prompt that appears. - Start Recording
Click the Start Recording button or use the Windows + J shortcut to begin recording. Start speaking to capture your voice input. - Stop Recording
To end the recording, click Stop Recording or use the Windows + J shortcut again. - Retrieve Your Transcription
The app will automatically process your speech and produce a refined, copy-edited transcription. You can manually copy this text from the app or enable Auto-Copy and Auto-Paste to have it transferred directly into your active application.
Keyboard shortcuts explained:
- The "Win + J" keyboard shortcut will record, transcribe, and then automatically edit the transcription using AI.
- The "Win + Ctrl + J" shortcut will record and transcribe without using AI editing.
We've added both options because sometimes you might prefer a raw transcription rather than a copy-edited version.
Change Log
- 1.8.0 - 05/12/2024
Add "alt+left/right" keyboard shortcuts to navigate between prompts in the background (it reads their name out too). Added ability to set custom keyboard shortcuts. Refesh key bindings when app maximised (sometimes it loses them when you lock your screen so this enabled a quick re-bind process). - 1.7.0 - 02/12/2024
Add whisper input language select to prevent incorrect language detection (Whisper randomly thinks I'm speaking in Welsh sometimes). Fix key bindings being lost; add key binding test function. Update the Default prompt to account for different language input (translates to english by default). - 1.6.1 - 25/11/2024
Code structure improvements (tidy up) and small quality of life improvements, addition of cancel recording keyboard shortcut (Win+X), changes to make it ready for Mac users (pending testing), addition of new prompt editing and viewing feature to enable custom editing prompts. - 1.5.0 - 16/11/2024
UI improvements to enhance the visual flow and order of interface items. Also, updated buttons are to be rounded to look friendlier. - 1.4.0 - 15/11/2024
Update pop noises with different pitches so that it is clearer when a start or stop action is triggered. Add history for the current session and allow that history to be navigated. Also added saving of the current session to a JSON file. Note that history clears went he app is closed. Changed keyboard shortcut for transcription only to "Win+Ctrl+J". Fixed word wrapping issue in textarea. - 1.3.0 - 14/11/2024
Replace AI edit tick box with two buttons and independent keyboard shortcuts making it easier to trigger transcript only or edit only when running in the background. Added ability to copy the last transcript and edit from the context menu. Updated the copy edit prompt to better reflect tone of the intended use case. - 1.2.0 13/11/2024
Main public quick whisper release with core functions working.
Screenshots
The main quick whisper app interface (v1.5.0):
Model settings adjustment:
Retry last recording:
Prompt management:
Frequently Asked Questions (FAQs)
What is Quick Whisper, and how does it work?
Quick Whisper is a speech-to-copy-edited-text tool that uses AI to transcribe spoken audio into polished text. The app automatically pastes the text into your active application, saving time and enhancing productivity.
What are the key benefits of using Quick Whisper?
Quick Whisper speeds up workflows by enabling faster communication, saving time on editing with built-in AI, and eliminating the need to switch between applications for transcription and copy-editing tasks.
Does Quick Whisper support multiple AI models?
Yes, Quick Whisper allows users to select from various AI models, enabling flexibility in choosing the model that best suits your performance and budget needs. At present only OpenAI models via their API are supported.
How do I set up an OpenAI key to begin?
To set up an OpenAI key, first create an account with OpenAI if you haven't already. Navigate to their playground, which serves as their developer area. Under the dashboard, locate the API keys section, where you can set up your own key and copy it for use in the application. Please be cautious not to share these API keys, as others could use your account and incur charges. This video demonstrates how to generate an API key.
How do I activate Quick Whisper’s recording feature?
Quick Whisper includes a hotkey-activated recording function (win+j), which lets you start and stop recordings quickly, ensuring minimal interruptions to your workflow.
Can I customise settings in Quick Whisper?
Yes, Quick Whisper offers adjustable settings for auto-paste, auto-copy, and AI model preferences, allowing users to tailor the app to their individual preferences.
Is Quick Whisper compatible with my operating system?
Quick Whisper is designed for Windows and should work seamlessly on compatible Windows systems, ensuring smooth integration into your daily tasks.
What is an API key and what do you do with it?
The API key lets us connect to OpenAI to use their transcription and AI models within the application. You need to generate an API key, as it allows the app to use your OpenAI account. Consequently, OpenAI will bill you per use.
This setup enables us to offer the app for free, as we don't incur ongoing costs, given it utilises your API key. While the app is free, API key use is chargeable per use. We don't store the key in the cloud or have access to it on our side; it's saved only on your computer and accessible solely to you. We only use it to authenticate your access to the OpenAI API. It's crucial not to share this key with others.
What is the difference between the GPT models in AI manipulation settings?
This setting determines which AI 'model' is used to manipulate input or recorded text based on the provided prompt. Think of it as picking which AI brain to use. For example, at the time of writing:
- gpt-4o-mini is cheaper per word to manipulate text and is faster but less intelligent than gpt-4o.
- gpt-4o is a more powerful AI and is more likely to be able to deal with complex instructions, but it costs more per word to run and is a littler slower.
We recommend trying 4o-mini first due to its speed benefits and switching to GPT4 should you find you want it to perform certain AI manipulations better.
The OpenAI Whisper model has been open-sourced. Why didn't you use this free version instead of using an API key that incurs charges?
Yes, you can download the Whisper model for free and run it locally and this was an option to us when making the app; however, the model download file is quite large, often in gigabytes. The performance of running it locally depends greatly on the hardware specifications of your machine, which may result in slow operation.
Speed is crucial for the user experience of this application, so we decided to use the OpenAI Whisper API for transcription and copy editing. Although it incurs a cost, it significantly outperforms any on-device text-to-speech or copy-editing engines we've used so far. Our aim was to optimise for speed, user experience, and output quality, rather than being overly concerned about the API call cost, which is relatively minor in the grand scheme of things.
Will my speech and transcripts be used to train OpenAI's models?
No, because we handle the transcription and copy editing using the API version of OpenAI, they have committed that data sent and processed in this way is not used to train their models.
I have ideas for new features or custom extensions that would benefit my business. Can you help me with that?
If you notice a bug or small quality-of-life enhancement, please let us know, and we will consider implementing it in the tool for free.
We can also accommodate more substantial enhancements, such as custom extensions for business; Though please be aware these are likely to carry a development charge. Please contact us to let us know what you have in mind.
Terms of Use, Disclaimer, and Licence Information
Quick Whisper is provided "as is" and on an "as available" basis, without any warranties of any kind, either express or implied. Scorchsoft Ltd expressly disclaims all warranties, whether express, implied, statutory, or otherwise, including but not limited to the implied warranties of merchantability, fitness for a particular purpose, and non-infringement. We do not warrant that the software will function uninterrupted, that it is error-free, or that any errors or defects will be corrected.
Limitation of Liability
In no event will Scorchsoft Ltd be liable for any indirect, incidental, special, consequential, or punitive damages resulting from or related to your use or inability to use Text to Mic, including but not limited to damages for loss of profits, goodwill, use, data, or other intangible losses, even if Scorchsoft Ltd has been advised of the possibility of such damages.
Use at Your Own Risk
By using Quick Whisper, you acknowledge and agree that you assume full responsibility for your use of the software, and that any information you send or receive during your use of the software may not be secure and may be intercepted or later acquired by unauthorized parties. Use of Text to Mic is at your sole risk.
License Agreement