Amazon.co.uk Widgets

Log in

X
Open AI Logo (the logo belongs to Open AI)

Ever wished you could transcribe the text of an audio file easily just from the command line, without heath robinson playback and dictation software? Its easy to do now from the command line and only takes a few minutes to set up. Just remember, you're the product and nothing is for free so don't send data that might be confidential to be transcribed.

TL:DR – The whisper tool from openai does the job well, certainly for my needs, in UK English and is easy to install and use. YMMV.

Install ffmpeg

You'll need homebrew, to install some tools, (see brew.sh). You'll need a Mac and enough proficiency with the terminal to change to the right folder where your audio file is (hint: type cd  and then drop the folder from finder onto the terminal window to auto complete the full folder name. Then run brew install ffmpeg . 

% brew install ffmpeg
==> Auto-updating Homebrew...
... lots of output, allow it to finish ...

Install Whisper

Run % pip install -U openai-whisper

% pip install -U openai-whisper
Collecting openai-whisper
  Downloading openai-whisper-20231117.tar.gz (798 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 798.6/798.6 kB 14.2 MB/s eta 0:00:00
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Installing backend dependencies ... done
  Preparing metadata (pyproject.toml) ... done
... lots of output, allow it to finish ...

Transcribe text from an audio file on a Mac

Just enter the command from the folder the file is in. whisper Audio.mp4 --output_format txt

Theres a console output showing you the progress through the audio, heres a snippet

[25:17.040 --> 25:23.440]  But that's a sort of slightly longer-term thing, perhaps.
[25:23.440 --> 25:26.040]  Any more? Any other questions from anyone?
[25:28.600 --> 25:29.600]  All right, thanks.
[25:29.600 --> 25:31.760]  Thanks for having me. I'll be here.

Go and have a cup of cofffe, when you return your audio transcription will be in the same folder.

Magic really. 

Licences, trademarks, source code licences and attributions

Licences, trademarks, source code licences and attributions

Multizone and this site is not affiliated with or endorsed by The Joomla! Project™. Any products and services provided through this site are not supported or warrantied by The Joomla! Project or Open Source Matters, Inc. Use of the Joomla!® name, symbol, logo and related trademarks is permitted under a limited licence granted by Open Source Matters, Inc. 928uk® is a trademark of Multizone Limited, registered in the UK. AdMob™, AdSense™, AdWords™, Android™, Chrome OS™, Chromebook™, Chrome™, DART™, Flutter™, Firebase™, Firestore™, Fuchsia™, Gmail™, Google Maps™, Google Pixel™, Google Play™, Pixelbook Go™, and Pixel™ and other trademarks listed at the Google Brand Resource center are trademarks of Google LLC and this site is not endorsed by or affiliated with Google in any way. Apple and the Apple logo are trademarks of Apple Inc., registered in the U.S. and other countries. App Store is a service mark of Apple Inc. The OSI logo trademark is the trademark of Open Source Initiative. UNIX® and the X® logo are registered trademarks of The Open Group. Any other product or company names may be trademarks™ or registered® trademarks of their respective holders. Use of these trademarks in articles here does not apply affiliation or endorsement by any of them.

Where the source code is published here on ezone.co.uk or on our GitHub by Angus Fox, Multizone Limited it is licenced according to the open source practice for the project concerned.

BSD 3-Clause "New" or "Revised" Licence
Original source code for mobile apps are licenced using the same licence as the one used by "The Flutter Authors". This Licence, the BSD 3-Clause "New" or "Revised" Licence (bsd-3-clause) is a permissive licence with a clause that prohibits others from using the name of the project or its contributors to promote derived products without written consent.
GNU General Public Licence v2.0 or later
Original source code for Joomla! published here on ezone.co.uk by Angus Fox, Multizone Limited is licenced using the same licence as the one used by Joomla!. This Licence, the GNU General Public Licence Version 2 or later (gpl-2.0) is the most widely used free software licence and has a strong copyleft requirement. When distributing derived works, the source code of the work must be made available under the same licence.

You can use any code you find here, just respect the licences and dont use the name of this site or our company to promote derived products without written consent. I mean, why would you? You're not us!

Amazon Associate
As an Amazon Associate we earn from qualifying purchases.
Logo
Our Logo Image is by Freepik. We chose it because its an M and also the letter A twice - and that represents us.