メインコンテンツへスキップ

All Posts

News bits

FFmpeg 8.0がWhisperフィルターで自動音声認識に対応

FFmpeg 8.0で、Whisperフィルターが新しく追加された。これにより、FFmpeg単体でOpenAIのWhisperモデルを使用した自動音声認識が可能になった

plaintext
ffmpeg -i input.mp4 -vf "whisper=language=ja" -f srt output.srt

音声認識と同時に動画処理を行う例:

plaintext
ffmpeg -i input.mp4 -vf "whisper=language=en" -c:v libx264 -c:a aac output.mp4

これまで音声認識には別途Whisperの実行環境が必要だったが、FFmpeg単体で処理できるようになったことで、動画編集ワークフローでの自動字幕生成が容易になった。

出展:August 22nd, 2025, FFmpeg 8.0 “Huffman”

著者について

Hi there. I'm hrdtbs, a frontend expert and technical consultant. I started my career in the creative industry over 13 years ago, learning on the job as a 3DCG modeler and game engineer in the indie scene.

In 2015 I began working as a freelance web designer and engineer. I handled everything from design and development to operation and advertising, delivering comprehensive solutions for various clients.

In 2016 I joined Wemotion as CTO, where I built the engineering team from the ground up and led the development of core web and mobile applications for three years.

In 2019 I joined matsuri technologies as a Frontend Expert, and in 2020 I also began serving as a technical manager supporting streamers and content creators.

I'm so grateful to be working in this field, doing something that brings me so much joy. Thanks for stopping by.