Commit 7a27b3e (verified) by asahi417 · 1 parent: e1c7798

Create benchmark.sh

Files changed (1)
  1. benchmark.sh +9 -0
benchmark.sh ADDED
@@ -0,0 +1,9 @@
+ # clone dataset
+ git clone https://huggingface.co/datasets/kotoba-tech/kotoba-whisper-eval
+ # convert to 16khz
+ ffmpeg -i kotoba-whisper-eval/audio/long_interview_1.mp3 -ar 16000 -ac 1 -c:a pcm_s16le kotoba-whisper-eval/audio/long_interview_1.wav
+ ffmpeg -i kotoba-whisper-eval/audio/manzai1.mp3 -ar 16000 -ac 1 -c:a pcm_s16le kotoba-whisper-eval/audio/manzai1.wav
+ ffmpeg -i kotoba-whisper-eval/audio/manzai2.mp3 -ar 16000 -ac 1 -c:a pcm_s16le kotoba-whisper-eval/audio/manzai2.wav
+ ffmpeg -i kotoba-whisper-eval/audio/manzai3.mp3 -ar 16000 -ac 1 -c:a pcm_s16le kotoba-whisper-eval/audio/manzai3.wav
+ # run the benchmark
+ python benchmark.py
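
Review note: the four ffmpeg calls differ only in the clip name, so they could be folded into a loop. The following is a minimal sketch of that idea, not part of the commit. The `AUDIO_DIR` and `DRY_RUN` variables and the helper names `wav_path`, `ffmpeg_cmd`, and `convert_all` are hypothetical; the ffmpeg flags are copied unchanged from the script (16 kHz sample rate, one channel, 16-bit PCM).

```shell
#!/usr/bin/env bash
# Sketch only: loop form of the four ffmpeg calls in benchmark.sh.
# AUDIO_DIR, DRY_RUN and the function names are illustrative assumptions.
AUDIO_DIR="${AUDIO_DIR:-kotoba-whisper-eval/audio}"

# Map an .mp3 path to the .wav path next to it.
wav_path() {
  printf '%s\n' "${1%.mp3}.wav"
}

# Print the ffmpeg command for one input file, using the commit's flags.
ffmpeg_cmd() {
  printf 'ffmpeg -i %s -ar 16000 -ac 1 -c:a pcm_s16le %s\n' "$1" "$(wav_path "$1")"
}

# Convert every clip named in the commit.
# With DRY_RUN=1 the commands are printed instead of executed.
convert_all() {
  local name src
  for name in long_interview_1 manzai1 manzai2 manzai3; do
    src="$AUDIO_DIR/$name.mp3"
    if [ "${DRY_RUN:-0}" = "1" ]; then
      ffmpeg_cmd "$src"
    else
      ffmpeg -i "$src" -ar 16000 -ac 1 -c:a pcm_s16le "$(wav_path "$src")"
    fi
  done
}
```

Calling `convert_all` after the `git clone` step would stand in for the four explicit lines; running it first with `DRY_RUN=1` prints the commands so they can be compared against the originals.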