Spaces: deepakkumar07/whisper-small-tamil-demo


deepakkumar07 committed on Mar 5

Commit 86e7cd1 (verified) · 1 Parent(s): abce46e

Uploading food not food text classifier demo app.py


Files changed (4)

.gitignore +1 -0
README.md +48 -10
app.py +18 -0
requirements.txt +3 -0

.gitignore ADDED

@@ -0,0 +1 @@
+ script.py

README.md CHANGED

@@ -1,12 +1,50 @@
 ---
-title: Whisper Small Tamil Demo
-emoji: 😻
-colorFrom: gray
-colorTo: red
-sdk: gradio
-sdk_version: 5.20.0
-app_file: app.py
-pinned: false
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# Whisper Small Tamil - Hugging Face Demo
+This repository hosts a demo for the **Whisper Small Tamil** model, fine-tuned for Tamil speech recognition. This model is based on OpenAI's Whisper-Small and has been trained to improve Automatic Speech Recognition (ASR) for Tamil language inputs.
+## 🚀 Demo
+Try the model directly on [🤗 Hugging Face Spaces](https://huggingface.co/spaces/deepakkumar07/whisper-small-tamil).
+## 📝 Model Details
+- **Base Model:** OpenAI Whisper-Small
+- **Fine-tuned for:** Tamil ASR
+- **Dataset Used:** Common Voice Tamil & other curated datasets
+- **Supports:** Tamil speech-to-text transcription
+## 🔧 How to Use
+You can use this model in Python with the `transformers` library:
+```python
+from transformers import pipeline
+# Load model from Hugging Face Hub
+asr_pipeline = pipeline("automatic-speech-recognition", model="deepakkumar07/whisper-small-tamil")
+# Transcribe an audio file
+result = asr_pipeline("path/to/audio.wav")
+print(result["text"])
+```
+## 📊 Performance
+This model is optimized for Tamil speech but may still have minor errors in transcription, especially with noisy audio or mixed-language inputs. Contributions and improvements are welcome!
+## 📌 Training Details
+- Fine-tuned using the **Hugging Face Transformers** and **datasets** libraries.
+- Trained on GPUs for better performance.
+- Supports **streaming inference** for real-time transcription.
+## 💡 Applications
+- Tamil voice-to-text conversion
+- Subtitling and transcription services
+- Voice-controlled Tamil applications
+## 🤝 Contributing
+If you find any issues or want to improve the model, feel free to open a PR or reach out!
+## 📜 License
+This model is released under an open license. Please refer to OpenAI's original Whisper license for base model terms.
 ---
+For more details, check out the [Hugging Face model page](https://huggingface.co/deepakkumar07/whisper-small-tamil). 🚀

app.py ADDED

@@ -0,0 +1,18 @@

+from transformers import pipeline
+import gradio as gr
+pipe = pipeline(model="deepakkumar07/whisper-small-tamil")  # change to "your-username/the-name-you-picked"
+def transcribe(audio):
+    text = pipe(audio)["text"]
+    return text
+iface = gr.Interface(
+    fn=transcribe,
+    inputs=gr.Audio(sources=["microphone", "upload"], type="filepath"),
+    outputs="text",
+    title="Whisper Small Tamil",
+    description="Realtime demo for Tamil speech recognition using a fine-tuned Whisper small model.",
+)
+if __name__ == "__main__":
+    iface.launch()
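
Review note: the `transcribe` function in this hunk forwards whatever Gradio hands it straight to the pipeline. With `type="filepath"`, Gradio may pass `None` when the user submits without recording or uploading anything, which would raise inside `pipe`. The following is a minimal sketch of a guard, not part of this commit. The helper name `make_transcriber` and the message text are hypothetical, and `asr` stands in for the `pipe` object from `app.py`.

```python
from typing import Callable, Optional


def make_transcriber(asr: Callable[[str], dict]) -> Callable[[Optional[str]], str]:
    """Wrap an ASR callable so that missing audio yields a message, not an exception.

    `asr` is assumed to behave like the transformers pipeline in app.py:
    it takes a file path and returns a dict with a "text" key.
    """

    def transcribe(audio: Optional[str]) -> str:
        # Gradio can pass None if nothing was recorded or uploaded.
        if audio is None:
            return "No audio received. Please record or upload a clip."
        return asr(audio)["text"]

    return transcribe


if __name__ == "__main__":
    # Stand-in for the real pipeline, used only to show the control flow.
    def fake_asr(path: str) -> dict:
        return {"text": f"transcript of {path}"}

    transcribe = make_transcriber(fake_asr)
    print(transcribe(None))
    print(transcribe("clip.wav"))
```

In `app.py` this would be wired up as `fn=make_transcriber(pipe)`, leaving the rest of the `gr.Interface` call unchanged.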

requirements.txt ADDED

@@ -0,0 +1,3 @@

+gradio
+torch
+transformers