Spaces:

snackshell
/

amharic-tts

Running

App Files Files Community

snackshell commited on Mar 23

Commit

04f3f01

verified ·

1 Parent(s): 51fe9d6

Upload 4 files

Browse files

Files changed (4) hide show

LICENSE +21 -0
README.md +115 -14
app.py +57 -0
requirements.txt +5 -0

LICENSE ADDED Viewed

	@@ -0,0 +1,21 @@

+MIT License
+Copyright (c) 2025 Snackshell
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md CHANGED Viewed

@@ -1,14 +1,115 @@
----
-title: Amharic Tts
-emoji: 🦀
-colorFrom: blue
-colorTo: blue
-sdk: gradio
-sdk_version: 5.22.0
-app_file: app.py
-pinned: false
-license: mit
-short_description: ' # Amharic Text-to-Speech (TTS) Application'
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+```markdown
+# Amharic Text-to-Speech (TTS) Application
+```
+<div align="center">
+  <img src="./assets/demo.png" alt="Amharic TTS Interface" width="800">
+  <br>
+  <em>Convert Amharic text to natural-sounding speech directly in your browser</em>
+</div>
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+...
+[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
+A simple web-based Text-to-Speech application focused on Amharic language support, powered by Microsoft Edge TTS.
+## Features ✨
+- 🗣️ Native Amharic voice support (Male & Female)
+- 🌍 Web interface with Amharic localization
+- ⚡ Real-time speech synthesis
+- 🎧 Direct audio playback in browser
+- 🛠️ Error handling with Amharic/English messages
+- ⏱️ 30-second timeout protection
+## Supported Voices 🎶
+| Name   | Gender | Voice ID           |
+|--------|--------|--------------------|
+| Ameha  | Male   | `am-ET-AmehaNeural`|
+| Mekdes | Female | `am-ET-MekdesNeural`|
+## Installation 💻
+### Prerequisites
+- Python 3.8+
+- pip package manager
+### Steps
+1. Clone repository:
+```bash
+git clone https://github.com/snackshell/amharic-tts.git
+cd amharic-tts
+```
+2. Create virtual environment:
+```bash
+python -m venv venv
+source venv/bin/activate  # Linux/Mac
+venv\Scripts\activate     # Windows
+```
+3. Install dependencies:
+```bash
+pip install -r requirements.txt
+```
+## Usage 🚀
+1. Start the application:
+```bash
+python app.py
+```
+2. Access the interface at:
+```
+http://localhost:7860
+```
+3. Enter Amharic text and select a voice:
+   - Type/paste text in the input box
+   - Choose between Ameha (Male) or Mekdes (Female)
+   - Click "ድምፅ ፍጠር" (Generate Audio)
+4. Play generated audio using the built-in player
+## Technical Details 🔧
+### Architecture
+```mermaid
+graph TD
+    A[User Interface] --> B(Gradio Frontend)
+    B --> C[Edge TTS Service]
+    C --> D[Microsoft Cognitive Services]
+```
+### Key Technologies
+- Python 3.10+
+- Gradio (Web Interface)
+- edge-tts (TTS Engine)
+- asyncio (Async Operations)
+- tempfile (Audio File Handling)
+## Contributing 🤝
+Contributions are welcome! Please follow these steps:
+1. Fork the repository
+2. Create a feature branch (`git checkout -b feature/your-feature`)
+3. Commit changes (`git commit -m 'Add some feature'`)
+4. Push to branch (`git push origin feature/your-feature`)
+5. Open a Pull Request
+## License 📄
+This project is licensed under the MIT License - see [LICENSE](LICENSE) file for details.
+## Acknowledgments 🙏
+- Microsoft Edge TTS services
+- Gradio team for the web interface framework
+- [Bana Codes](https://t.me/banacodes) community for Amharic language support
+```
+Create these additional files:
+1. **requirements.txt**
+```text
+gradio==4.13.0
+edge-tts==6.1.3
+python-dotenv==1.0.0
+```

app.py ADDED Viewed

	@@ -0,0 +1,57 @@

+import tempfile
+import edge_tts
+import gradio as gr
+import asyncio
+language_dict = {
+    "Amharic": {
+        "Ameha": "am-ET-AmehaNeural",
+        "Mekdes": "am-ET-MekdesNeural"
+    }
+}
+async def text_to_speech_edge(text, speaker):
+    voice = language_dict["Amharic"][speaker]
+    try:
+        communicate = edge_tts.Communicate(text, voice)
+        # Create temp file with increased timeout
+        with tempfile.NamedTemporaryFile(delete=False, suffix=".mp3") as tmp_file:
+            tmp_path = tmp_file.name
+            await asyncio.wait_for(communicate.save(tmp_path), timeout=30)
+        return tmp_path
+    except asyncio.TimeoutError:
+        error_msg = "ስህተት: ጊዜ አልቋል። እባክዎ እንደገና ይሞክሩ። (Timeout)"
+        raise gr.Error(error_msg)
+    except Exception as e:
+        error_msg = f"ስህተት: ድምፅ መፍጠር አልተቻለም።\nError: {str(e)}"
+        raise gr.Error(error_msg)
+with gr.Blocks(title="Amharic TTS") as demo:
+    gr.HTML("<center><h1>Amharic Text-to-Speech</h1></center>")
+    with gr.Row():
+        with gr.Column():
+            input_text = gr.Textbox(lines=5, label="የአማርኛ ጽሑፍ",
+                                  placeholder="ድምፅ ለመፍጠር ጽሑፍ ያስገቡ...")
+            speaker = gr.Dropdown(
+                choices=["Ameha", "Mekdes"],
+                value="Ameha",
+                label="አርቲስት"
+            )
+            run_btn = gr.Button(value="ድምፅ ፍጠር", variant="primary")
+        with gr.Column():
+            output_audio = gr.Audio(type="filepath", label="የድምፅ ውጤት")
+    run_btn.click(
+        text_to_speech_edge,
+        inputs=[input_text, speaker],
+        outputs=output_audio
+    )
+if __name__ == "__main__":
+    demo.launch(server_port=7860, share=False)

requirements.txt ADDED Viewed

	@@ -0,0 +1,5 @@

+gradio
+edge-tts
+pyarabic
+gradio-client
+transformers