--- title: LoRACaptioner emoji: 🤠 colorFrom: red colorTo: green sdk: gradio sdk_version: 5.25.2 app_file: demo.py pinned: false --- # LoRACaptioner - **Image Captioning**: Automatically generate detailed and structured captions for your LoRA dataset. - **Prompt Optimization**: Enhance prompts during inference to achieve high-quality outputs.
Sukuna example 4 Sukuna example 5 Sukuna example 6 Sukuna example 7
## Installation ### Prerequisites - Python 3.11 or higher - [Together API](https://together.ai/) account and API key ### Setup 1. Create the virtual environment: ```bash python -m venv venv source venv/bin/activate python -m pip install -r requirements.txt ``` 2. Run inference on one set of images: ```bash python main.py --input examples/ --output output/ ```
Arguments - `--input` (str): Directory containing images to caption. - `--output` (str): Directory to save images and captions (defaults to input directory). - `--batch_images` (flag): Caption images in batches by category.
## Gradio Web Interface Launch a user-friendly web interface for captioning and prompt optimization: ```bash python demo.py ``` ### Notes - Images are processed individually in standard mode - For large collections, batch processing by category is recommended - Each caption is saved as a .txt file with the same name as the image ### Troubleshooting - **API errors**: Ensure your Together API key is set and has funds - **Image formats**: Only .png, .jpg, .jpeg, and .webp files are supported ### Examples
User Prompt:

holding a bow and arrow in a dense forest

Optimized Prompt:

tr1gg3r anime-style, pink spiky hair and black markings on face, shirtless with dark arm bands, holding bow and arrow, focused expression, dense forest, soft dappled lighting, three-quarter view

Sukuna with bow and arrow
User Prompt:

drinking coffee in a san francisco cafe, white cloak, side view

Optimized Prompt:

tr1gg3r anime-style, spiky pink hair and facial markings, white cloak, sitting with cup in hand, neutral expression, cafe interior with san francisco view, soft natural lighting, side profile

Sukuna drinking coffee
User Prompt:

playing pick-up basketball on a sunny day

Optimized Prompt:

tr1gg3r photorealistic, athletic build, sleeveless basketball jersey and shorts, jumping with ball, focused expression, outdoor basketball court with spectators, bright sunlight, low-angle view

Sukuna playing basketball
User Prompt:

riding a horse on a prairie during sunset

Optimized Prompt:

tr1gger photorealistic, curly shoulder-length hair, floral button-up shirt, riding a horse, neutral expression, prairie during sunset, warm directional lighting, three-quarter view

Woman riding a horse
User Prompt:

painting on a canvas in an art studio, side-view

Optimized Prompt:

tr1gg3r photorealistic, curly shoulder-length hair, floral button-up shirt, standing at an angle with brush in hand, neutral expression, art studio with canvas and paints, soft natural lighting, right side profile

Woman painting in studio
User Prompt:

standing on a skyscraper in a dense city, dramatic stormy lighting, rear view

Optimized Prompt:

tr1gg3r photorealistic, curly shoulder-length hair, floral button-up shirt, standing upright, neutral expression, skyscraper rooftop in dense city, dramatic stormy lighting, back view

Woman on skyscraper
## License [MIT License](LICENSE)