---
title: LoRACaptioner
emoji: 🤠
colorFrom: red
colorTo: green
sdk: gradio
sdk_version: 5.25.2
app_file: demo.py
pinned: false
---

# LoRACaptioner

- **Image Captioning**: Automatically generate detailed and structured captions for your LoRA dataset.
- **Prompt Optimization**: Enhance prompts during inference to achieve high-quality outputs.

## Installation

### Prerequisites

- Python 3.10 or higher
- [Together AI](https://together.ai/) account and API key

### Setup

1. Create and activate a virtual environment, then install the dependencies:

```bash
python -m venv venv
source venv/bin/activate
python -m pip install -r requirements.txt
```

2. Run inference on one set of images:

```bash
python main.py --input examples/ --output output/
```

### Arguments

- `--input` (str): Directory containing images to caption.
- `--output` (str): Directory to save images and captions (defaults to input directory).
- `--batch_images` (flag): Caption images in batches by category.
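
The arguments above could be parsed along these lines. This is a minimal sketch, not the actual `main.py`: the helper names `build_parser` and `resolve_output_dir` are hypothetical, and treating `--input` as required is an assumption.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented flags (not the real main.py)."""
    parser = argparse.ArgumentParser(description="Caption images for a LoRA dataset.")
    # Assumption: --input is mandatory; the README does not say so explicitly.
    parser.add_argument("--input", type=str, required=True,
                        help="Directory containing images to caption.")
    parser.add_argument("--output", type=str, default=None,
                        help="Directory to save images and captions.")
    parser.add_argument("--batch_images", action="store_true",
                        help="Caption images in batches by category.")
    return parser


def resolve_output_dir(args: argparse.Namespace) -> Path:
    """Fall back to the input directory when --output is omitted."""
    return Path(args.output) if args.output else Path(args.input)


# Parse an explicit argument list instead of sys.argv for illustration.
args = build_parser().parse_args(["--input", "examples/"])
print(resolve_output_dir(args), args.batch_images)
```

Passing `--output output/` would make `resolve_output_dir` return that directory instead, and adding `--batch_images` sets the flag to `True`.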

## Gradio Demo

Launch a user-friendly web interface for captioning and prompt optimization:

```bash
python demo.py
```

### Notes

- Images are processed individually in standard mode.
- For large collections, batch processing by category is recommended.
- Each caption is saved as a `.txt` file with the same name as the image.

### Troubleshooting

- **API errors**: Ensure your Together API key is set and has funds.
- **Image formats**: Only `.png`, `.jpg`, `.jpeg`, and `.webp` files are supported.

## Manual Captioning with ChatGPT

Follow the instructions in my [blog post](https://rishidesai.github.io/posts/character-lora/) and use `system_prompt.txt` as the system prompt.

## Examples

### Sukuna from Jujutsu Kaisen

**User Prompt:** holding a bow and arrow in a dense forest

**Optimized Prompt:** `tr1gg3r anime-style, pink spiky hair and black markings on face, shirtless with dark arm bands, holding bow and arrow, focused expression, dense forest, soft dappled lighting, three-quarter view`

*Image: Sukuna with bow and arrow*

---

**User Prompt:** drinking coffee in a san francisco cafe, white cloak, side view

**Optimized Prompt:** `tr1gg3r anime-style, spiky pink hair and facial markings, white cloak, sitting with cup in hand, neutral expression, cafe interior with san francisco view, soft natural lighting, side profile`

*Image: Sukuna drinking coffee*

---

**User Prompt:** playing pick-up basketball on a sunny day

**Optimized Prompt:** `tr1gg3r photorealistic, athletic build, sleeveless basketball jersey and shorts, jumping with ball, focused expression, outdoor basketball court with spectators, bright sunlight, low-angle view`

*Image: Sukuna playing basketball*

---

### A character generated by Flux.1-dev

**User Prompt:** riding a horse on a prairie during sunset

**Optimized Prompt:** `tr1gg3r photorealistic, curly shoulder-length hair, floral button-up shirt, riding a horse, neutral expression, prairie during sunset, warm directional lighting, three-quarter view`

*Image: Woman riding a horse*

---

**User Prompt:** painting on a canvas in an art studio, side-view

**Optimized Prompt:** `tr1gg3r photorealistic, curly shoulder-length hair, floral button-up shirt, standing at an angle with brush in hand, neutral expression, art studio with canvas and paints, soft natural lighting, right side profile`

*Image: Woman painting in studio*

---

**User Prompt:** standing on a skyscraper in a dense city, dramatic stormy lighting, rear view

**Optimized Prompt:** `tr1gg3r photorealistic, curly shoulder-length hair, floral button-up shirt, standing upright, neutral expression, skyscraper rooftop in dense city, dramatic stormy lighting, back view`

*Image: Woman on skyscraper*
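
The optimized prompts above appear to share one comma-separated layout: trigger word and style, then appearance, clothing, pose, expression, setting, lighting, and camera angle. The sketch below assembles a prompt in that order. The field names, the `PromptFields` class, and `build_prompt` are hypothetical and inferred only from these examples; they are not taken from the project's code.

```python
from dataclasses import dataclass


@dataclass
class PromptFields:
    """Hypothetical container; field names are inferred from the examples above."""
    style: str
    appearance: str
    clothing: str
    pose: str
    expression: str
    setting: str
    lighting: str
    camera_angle: str


def build_prompt(fields: PromptFields, trigger: str = "tr1gg3r") -> str:
    """Join the fields in the order observed in the example prompts."""
    parts = [
        fields.style, fields.appearance, fields.clothing, fields.pose,
        fields.expression, fields.setting, fields.lighting, fields.camera_angle,
    ]
    # The trigger word is separated from the style by a space, not a comma.
    return f"{trigger} " + ", ".join(parts)


# Rebuilds the skyscraper example from its individual parts.
example = PromptFields(
    style="photorealistic",
    appearance="curly shoulder-length hair",
    clothing="floral button-up shirt",
    pose="standing upright",
    expression="neutral expression",
    setting="skyscraper rooftop in dense city",
    lighting="dramatic stormy lighting",
    camera_angle="back view",
)
print(build_prompt(example))
```

Keeping the fields separate makes it easy to hold appearance and clothing fixed for a character while varying only the pose, setting, lighting, and camera angle between prompts.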