Evaluate open-ended outputs from AI models using MM-Vet
Convert text to audio and vice versa
Visualize evaluation results for URA-LLaMa