An example script for running inference with FlashAttention is coming soon.