#大语言模型#Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding