[论文]余婷等人.Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering 时间:2024-06-27 09:58:34 文章来源 :学科 浏览量:0 Multi-Granularity_Contrastive_Cross-Modal_Collaborative_Generation_for_End-to-End_Long-Term_Video_Question_Answering