Page 1 - Showing 8 of 15 posts
- Work Log
A record of each week's work.
1 min read · Chinese
- LongVALE Paper Reproduction
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos (paper reproduction)
3 min read · Chinese
- Whisper Paper Reading
Robust Speech Recognition via Large-Scale Weak Supervision (paper study)
7 min read · Chinese
- Transformer Paper Reading
Attention Is All You Need (close reading with detailed explanation)
10 min read · Chinese
- Connecting to the Lab Server
Notes on the steps for connecting to the remote server.
4 min read · Chinese
- LongVALE Paper Reading
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos (paper study)
2 min read · Chinese
- VTimeLLM Paper Reading
VTimeLLM: Empower LLM to Grasp Video Moments (paper study)
6 min read · Chinese
- AVSegFormer Paper Reading
AVSegFormer: Audio-Visual Segmentation with Transformer (paper study)
9 min read · Chinese