Institute of Computing Technology, Chinese Academy IR
| Detection and tracking based tubelet generation for video object detection | |
| Wang, Bin1,2; Tang, Sheng1; Xiao, Jun-Bin1,2; Yan, Quan-Feng3; Zhang, Yong-Dong1 | |
| 2019 | |
| 发表期刊 | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
![]() |
| ISSN | 1047-3203 |
| 卷号 | 58页码:102-111 |
| 摘要 | Video object detection (VID) is a more challenging task compared with still-image object detection, which not only needs to detect objects accurately per frame but also needs to track objects for a long period of time. In order to detect objects from videos, we propose a Detection And Tracking (DAT) based tubelet generation framework. Under this framework, we first propose a detection-based tubelet generation method which can generate tubelets with more accurate bounding boxes compared with traditional tracking-based methods. On the other hand, the latter can produce a higher recall of bounding boxes than the former in general. To take advantage of their complementary attributes, we further propose a novel tubelet fusion method to combine these multi-modal information (appearance information in independent images and contextual information in videos). Our extensive experiments on the well-known ILSVRC 2016 VID dataset show that our proposed method can achieve state-of-the-art performances. (C) 2018 Elsevier Inc. All rights reserved. |
| 关键词 | Object detection Tubelet generation Tubelet fusion |
| DOI | 10.1016/j.jvcir.2018.11.014 |
| 收录类别 | SCI |
| 语种 | 英语 |
| 资助项目 | National Key Research and Development Program of China[2017YFB1002202] ; National Natural Science Foundation of China[61572472] ; National Natural Science Foundation of China[61525206] ; National Natural Science Foundation of China[U1703261] ; National Natural Science Foundation of China[61571424] |
| WOS研究方向 | Computer Science |
| WOS类目 | Computer Science, Information Systems ; Computer Science, Software Engineering |
| WOS记录号 | WOS:000457668100011 |
| 出版者 | ACADEMIC PRESS INC ELSEVIER SCIENCE |
| 引用统计 | |
| 文献类型 | 期刊论文 |
| 条目标识符 | http://119.78.100.204/handle/2XEOYT63/3457 |
| 专题 | 中国科学院计算技术研究所期刊论文_英文 |
| 通讯作者 | Tang, Sheng |
| 作者单位 | 1.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.Hunan Inst Sci & Technol, Coll Comp Sci, Yueyang 414006, Hunan, Peoples R China |
| 推荐引用方式 GB/T 7714 | Wang, Bin,Tang, Sheng,Xiao, Jun-Bin,et al. Detection and tracking based tubelet generation for video object detection[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,2019,58:102-111. |
| APA | Wang, Bin,Tang, Sheng,Xiao, Jun-Bin,Yan, Quan-Feng,&Zhang, Yong-Dong.(2019).Detection and tracking based tubelet generation for video object detection.JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,58,102-111. |
| MLA | Wang, Bin,et al."Detection and tracking based tubelet generation for video object detection".JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 58(2019):102-111. |
| 条目包含的文件 | 条目无相关文件。 | |||||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论