Institute of Computing Technology, Chinese Academy IR
Detection and tracking based tubelet generation for video object detection | |
Wang, Bin1,2; Tang, Sheng1; Xiao, Jun-Bin1,2; Yan, Quan-Feng3; Zhang, Yong-Dong1 | |
2019 | |
发表期刊 | JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION |
ISSN | 1047-3203 |
卷号 | 58页码:102-111 |
摘要 | Video object detection (VID) is a more challenging task compared with still-image object detection, which not only needs to detect objects accurately per frame but also needs to track objects for a long period of time. In order to detect objects from videos, we propose a Detection And Tracking (DAT) based tubelet generation framework. Under this framework, we first propose a detection-based tubelet generation method which can generate tubelets with more accurate bounding boxes compared with traditional tracking-based methods. On the other hand, the latter can produce a higher recall of bounding boxes than the former in general. To take advantage of their complementary attributes, we further propose a novel tubelet fusion method to combine these multi-modal information (appearance information in independent images and contextual information in videos). Our extensive experiments on the well-known ILSVRC 2016 VID dataset show that our proposed method can achieve state-of-the-art performances. (C) 2018 Elsevier Inc. All rights reserved. |
关键词 | Object detection Tubelet generation Tubelet fusion |
DOI | 10.1016/j.jvcir.2018.11.014 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | National Key Research and Development Program of China[2017YFB1002202] ; National Natural Science Foundation of China[61572472] ; National Natural Science Foundation of China[61525206] ; National Natural Science Foundation of China[U1703261] ; National Natural Science Foundation of China[61571424] |
WOS研究方向 | Computer Science |
WOS类目 | Computer Science, Information Systems ; Computer Science, Software Engineering |
WOS记录号 | WOS:000457668100011 |
出版者 | ACADEMIC PRESS INC ELSEVIER SCIENCE |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/3457 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Tang, Sheng |
作者单位 | 1.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.Hunan Inst Sci & Technol, Coll Comp Sci, Yueyang 414006, Hunan, Peoples R China |
推荐引用方式 GB/T 7714 | Wang, Bin,Tang, Sheng,Xiao, Jun-Bin,et al. Detection and tracking based tubelet generation for video object detection[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,2019,58:102-111. |
APA | Wang, Bin,Tang, Sheng,Xiao, Jun-Bin,Yan, Quan-Feng,&Zhang, Yong-Dong.(2019).Detection and tracking based tubelet generation for video object detection.JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION,58,102-111. |
MLA | Wang, Bin,et al."Detection and tracking based tubelet generation for video object detection".JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 58(2019):102-111. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论