图像序列的显著性目标区域检测方法

柯洪昌; 孙宏彬

doi:10.3788/CO.20150805.0768

图像序列的显著性目标区域检测方法

doi: 10.3788/CO.20150805.0768

cstr: 32171.14.CO.20150805.0768

柯洪昌^,,
孙宏彬^,

长春工程学院计算机技术与工程学院, 吉林长春 130012

基金项目: 国家高技术研究发展计划(863计划)资助项目(No.2012AA040104);吉林省科技厅资助项目(No.20120332);吉林省发改委资助项目(No.2013C048);吉林省科技厅国际合作资助项目(No.20140105);吉林省教育厅资助项目(No.20130434,No.20140032)

详细信息

通讯作者:
柯洪昌(1981—)，男，吉林德惠人，硕士，讲师，2004年、2007年于吉林大学分别获得学士、硕士学位，主要从事嵌入式系统、目标识别和目标跟踪方面的研究。 E-mail:kehongchang1981@163.com

孙宏彬(1969—)，男，吉林四平人，博士后，教授， 1991年、1997年于华北电力大学分别获得学士、硕士学位，2007年于东华大学获得博士学位，2010年于吉林大学博士后出站，主要从事智能信息系统、计算机视觉和机器学习方面的研究。 E-mail:Win_shb@163.com

中图分类号: TP391.4
计量
- 文章访问数: 2271
- HTML全文浏览量: 643
- PDF下载量: 734
- 被引次数: 0
出版历程
- 收稿日期: 2015-07-12
- 录用日期: 2015-09-20
- 刊出日期: 2015-01-25

A saliency target area detection method of image sequence

KE Hong-chang^,,
SUN Hong-bin^,

School of Computer Technology and Engineering, Changchun Institute of Technology, Changchun 130012, China

摘要

摘要: 针对传统视觉显著性模型在自顶向下的任务指导和动态信息处理方面的不足,设计并实现了融入运动特征的视觉显著性模型。利用该模型提取了图像的静态特征和动态特征,静态特征的提取在图像的亮度、颜色和方向通道进行,运动特征的提取采用基于多尺度差分的特征提取方法实现,然后各通道分别通过滤波、差分得到显著图,在生成全局显著图时,提出多通道参数估计方法,计算图像感兴趣区域与眼动感兴趣区域的相似度,从而可在图像上准确定位目标位置。针对20组视频图像序列(每组50帧)进行了实验,结果表明:本文算法提取注意焦点即目标区域的平均相似度为0.87,使用本文算法能够根据不同任务情境,选择各特征通道的权重参数,从而可有效提高目标搜索的效率。
- 视觉显著性 /
- 自顶向下 /
- 目标区域检测 /
- 显著图
Abstract: For the lack of top-down task guidance and dynamic information processing of traditional visual saliency model, a visual saliency model fused with the motion features is designed and implemented. The static features and motion features are extracted based on the proposed model. The static features are extracted from the intensity, color and orientation channel of the current frame image. The motion features are extracted based on the multi-scales difference method. The saliency maps of four channels can be obtained by filtering and difference. Based on the proposed model a method of parameter estimation for multi channel is proposed to calculate the similarity between the region of interesting of current image and the region of interesting of eyes movement, then guide to generate the global saliency map, which can provide a calculation mechanism for accurate location on images. 20 groups of video image sequences(50 images per group) are selected for the experiment. Experimental results show that the average similarity of focus of attention is 0.87. The proposed method can more efficiently and accurately locate the region where the searched target may be present and can improve the efficiency of target searching.
- visual saliency /
- top-down /
- target area detection /
- saliency map

HTML全文

图 1 融入运动特征的视觉显著性模型

Figure 1. Visual saliency model fused with motion features

下载: 全尺寸图片幻灯片

图 2 视觉显著性模型多通道显著图

Figure 2. Multi-channel saliency maps of visual saliency model

下载: 全尺寸图片幻灯片

图 3 20组图像的平均相似度比较

Figure 3. Average similarity of 20 group images

下载: 全尺寸图片幻灯片

表 1 多通道参数估计权值平均值的部分实验结果

Table 1. Part results of the avarage values of the multi-channel parameter estimation

下载: 导出CSV