Computational Modeling for Visual Attention Analysis

Thesis event information

Date and time of the thesis defence

Place of the thesis defence

Remote access: https://oulu.zoom.us/j/69133800918?pwd=MEVuVWo2TXVPMjNvRm9mYmNQejlRZz09

Topic of the dissertation

Computational Modeling for Visual Attention Analysis

Doctoral candidate

Master of Science (Technology) Yingyue Xu

Faculty and unit

University of Oulu Graduate School, Faculty of Information Technology and Electrical Engineering, Center for Machine Vision and Signal Analysis (CMVS)

Subject of study

Computer Science and Engineering

Opponent

Ph.D. (Electrical Engineering) Moncef Gabbouj, Tampere University

Custos

Ph.D. (Computer Science) Guoying Zhao, University of Oulu

Add event to calendar

Computational Modeling for Visual Attention Analysis

Visual scenes typically contain massive amounts of content that cannot be processed in a short time due to the limited processing capacity of the human visual system. The term, visual attention, is a biologically inspired and psychologically driven mechanism that works by selecting visually relevant information and filtering out the redundant contents. To duplicate the attention mechanism to facilitate compute vision tasks, computational modeling for visual attention analysis is crucial to suggest the salient regions on visual scenes by the human visual system.

This thesis is a thorough summary of the main subjects around computational modeling for visual attention analysis, consisting of several published papers corresponding to my research progress. First, the data preparation for computational modeling will be introduced, including eye movement data, eye tracking data collection and eye tracking datasets facilitating the evaluation of computational modeling of visual attention. Second, computational models for visual attention analysis, or saliency models, are presented from traditional unsupervised methods to deep saliency models. Third, the subject about saliency integration will be illustrated that unifies multiple saliency maps from the multiple candidate saliency models for better accuracy.

The contributions of this study are three folds. Firstly, we collect a task-driven eye tracking dataset for visual attention analysis. Secondly, we propose three saliency models for in-depth investigation in modeling visual attention, including an unsupervised model using the bi-directional propagation method, a Convolutional Neural Networks based model by connecting the Dense Conditional Random Fields for multi-scale saliency refinement, and a Convolutional Neural Networks based model with cascade Conditional Random Fields for joint model training. Thirdly, we propose a saliency integration method and conduct comprehensive experiments and analysis on the topic. Finally, we summarize the contributions of the work and propose the potential applications of saliency models and the extended saliency related topics to boost applications of saliency approaches on other computer vision topics.
Last updated: 1.3.2023