• CN: 11-2187/TH
  • ISSN: 0577-6686

Journal of Mechanical Engineering ›› 2019, Vol. 55 ›› Issue (11): 123-130.doi: 10.3901/JME.2019.11.123

Previous Articles     Next Articles

Visual Attentional Network and Learning Method for Object Search and Recognition

LÜ Jie, LUO Fangying, YUAN Zejian   

  1. School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049
  • Received:2018-06-20 Revised:2019-03-19 Online:2019-06-05 Published:2019-06-05

Abstract: A recurrent visual network is proposed to search and recognize an object simultaneously. The network can automatically select a sequence of local observations, and accurately localize and recognize objects by fusing those local detail appearance and rough context visual information. The method is more efficient than other methods with sliding windows or convolution on a whole image. Besides, a hybrid loss function is proposed to learn parameters of the multi-task network end-to-end. Especially, The combination of stochastic and object-awareness strategy is imported into visual fixation loss, which is beneficial to mine more abundant context and ensure fixation point close to object as fast as possible. A real-world dataset is built to verify the capacity of the method in searching and recognizing the object of interest including those small ones. Experiments illustrate that the method can predict an accurate bounding box for a visual object, and achieve higher searching speed. The source code will be opened to verify and analyze the method.

Key words: attentional model, fixation strategy, object detection, reinforcement learning

CLC Number: