FULL TEXT IN RUSSIAN


Mekhatronika, Avtomatizatsiya, Upravlenie, 2016, vol. 17, no. 3, pp. 187—192
DOI: 10.17587/mau.17.187-192


Structural Detection of Visual Objects for Mobile Robots

N. A. Sergievskiy, dereyly@gmail.com, ELVEES-NeoTek, Zelenograd, Moscow, 124498, Russian Federation, A. A. Kharlamov, kharlamov@analyst.ru, Institute of Higher Nervous Activity and Neurophysiology of the Russian Academy of Sciences (IHNA&N RAS), Moscow, 117485, Russian Federation, Moscow State Linguistics University, Moscow, 119034, Russian Federation


Corresponding author: Kharlamov Aleksandr A., Associate Professor, D.Sc., Institute of Higher Nervous Activity and Neurophysiology of the Russian Academy of Sciences (IHNA&N RAS), Moscow, 117485, Russian Federation, Moscow State Linguistics University, Moscow, 119034, Russian Federation, e-mail: kharlamov@analyst.ru

Received on October 06, 2015
Accepted on October 22, 2015

This paper presents StructDetect, a fast method for object detection. The target detection process consists of two stages: generation of a hypothesis (object proposals) (1) and verification of the hypothesis (2). Generation of the object proposals is carried out by means of a simple structural model on the basis of line segment combining. Line segment is detected by EdLines algorithm. Then a computer attributes the line segments and their pairs and creates "a connection table", which filters some combinations. Further, it creates a triple combination of the line segments filtered by "the connection table". Each combination has a handcraft descriptor based on the line segment attribute. This descriptor is used to learn kNN classifier and generate object proposals in the area of 3 line segments. These proposals define a set of candidate bounding boxes available to the detector. The second module is based on a convolutional neural network, which takes a fixed-length feature vector from each region. The convolution neural network computes once per image and features vector extracts with adaptively-sized pooling from the last convolution layer. Then the feature vectors are classified by the random forest algorithm. Accuracy of this approach is comparable with the accuracy of such modern detector methods as SPPNet and RCNN. StructDetect is 7 times faster than SPPNet and has a frame rate of 4fps on a CPU.

Keywords: object detection, object proposals, computer vision, robot vision, deep learning, random forest


For citation:
Sergievskiy N. A., Kharlamov A. A. Structural Detection of Visual Objects for Mobile Robots, Mekhatronika, Avtomatizatsiya, Upravlenie, 2016, vol. 17, no. 3, pp. 187—192.
DOI: 10.17587/mau/17.187-192

To the contents