DIGCN: A Dynamic Interaction Graph Convolutional Network Based on Learnable Proposals for Object Detection


Pingping Cao
Yanping Zhu
Yuhao Jin
Benkun Ruan
Qiang Niu

Abstract

We propose the Dynamic Interaction Graph Convolutional Network (DIGCN), an image object detection method based on learnable proposals and graph convolutional networks (GCNs). Existing object detection methods usually operate on dense candidates, which produces redundant, near-duplicate predictions and requires non-maximum suppression (NMS) post-processing to remove them, increasing computational complexity. Although existing sparse detectors avoid this cumbersome post-processing, they ignore the potential relationships between objects and proposals, which limits detection accuracy. Therefore, DIGCN includes a dynamic interaction GCN module that performs dynamic interaction and relational modeling on the proposal boxes and proposal features to improve detection accuracy. In addition, we introduce a learnable proposal method that replaces a huge number of hand-designed object candidates with a sparse set of learned object proposals, avoiding complicated tasks such as candidate design and many-to-one label assignment and reducing the complexity of the detection model to a certain extent. DIGCN achieves accuracy and run-time performance on par with well-established, highly optimized detector baselines on the challenging COCO dataset; e.g., with a ResNet-101-FPN backbone, our method attains 46.5 AP while processing 13 frames per second. Our work provides a new method for object detection research.
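
The following is a minimal, illustrative sketch of the two ideas summarized above: a sparse set of learnable proposals and one graph-convolution step that lets proposals interact. All module names, shapes, and the similarity-based adjacency are assumptions made for illustration only, not the authors' implementation.

```python
# Minimal sketch (assumed design, not the paper's code): sparse learnable proposals
# plus one dynamic-interaction graph-convolution step over proposal features.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicInteractionGCNSketch(nn.Module):
    def __init__(self, num_proposals=100, feat_dim=256):
        super().__init__()
        # Sparse, learnable proposals: boxes (cx, cy, w, h) and per-proposal features.
        self.proposal_boxes = nn.Parameter(torch.rand(num_proposals, 4))
        self.proposal_feats = nn.Parameter(torch.randn(num_proposals, feat_dim))
        self.gcn_weight = nn.Linear(feat_dim, feat_dim)  # graph-convolution weights
        self.out_norm = nn.LayerNorm(feat_dim)

    def forward(self, roi_feats):
        # roi_feats: (N, feat_dim) image features pooled at the proposal boxes.
        x = roi_feats + self.proposal_feats              # fuse image and proposal features
        # Dynamic adjacency from pairwise feature similarity (row-wise softmax).
        adj = F.softmax(x @ x.t() / x.shape[-1] ** 0.5, dim=-1)
        # One graph-convolution step: aggregate neighbors, transform, residual + norm.
        x = self.out_norm(x + self.gcn_weight(adj @ x))
        return x                                          # updated proposal features

# Usage: with 100 proposals and 256-d pooled RoI features,
# model = DynamicInteractionGCNSketch(); updated = model(torch.randn(100, 256))
```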
