重庆理工大学学报(自然科学) ›› 2023, Vol. 37 ›› Issue (1): 19-25.

• “复杂环境智能汽车感知与控制”专栏 • 上一篇    下一篇

复杂交通环境下基于关键目标的视觉 SLAM

连 静,皮家豪,李琳辉   

  1. 1.大连理工大学 汽车工程学院,辽宁 大连 116024; 2.大连理工大学 工业装备结构分析国家重点实验室,辽宁 大连 11602
  • 出版日期:2023-02-16 发布日期:2023-02-16
  • 作者简介:连静,女,博士,副教授,主要从事车辆智能化研究,Email:lianjing@dlut.edu.cn;通讯作者 李琳辉,男,博士,副 教授,主要从事新能源汽车智能化技术研究,Email:lilinhui@dlut.edu.cn

Visual SLAM based on key targets in a complex traffic environment

  • Online:2023-02-16 Published:2023-02-16

摘要: 为解决当前视觉 SLAM(simultaneouslocalizationandmapping,同时定位及地图构 建)算法在近处纹理稀缺、动态物体遮挡等复杂交通环境下出现的定位失效的问题,提出一种基 于关键目标的视觉 SLAM算法。首先,以典型交通场景环境感知算法所检测的交通信号、标志 等静止目标为基础,在静止目标中进行特征提取并筛选关键目标。其次,通过关键目标的类别 和几何参数完成相连帧之间关键目标的匹配。然后,基于关键目标进行 SLAM系统的初始化和 跟踪,并通过最小化重投影误差求解当前相机位姿。最后,在局部建图线程中对相机位姿和关 键目标三维坐标联合优化,并在局部地图中更新。经实验验证,所提算法能有效解决近处纹理 缺失环境下的定位失效问题,保持了较高的定位精度,具有良好的环境适应性。

关键词: 视觉 SLAM, 近处纹理稀缺, 定位失效, 关键目标

Abstract:

Visual simultaneous localization and mapping(SLAM) plays an important role in autonomous driving. However, it is difficult for the current visual SLAM algorithm to solve those complex traffic scenes where few nearby textures but a large amount of dynamic object occlusion such as moving cars exists. Therefore, this paper proposes a visual SLAM algorithm based on key objects. The algorithm consists of five parts—key object extraction, system initialization, tracking based on key targets, pose solution and local bundle adjustment(BA).

First of all, based on stationary targets detected by the environment perception algorithm in typical traffic scenes such as traffic signals and signs, this paper performs feature extraction in stationary targets and uses the quad tree to homogenize the feature points. According to the number of feature points, the key targets are screened and the process of key object extraction is finished.

Secondly, the key target matching between the two continuous frames is performed in order to help system initialization. The matching criteria consist of the center, the height and the width of the key targets. System initialization is launched when the number of the matched key targets satisfies the need. RANSAC algorithm is used in feature point selection to compute the initial pose. A local map and map points are established at the end of the system initialization.

Then, the constant motion model is applied to predict the center point of the key targets in the following frame. Besides the height and the width of the key targets, the number of the matched points is also taken into account to decide which key target in the current frame matches the corresponding one best.

Furthermore, after data association is completed, the camera pose is estimated based on the feature points and the center point of each key target. Reprojection error is chosen to optimize the pose and distinguish the inliers and outliers in the process of optimization iteration.

Lastly, local BA is utilized in local mapping thread in order to reduce accumulative error. Levenberg-marquardt(LM) algorithm is used as the optimization algorithm and the optimization variables include the pose of the key frame, the center point of each key target and the 3D coordination of feature points in key targets.

To verify the performance of the proposed algorithm, the typical scenes in ApolloScape dataset are chosen. Experiments show that the proposed algorithm can effectively solve the problem of localization failure in some near-texture missing environments, and maintain a high localization accuracy, which has a good environmental adaptability.

中图分类号: 

  • U461.91