1
|
Barfoot TD, Holmes C, Dümbgen F. Certifiably optimal rotation and pose estimation based on the Cayley map. Int J Rob Res 2025; 44:366-387. [PMID: 40092623 PMCID: PMC11903194 DOI: 10.1177/02783649241269337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 05/31/2024] [Accepted: 06/18/2024] [Indexed: 03/19/2025]
Abstract
We present novel, convex relaxations for rotation and pose estimation problems that can a posteriori guarantee global optimality for practical measurement noise levels. Some such relaxations exist in the literature for specific problem setups that assume the matrix von Mises-Fisher distribution (a.k.a., matrix Langevin distribution or chordal distance) for isotropic rotational uncertainty. However, another common way to represent uncertainty for rotations and poses is to define anisotropic noise in the associated Lie algebra. Starting from a noise model based on the Cayley map, we define our estimation problems, convert them to Quadratically Constrained Quadratic Programs (QCQPs), then relax them to Semidefinite Programs (SDPs), which can be solved using standard interior-point optimization methods; global optimality follows from Lagrangian strong duality. We first show how to carry out basic rotation and pose averaging. We then turn to the more complex problem of trajectory estimation, which involves many pose variables with both individual and inter-pose measurements (or motion priors). Our contribution is to formulate SDP relaxations for all these problems based on the Cayley map (including the identification of redundant constraints) and to show them working in practical settings. We hope our results can add to the catalogue of useful estimation problems whose solutions can be a posteriori guaranteed to be globally optimal.
Collapse
Affiliation(s)
| | - Connor Holmes
- Robotics Institute, University of Toronto, Toronto, ON, Canada
| | | |
Collapse
|
2
|
Wu J, Liu M, Huang Y, Jin C, Wu Y, Yu C. SE(n)++: An Efficient Solution to Multiple Pose Estimation Problems. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:3829-3840. [PMID: 32877345 DOI: 10.1109/tcyb.2020.3015039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
In robotic applications, many pose problems involve solving the homogeneous transformation based on the special Euclidean group SE(n) . However, due to the nonconvexity of SE(n) , many of these solvers treat rotation and translation separately, and the computational efficiency is still unsatisfactory. A new technique called the SE(n)++ is proposed in this article that exploits a novel mapping from SE(n) to SO(n + 1) . The mapping transforms the coupling between rotation and translation into a unified formulation on the Lie group and gives better analytical results and computational performances. Specifically, three major pose problems are considered in this article, that is, the point-cloud registration, the hand-eye calibration, and the SE(n) synchronization. Experimental validations have confirmed the effectiveness of the proposed SE(n)++ method in open datasets.
Collapse
|
3
|
Rosinol A, Violette A, Abate M, Hughes N, Chang Y, Shi J, Gupta A, Carlone L. Kimera: From SLAM to spatial perception with 3D dynamic scene graphs. Int J Rob Res 2021. [DOI: 10.1177/02783649211056674] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Humans are able to form a complex mental model of the environment they move in. This mental model captures geometric and semantic aspects of the scene, describes the environment at multiple levels of abstractions (e.g., objects, rooms, buildings), includes static and dynamic entities and their relations (e.g., a person is in a room at a given time). In contrast, current robots’ internal representations still provide a partial and fragmented understanding of the environment, either in the form of a sparse or dense set of geometric primitives (e.g., points, lines, planes, and voxels), or as a collection of objects. This article attempts to reduce the gap between robot and human perception by introducing a novel representation, a 3D dynamic scene graph (DSG), that seamlessly captures metric and semantic aspects of a dynamic environment. A DSG is a layered graph where nodes represent spatial concepts at different levels of abstraction, and edges represent spatiotemporal relations among nodes. Our second contribution is Kimera, the first fully automatic method to build a DSG from visual–inertial data. Kimera includes accurate algorithms for visual–inertial simultaneous localization and mapping (SLAM), metric–semantic 3D reconstruction, object localization, human pose and shape estimation, and scene parsing. Our third contribution is a comprehensive evaluation of Kimera in real-life datasets and photo-realistic simulations, including a newly released dataset, uHumans2, which simulates a collection of crowded indoor and outdoor scenes. Our evaluation shows that Kimera achieves competitive performance in visual–inertial SLAM, estimates an accurate 3D metric–semantic mesh model in real-time, and builds a DSG of a complex indoor environment with tens of objects and humans in minutes. Our final contribution is to showcase how to use a DSG for real-time hierarchical semantic path-planning. The core modules in Kimera have been released open source.
Collapse
Affiliation(s)
- Antoni Rosinol
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Andrew Violette
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Marcus Abate
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Nathan Hughes
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yun Chang
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jingnan Shi
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Arjun Gupta
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Luca Carlone
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
4
|
Tian Y, Khosoussi K, Rosen DM, How JP. Distributed Certifiably Correct Pose-Graph Optimization. IEEE T ROBOT 2021; 37:2137-2156. [DOI: 10.1109/tro.2021.3072346] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
5
|
|
6
|
Chen Y, Huang S, Zhao L, Dissanayake G. Cramér–Rao Bounds and Optimal Design Metrics for Pose-Graph SLAM. IEEE T ROBOT 2021. [DOI: 10.1109/tro.2020.3001718] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
7
|
Fan T, Wang H, Rubenstein M, Murphey T. CPL-SLAM: Efficient and Certifiably Correct Planar Graph-Based SLAM Using the Complex Number Representation. IEEE T ROBOT 2020. [DOI: 10.1109/tro.2020.3006717] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
8
|
Tian Y, Koppel A, Bedi AS, How JP. Asynchronous and Parallel Distributed Pose Graph Optimization. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2020.3010216] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
9
|
Abstract
Nowadays, Nonlinear Least-Squares embodies the foundation of many Robotics and Computer Vision systems. The research community deeply investigated this topic in the last few years, and this resulted in the development of several open-source solvers to approach constantly increasing classes of problems. In this work, we propose a unified methodology to design and develop efficient Least-Squares Optimization algorithms, focusing on the structures and patterns of each specific domain. Furthermore, we present a novel open-source optimization system that addresses problems transparently with a different structure and designed to be easy to extend. The system is written in modern C++ and runs efficiently on embedded systemsWe validated our approach by conducting comparative experiments on several problems using standard datasets. The results show that our system achieves state-of-the-art performances in all tested scenarios.
Collapse
|
10
|
Kong FH, Zhao J, Zhao L, Huang S. Analysis of Minima for Geodesic and Chordal Cost for a Minimal 2-D Pose-Graph SLAM Problem. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2019.2958492] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
11
|
Aloise I, Grisetti G. Chordal Based Error Function for 3-D Pose-Graph Optimization. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2019.2956456] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
12
|
Giamou M, Ma Z, Peretroukhin V, Kelly J. Certifiably Globally Optimal Extrinsic Calibration From Per-Sensor Egomotion. IEEE Robot Autom Lett 2019. [DOI: 10.1109/lra.2018.2890444] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
13
|
Lajoie PY, Hu S, Beltrame G, Carlone L. Modeling Perceptual Aliasing in SLAM via Discrete–Continuous Graphical Models. IEEE Robot Autom Lett 2019. [DOI: 10.1109/lra.2019.2894852] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
14
|
Han L, Xu L, Bobkov D, Steinbach E, Fang L. Real-Time Global Registration for Globally Consistent RGB-D SLAM. IEEE T ROBOT 2019. [DOI: 10.1109/tro.2018.2882730] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
15
|
Lenac K, Ćesić J, Marković I, Petrović I. Exactly sparse delayed state filter on Lie groups for long-term pose graph SLAM. Int J Rob Res 2018. [DOI: 10.1177/0278364918767756] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In this paper we propose a simultaneous localization and mapping (SLAM) back-end solution called the exactly sparse delayed state filter on Lie groups (LG-ESDSF). We derive LG-ESDSF and demonstrate that it retains all the good characteristics of the classic Euclidean ESDSF, the main advantage being the exact sparsity of the information matrix. The key advantage of LG-ESDSF in comparison with the classic ESDSF lies in the ability to respect the state space geometry by negotiating uncertainties and employing filtering equations directly on Lie groups. We also exploit the special structure of the information matrix in order to allow long-term operation while the robot is moving repeatedly through the same environment. To prove the effectiveness of the proposed SLAM solution, we conducted extensive experiments on two different publicly available datasets, namely the KITTI and EuRoC datasets, using two front-ends: one based on the stereo camera and the other on the 3D LIDAR. We compare LG-ESDSF with the general graph optimization framework ([Formula: see text]) when coupled with the same front-ends. Similarly to [Formula: see text] the proposed LG-ESDSF is front-end agnostic and the comparison demonstrates that our solution can match the accuracy of [Formula: see text], while maintaining faster computation times. Furthermore, the proposed back-end coupled with the stereo camera front-end forms a complete visual SLAM solution dubbed LG-SLAM. Finally, we evaluated LG-SLAM using the online KITTI protocol and at the time of writing it achieved the second best result among the stereo odometry solutions and the best result among the tested SLAM algorithms.
Collapse
Affiliation(s)
- Kruno Lenac
- University of Zagreb Faculty of Electrical Engineering and Computing, Croatia
| | - Josip Ćesić
- University of Zagreb Faculty of Electrical Engineering and Computing, Croatia
| | - Ivan Marković
- University of Zagreb Faculty of Electrical Engineering and Computing, Croatia
| | - Ivan Petrović
- University of Zagreb Faculty of Electrical Engineering and Computing, Croatia
| |
Collapse
|
16
|
Bai F, Vidal-Calleja T, Huang S. Robust Incremental SLAM Under Constrained Optimization Formulation. IEEE Robot Autom Lett 2018. [DOI: 10.1109/lra.2018.2794610] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|