| Literature DB >> 22319348 |
Luis M Bergasa1, Pablo F Alcantarilla, David Schleicher.
Abstract
In this article, we present a real-time 6DoF egomotion estimation system for indoor environments using a wide-angle stereo camera as the only sensor. The stereo camera is carried in hand by a person walking at normal walking speeds 3-5 km/h. We present the basis for a vision-based system that would assist the navigation of the visually impaired by either providing information about their current position and orientation or guiding them to their destination through different sensing modalities. Our sensor combines two different types of feature parametrization: inverse depth and 3D in order to provide orientation and depth information at the same time. Natural landmarks are extracted from the image and are stored as 3D or inverse depth points, depending on a depth threshold. This depth threshold is used for switching between both parametrizations and it is computed by means of a non-linearity analysis of the stereo sensor. Main steps of our system approach are presented as well as an analysis about the optimal way to calculate the depth threshold. At the moment each landmark is initialized, the normal of the patch surface is computed using the information of the stereo pair. In order to improve long-term tracking, a patch warping is done considering the normal vector information. Some experimental results under indoor environments and conclusions are presented.Entities:
Keywords: 2D warping; extended Kalman filter; inverse depth parametrization; localization; mapping; non-linearity analysis
Year: 2010 PMID: 22319348 PMCID: PMC3274267 DOI: 10.3390/s100404159
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.Stereo vision system for our 6DoF visual SLAM.
Figure 2.Inverse depth point coding.
Figure 3.(a) Absolute and (b) relative depth estimation errors for a stereo rig considering a focal length f = 202 pixels and image size 320 × 240, for different baselines.
Figure 4.Depth and angular non-linearity indexes witch focal length f = 202 pixels and image size 320 × 240. (a) Different stereo rig baselines (b) A zoomed version for our stereo rig configuration B = 15 cm.
Optimal depth thresholds for different stereo baselines and fixed focal length f = 202 and image size (320, 240).
| 15 | 5.71 |
| 20 | 6.69 |
| 30 | 8.35 |
| 40 | 9.81 |
Figure 5.Stereo geometry and locally planar surfaces.
Figure 6.Inverse depth and 3D comparison. (a) Total state vector size, (b) Without inverse depth par. L sequence, (c) with inverse depth par. Z = 5.7 m L sequence, (d) without inverse depth par. loop sequence, (e) with inverse depth par. Z = 10 m loop sequence, (f) with inverse depth par. Z = 5.7 m loop sequence.
Inverse depth and 3D comparison: absolute errors in trajectory and map uncertainty.
| Corridor | Without Inverse Par. | 0.00 | 0.9394 | 0.4217 | 0.1351 |
| Corridor | With Inverse Par., | 5.23 | 0.9259 | 0.4647 | 0.0275 |
| Corridor | With Inverse Par., | 24.32 | 0.7574 | 0.3777 | 0.0072 |
| L | Without Inverse Par. | 0.00 | 0.5047 | 0.3985 | 0.1852 |
| L | With Inverse Par., | 7.85 | 0.5523 | 0.1017 | 0.0245 |
| L | With Inverse Par., | 19.21 | 0.5534 | 0.2135 | 0.0078 |
| Loop | Without Inverse Par. | 0.00 | 0.4066 | 0.9801 | 0.2593 |
| Loop | With Inverse Par., | 5.27 | 0.3829 | 0.63030 | 0.0472 |
| Loop | With Inverse Par., | 12.36 | 0.2191 | 0.3778 | 0.0310 |
Comparison of patch matching techniques: no patch transformation and 2D warping.
| No Patch Transformation | Corridor | 85 | 6,283 | 5,612 | 89.32 |
| 2D Warping | Corridor | 68 | 6,398 | 5,781 | 90.35 |
| No Patch Transformation | L | 116 | 11,627 | 8,922 | 76.73 |
| 2D Warping | L | 105 | 10,297 | 9,119 | 88.71 |
Processing times.
| Feature Initialization (15) | 18.00 |
| Prediction | 0.47 |
| Measurement | 10 |
| Update | 4.96 |