| Literature DB >> 34063490 |
Danijela Ristić-Durrant1, Marten Franke1, Kai Michels1.
Abstract
This paper provides a review of the literature on vision-based on-board obstacle detection and distance estimation in railways. Environment perception is crucial for autonomous detection of obstacles in a vehicle's surroundings. The use of on-board sensors for road vehicles for this purpose is well established, and advances in Artificial Intelligence and sensing technologies have motivated significant research and development in obstacle detection in the automotive field. However, research and development on obstacle detection in railways has been less extensive. To the best of our knowledge, this is the first comprehensive review of on-board obstacle detection methods for railway applications. This paper reviews currently used sensors, with particular focus on vision sensors due to their dominant use in the field. It then discusses and categorizes the methods based on vision sensors into methods based on traditional Computer Vision and methods based on Artificial Intelligence.Entities:
Keywords: AI-based vision; autonomous obstacle detection; on-board vision sensors; railways; traditional computer vision
Year: 2021 PMID: 34063490 PMCID: PMC8156009 DOI: 10.3390/s21103452
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Summary of the papers describing traditional CV methods.
| Paper | Type of Vision System | Rail Track Detection | Obstacle Detection | Distance Estimation | Evaluation Tests—Comments |
|---|---|---|---|---|---|
|
Fel et al., 2018 [ | Stereo-vision system consisting of 3 RGB cameras | ✓ | ✓ | Short-range obstacle detection using stereo cameras with a small stereo baseline (0.25 m) and long range depth estimation with a larger baseline (0.8 m) | The evaluation from dedicated automatic detection analysis tools and the drivers’ feedbacks over three years and thousands of accumulated kilometres |
| Fioretti et al., 2018 [ | Monocular RGB on-board cameras | ✓ | ✓ | 3D reconstruction of rails and of objects such as kilometres’ signs; distance (z-coordinate) range up to 200 m | Evaluation on image data of a single track railway recorded for 8 min and 20 s for 9 km at train speed of about 72 km/h. |
| Fonseca Rodriguez et al., 2012 [ | — | ✓ | ✓ | — | Evaluation on several modified videos from the Internet; digital objects artificially added to video frames |
| Gavrilova et al., 2018 [ | Standard RGB on-board camera | ✓ | ✓ | Using the image coordinates of the last point on detected obstacle-free rail tracks as wagon point coordinates, to calculate actual distance to wagon; up to 100 m | Evaluation on real-world images; 100 images with the object (wagon) on the rail tracks and 300 images without the object |
| Gebauer et al., 2012 [ | Stereo-cameras + thermal camera | ✓ | ✓ | Using stereo-vision and laser scanner; up to 80 m | Evaluation on real-world images recorded on 15 km long railway in Austria on which a prototype train operates |
| Gschwandtner et al., 2010 [ | Grayscale camera | ✓ | — | — | Evaluation on a dataset acquired in winter on the track length of approx. 15 km and the train driving at 40 km/h. |
| Kaleli et al., 2009 [ | Standard RGB camera | ✓ | — | — | Evaluation on several video sequences taken from the on-board camera while the train was moving at high speeds |
| Kudinov et al., 2020 [ | Grayscale camera | ✓ | ✓ | Short-range distances to wagons up to 50 m (absolute error no more than 1.2 m) | Evaluation on real-world images of the shunting track of the Ryazan-1 railway station recorded in cloudy weather |
|
Maire et al., 2010 [ | Grayscale camera | ✓ | — | Detection of obstacle-free rail tracks zone of about 100 m but no individual object distance estimation | Evaluation on real-world images of rather poor quality |
| Möckel et al., 2003 [ | Multi-focal camera system, one short-range and two long-range cameras | ✓ | ✓ | Using LiDAR with an up to 400 m look-ahead range but no individual object distance estimation | Evaluation on images from real-world field tests on a public rail track near Munich used as test track; simulated obstacles |
| Mukojima et al., 2016 [ | Standard RGB camera | — | ✓ | — | Evaluation on train frontal view images captured on a test line in the premises of the Railway Technical Research Institute, Japan. |
| Nakasone et al., 2017 [ | Standard RGB camera | — | ✓ | The farthest obstacle at 235 m as the test track was straight at a length of about 250 m leading to the obstacle | Images from four train runs without obstacles used as reference. Evaluation on 5000 frames with obstacles captured in 17 train runs |
| Nassu et al., 2011 [ | Standard RGB camera | ✓ | — | Rail extraction in the long distance was performed in addition to rail extraction in the short distance; however, the long distance range was not defined | Three sets of videos captured in real operating conditions in several Japanese railways. The 1st set has 10 videos, with 3549 frames, the 2nd has 27 videos, with 14,474 frames and the 3rd has 12 videos, with 5879 frames with rails are mostly visible in the long distance |
| Nassu et al., 2012 [ | RGB camera with zoom lens mounted on a pan–tilt unit | ✓ | — | Different rail exraction methods for short distances and long distances; no specification on distance range | Evaluation on 12 video sequences captured under real operating conditions in several Japanese railways, with 459,733 frames (4:15 h of recording). |
|
Qi et al., 2013 [ | RGB camera | ✓ | — | — | Evaluation on six real-world video datasets recorded in different illumination. The test data were collected in Hefei City, Anhui Province of China. The length of the rails approx. 10 km. The train speed of about 60 km/h. |
| Ross 2010 [ | Grayscale camera | ✓ | — | — | Evaluation on on-board camera images taken near Karlsruhe, Germany. Dataset included images of single track situations and of turnout situations |
| Saika et al., 2016 [ | Standard RGB camera | — | ✓ | — | Evaluation on a train driver’s view video, of scene of train approaching a platform, available on the Internet. The test video sequences of 12 frames |
| Selver et al., 2016 [ | — | ✓ | — | — | Evaluation data set was a collection of publicly available in Internet cabin view videos (different weather conditions); 389 frames |
| Ukai, 2004 [ | Ultra telephoto lens camera | ✓ | ✓ | Used camera allows for monitoring the rail tracks ahead up 600 m but no individual object distance estimation | The image sequence of 557 frames (18.56 s) recorded from a train was collected for evaluation |
| Uribe et al., 2012 [ | — | ✓ | ✓ | — | Evaluation on images from Internet with artificially added obstacles |
| Vazquez et al., 2004 [ | RGB camera | — | ✓ | — | Evaluation on real images captured in railway environment and in varied illumination and weather conditions (some images in very adverse conditions as images with fog) |
|
Wang et al., 2017 [ | Standard RGB on-board camera | ✓ | ✓ | — | Evaluation dataset is an open source video captured by on-board camera from Malmo to Angelholm, Sweden; videos were modified adding digital obstacles of different nature and shape |
| Wang et al., 2015 [ | Grayscale camera | ✓ | — | — | Test images were taken at Fengtai west railway station freight yard, Beijing |
| Wang et al., 2016 [ | Grayscale camera | ✓ | — | — | Test images were taken at Fengtai west railway station freight yard, Beijing |
| Weichselbaum et al., 2013 [ | Stereo-vision | ✓ | ✓ | Obstacle distance from 10 m up to 80 m ahead | Evaluation on two representative real-world test sequences with various numbers of frames, situations and obstacles |
| Wohlfeil, 2011 [ | Grayscale camera | ✓ | — | — | Evaluation test images from six different test rides in three different places at German railway tests sites; various challenging lightning conditions |
| Yamashita et al., 1996 [ | Mono thermal camera with a telephoto lens | ✓ | ✓ | Using laser range finder, up to 1 km | Real-world tests; the system on-board test train of the East Japan Railway |
Summary of the papers describing AI-based methods.
| Paper | Type of Vision System | Rail Track Detection | Obstacle Detection | Distance Estimation | Evaluation Tests—Comments |
|---|---|---|---|---|---|
|
Haseeb et al., 2018 [ | RGB cameras + thermal camera + night vision camera | — | ✓ | Feedforward neural network (NN)-based distance estimation. NN estimates object distance based on the features of the object Bounding Box (BB) extracted by the DL-based object detector. Mid-range (80–200 m) and long-range (up to 1000 m) distance estimation | Real-world tests; on-board multi-camera system mounted on the operational locomotive owned by Serbia Cargo and running on Serbian part of pan-European Corridor X in length of 120 km in different illumination and weather conditions |
| Wang et al., 2019 [ | Standard RGB on-board camera | ✓ | — | — | Evaluation on 300 images recorded in operational environment of low-speed autonomous trains in China |
| Wang et al., 2018 [ | Standard RGB on-board camera | ✓ | — | — | Evaluation on 1123 images recorded on the Beijing metro Yanfang line and Shanghai metro line 6 including tunnels and open lines |
| Wedberg, 2017 [ | Thermal on-board camera | — | ✓ | Detecting rails at long-range; no details of distance range | The evaluation dataset consisted of on-board thermal infrared video sequences. The data were collected in northern Sweden in April 2015 |
| Xu et al., 2019 [ | Standard RGB HD camera | — | ✓ | — | Evaluation on 1277 images recorded on the Hongkong MTR Tsuen Wan Line, Beijing Metro Yanfang line and Shanghai metro line 6 in different conditions, in daytime and night, sunny and rainy days |
| Ye et al., 2018 [ | Standard RGB camera | ✓ | ✓ | Using millimeter-wave radar; no details on object distance measurement | Evaluation of custom dataset recorded in real-world railway shunting scenarios |
|
Ye et al., 2020 [ | Standard RGB camera | ✓ | ✓ | Using millimeter-wave radar; no details on object distance measurement | Evaluation of custom dataset recorded in real-world railway shunting scenarios |
| Ye et al., 2020 [ | Standard RGB camera and a near infrared laser, supplementing the illumination of the camera | ✓ | ✓ | — | Evaluation of custom dataset recorded in real-world railway shunting scenarios |
| Yu et al., 2018 [ | — | — | ✓ | — | Evaluation on 5000 images of outdoor railway scenes, which were collected from the Internet |
Summary of the papers describing hybrid systems (traditional CV/AI-based methods).
| Paper | Type of Vision System | Rail Track Detection | Obstacle Detection | Distance Estimation | Evaluation Tests—Comments |
|---|---|---|---|---|---|
|
Chernov et al., 2020 [ | Stereo RGB cameras | ✓ | ✓ | Stereo-vision system enables determining the safe distance from the yard locomotive to the car coupling device; no details on distance estimation | Evaluation on real-world test images recorded by stereo cameras mounted on yard locomotives |
| Kapoor et al., 2020 [ | Thermal camera | ✓ | ✓ | Used cameras range up to 1500 m; Detection of objects within the rail tracks portions visible in the image but no individual object distance estimation | Evaluation on 749 images recorded with on-board thermal camera; no details of evaluation tests |
| Selver et al., 2017 [ | Standard RGB camera | ✓ | — | — | Evaluation on 2185 manually delineated frames. These are obtained from 29 left and 24 right turns belonging acquired during common public journeys. |
Figure 1Using of rail tracks geometry for the purpose of rail tracks detection. Projective transformation of the on-board camera image (left) into a bird’s-eye view image of the rail tracks (right), extracted from [33].
Figure 2Detection of objects in the ROI—region of detected rail tracks. (left) Windows-based systematic search for objects along the detected rail tracks ([17,41]); (right) digitally added stationary object and its detection ([17,41]).
Figure 3Detection of objects using background subtraction (extracted from [42]). (Top left) Current frame with an object; (Top right) reference frame without object; (Bottom left) difference in color between current and reference frame; (Bottom right) detected object using segmentation of difference image.
Figure 4Image annotations for AI-based rail tracks detection. (Top) Pixel level annotation for instance segmentation: original RGB image left, annotated image right ([18]); (Bottom) key points annotation for rail track key points detection: original Thermal image left, annotated image right ([53]).
Figure 5Custom made railway datasets for AI-based obstacle detection. (Top) Example images from a dataset generated using images that are publicly available on the Internet (extracted from [54]); (Bottom) example images from a dataset generated using real-world images from shunting environment (extracted from [13]).
Figure 6DisNet estimation of distances to objects in a rail track scene from the RGB camera image (extracted from [61]).