| Literature DB >> 23300930 |
Florian Raudies1, Heiko Neumann.
Abstract
The analysis of motion crowds is concerned with the detection of potential hazards for individuals of the crowd. Existing methods analyze the statistics of pixel motion to classify non-dangerous or dangerous behavior, to detect outlier motions, or to estimate the mean throughput of people for an image region. We suggest a biologically inspired model for the analysis of motion crowds that extracts motion features indicative for potential dangers in crowd behavior. Our model consists of stages for motion detection, integration, and pattern detection that model functions of the primate primary visual cortex area (V1), the middle temporal area (MT), and the medial superior temporal area (MST), respectively. This model allows for the processing of motion transparency, the appearance of multiple motions in the same visual region, in addition to processing opaque motion. We suggest that motion transparency helps to identify "danger zones" in motion crowds. For instance, motion transparency occurs in small exit passages during evacuation. However, motion transparency occurs also for non-dangerous crowd behavior when people move in opposite directions organized into separate lanes. Our analysis suggests: The combination of motion transparency and a slow motion speed can be used for labeling of candidate regions that contain dangerous behavior. In addition, locally detected decelerations or negative speed gradients of motions are a precursor of danger in crowd behavior as are globally detected motion patterns that show a contraction toward a single point. In sum, motion transparency, image speeds, motion patterns, and speed gradients extracted from visual motion in videos are important features to describe the behavioral state of a motion crowd.Entities:
Mesh:
Year: 2012 PMID: 23300930 PMCID: PMC3534068 DOI: 10.1371/journal.pone.0053456
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Depicts our biologically inspired proposal of visual motion processing that models the functions of primary visual area (V1), middle temporal area (MT), and medial superior temporal area (MST) as well as their interaction, indicated by an arrow.
We consider cells from the dorsal part of area MST, referred to by MSTd, which respond to motion patterns. The three gray boxes show details of these functions. The box to the lower-left shows V1 detecting initial motions from Gabor filtering results at multiple spatial scales. The box to the upper-left shows the motion integration mechanisms of MT that encodes motion in neurons selective to a combination of motion speed and direction (velocity). This velocity space representation is displayed here in a Cartesian coordinate frame for convenience. A center-surround interaction is applied in these codomains of speed and direction. The box to the upper-right shows the globally defined motion patterns that extend over a large region of the visual field and that are detected in model area MSTd receiving input from MT.
Parameter values for initial motion detection and three-stage-processing cascades for signal integration in model areas V1, MT, and MSTd.
| Description | Value | Eq. |
| Detection of initial motion | ||
| Orientations |
| 1 |
| Rings of the Gabor bank |
| 1 |
| Overlap factors |
| 1 |
| Radial standard deviation |
| 1 |
| Tangent standard deviation |
| 1 |
| Speed to wavelength factor |
| 1 |
| Motion directions |
| 3 |
| Motion speeds |
| 3 |
| Three-stage processing cascade of | ||
| Nonlinearity α | 2 | 6a |
|
| Motion speed: | 6a |
| Boundary conditions | Motion speed: Neumann. Motion direction: Circular | 6a |
|
| 100 | 6b |
| Normalization AV1 | 0.01 | 6c |
| Normalization BV1 | 100/112 | 6c |
| Three-stage processing cascade of | ||
|
|
| 7a |
|
| 5 | 7a |
| Nonlinearity α | 2 | 7a |
|
| Same as in V1 | 7a |
|
| Motion speed: Dirac pulse. No kernel is applied. Motion direction: | 7c |
|
| Motion speed: | 7c |
| Boundary conditions | Motion speed: Neumann; Motion direction: Circular | 7c |
| Normalization AMT | 0.01 | 7c |
| Normalization BMT | 10 | 7c |
| Three-stage processing cascade of | ||
| Positions of pattern (u, v) | □ {(0%, 0%), (25%, 0%), (50%, 0%), (75%, 0%), (100%, 0%), (0%, 50%), (25%, 50%), (50%, 50%), (75%, 50%), (100%, 50%), (0%, 100%), (25%, 100%), (50%, 100%), (75%, 100%), (100%,100%)} | 9a |
| Parameter for pattern δ | □ {0°, 45°, 90°, 135°, 180°, 225°, 270°, 315°} | 9a |
| Nonlinearity α | 2 | 9a |
| Normalization AMST | 0.01 | 9c |
| Normalization BMST | 10 | 9c |
| Pattern numbers Nu, Nv | Nu = 5, Nv = 3 | 9c |
The length specifies the size of the support for the filters.
Figure 2Shows motion transparency for the scenario of flowing a) and not for the scenario of congested crowd motion b).
These two scenarios were simulated using the social force model [7] where pedestrians are modeled as dots and viewed from above (birds-eye view). Panels c) and d) show image frames of the generated videos for a flowing and congested crowd, respectively. The detected motion for these frames is shown in e) and f). A legend at the bottom of the figure denotes the gray-value encoding of motion.
Figure 3Provides examples of motion transparency detected in image regions of videos showing a crossing (1st row), cheerleader dance (2nd row), or sidewalk in London (3rd row).
The first two examples a) and d) have a bimodal distribution of motion directions which indicates motion transparency for the zoomed-in areas in image frames of b) and e). Panels c) and f) show a pixel-wise labeling of ‘no-motion’, ‘single-motion’, and ‘multiple-motions’ for these two examples. The third example, the sidewalk in London, shows a single motion indicated by the unimodal distribution of motion directions in g) pooling over the zoom-in region of the image shown in h). The pixel-wise assignment gives i).
Figure 4Shows the detected motion for a crosswalk of varying people density [41].
a) Shows the scenario and motions for a low density. b) A single image frame from the video. c) The detected motion shows large regions with motion transparency. d) Globally detected motion patterns indicate a motion pattern of EXP and CON. e) As the density increases people radially stream inward indicated by the arrows. f) An image frame for increased density. g) The regions that contain transparency are reduced. h) A strong motion pattern of CON is present. i) For a further increase of density the motion pattern changes: The central part shows a spiral motion, that of joined and exiting motions from four sides – like in a rotary. j) An image of the video for an even further increase in density. k) Fewer parts exhibit motion transparency. l) Patterns of CON and CON&CW are active. The latter pattern captures some of the rotational inward, spiral motion in the central part of the image. All motion patterns refer to the center of the image plane.
Figure 5Shows the analysis of a simulated evacuation scenario.
a) For low density two 90° turns in the main motion direction appear. b) An image frame of the video displaying the evacuation. c) Velocity gradients represent local motion patterns of EXP, CCW, CON, CW, and combinations thereof. Boundaries between motion and no motion appear mainly as local CCW and CW and CON is detected locally for the people streams which indicates their slow down or negative speed gradient (deceleration). d) Globally detected motion patterns show a weak activation of the CON pattern that is centered in the image plane. e) Motion transparency appears at the narrow passages and 90° turns. f) At the same regions the image speed is slow. g) A combination of transparency and slow speeds shows “danger zones”. h) At high density the lower room is completely filled with people. i) An image frame of the video with higher people density. j) Local motion patterns of CW and CCW appear at boundaries of motions and the pattern of CON is now spread over the entire area that corresponds to the lower room. k) This increased area that has an CON velocity gradient yields an increased activation of the global CON motion pattern compared to the activation of the CON pattern for low density in d). l) Again, motion transparency appears at the narrow passages. m) Motion speeds are slow at the passages. n) “Danger zones” for the high density scenario.
shows examples of coordinated and uncoordinated motion for dangerous and non-dangerous crowd behavior.
| Non-dangerous | Dangerous | |
|
| Coherent single motion or coherent multiplemotions ( | Incoherent single motion or coherent multiple motions with slow speed ( |
|
| Coherent multiple motions ( | No motion of people ( |