Please use this identifier to cite or link to this item: http://dx.doi.org/10.18419/opus-5155
|Authors:||Dridi, Mohamed H.|
|Title:||Tracking individual targets in high density crowd scenes analysis of a video recording in Hajj 2009|
|metadata.ubs.publikation.source:||Current urban studies 3 (2015), S. 35-53. URL http://dx.doi.org./10.4236/cus.2015.31005|
|Abstract:||In this paper we present a number of methods (manual, semi-automatic and automatic) for tracking individual targets in high density crowd scenes where thousand of people are gathered. The necessary data about the motion of individuals and a lot of other physical information can be extracted from consecutive image sequences in different ways, including optical flow and block motion estimation. One of the famous methods for tracking moving objects is the block matching method. This way to estimate subject motion requires the specification of a comparison window which determines the scale of the estimate. In this work we present a real-time method for pedestrian recognition and tracking in sequences of high resolution images obtained by a stationary (high definition) camera located in different places on the Haram mosque in Mecca. The objective is to estimate pedestrian velocities as a function of the local density.The resulting data of tracking moving pedestrians based on video sequences are presented in the following section. Through the evaluated system the spatio-temporal coordinates of each pedestrian during the Tawaf ritual are established. The pilgrim velocities as function of the local densities in the Mataf area (Haram Mosque Mecca) are illustrated and very precisely documented. Tracking in such places where pedestrian density reaches 7 to 8 Persons/m2 is extremely challenging due to the small number of pixels on the target, appearance ambiguity resulting from the dense packing, and severe inter-object occlusions. The tracking method which is outlined in this paper overcomes these challenges by using a virtual camera which is matched in position, rotation and focal length to the original camera in such a way that the features of the 3D-model match the feature position of the filmed mosque. In this model an individual feature has to be identified by eye, where contrast is a criterion. We do know that the pilgrims walk on a plane, and after matching the camera we also have the height of the plane in 3D-space from our 3D-model. A point object is placed at the position of a selected pedestrian. During the animation we set multiple animation-keys (approximately every 25 to 50 frames which equals 1 to 2 seconds) for the position, such that the position of the point and the pedestrian overlay nearly at every time. By combining all these variables with the available appearance information, we are able to track individual targets in high density crowds.|
|Appears in Collections:||08 Fakultät Mathematik und Physik|
Items in OPUS are protected by copyright, with all rights reserved, unless otherwise indicated.