EBookClubs

Read Books & Download eBooks Full Online

EBookClubs

Read Books & Download eBooks Full Online

Book Geometrical and Contextual Scene Analysis for Object Detection and Tracking in Intelligent Vehicles

Download or read book Geometrical and Contextual Scene Analysis for Object Detection and Tracking in Intelligent Vehicles written by Bihao Wang and published by . This book was released on 2015 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: For autonomous or semi-autonomous intelligent vehicles, perception constitutes the first fundamental task to be performed before decision and action/control. Through the analysis of video, Lidar and radar data, it provides a specific representation of the environment and of its state, by extracting key properties from sensor data with time integration of sensor information. Compared to other perception modalities such as GPS, inertial or range sensors (Lidar, radar, ultrasonic), the cameras offer the greatest amount of information. Thanks to their versatility, cameras allow intelligent systems to achieve both high-level contextual and low-level geometrical information about the observed scene, and this is at high speed and low cost. Furthermore, the passive sensing technology of cameras enables low energy consumption and facilitates small size system integration. The use of cameras is however, not trivial and poses a number of theoretical issues related to how this sensor perceives its environmen. In this thesis, we propose a vision-only system for moving object detection. Indeed,within natural and constrained environments observed by an intelligent vehicle, moving objects represent high risk collision obstacles, and have to be handled robustly. We approach the problem of detecting moving objects by first extracting the local contextusing a color-based road segmentation. After transforming the color image into illuminant invariant image, shadows as well as their negative influence on the detection process can be removed. Hence, according to the feature automatically selected onthe road, a region of interest (ROI), where the moving objects can appear with a high collision risk, is extracted. Within this area, the moving pixels are then identified usin ga plane+parallax approach. To this end, the potential moving and parallax pixels a redetected using a background subtraction method; then three different geometrical constraints : the epipolar constraint, the structural consistency constraint and the trifocaltensor are applied to such potential pixels to filter out parallax ones. Likelihood equations are also introduced to combine the constraints in a complementary and effectiveway. When stereo vision is available, the road segmentation and on-road obstacles detection can be refined by means of the disparity map with geometrical cues. Moreover, in this case, a robust tracking algorithm combining image and depth information has been proposed. If one of the two cameras fails, the system can therefore come back to a monocular operation mode, which is an important feature for perception system reliability and integrity. The different proposed algorithms have been tested on public images data set with anevaluation against state-of-the-art approaches and ground-truth data. The obtained results are promising and show that the proposed methods are effective and robust on the different traffic scenarios and can achieve reliable detections in ambiguous situations.

Book Contributions Des Syst  mes de Vision    la Localisation Et Au Suivi D objets Par Fusion Multi capteur Pour Les V  hicules Intelligents

Download or read book Contributions Des Syst mes de Vision la Localisation Et Au Suivi D objets Par Fusion Multi capteur Pour Les V hicules Intelligents written by Sergio Alberto Rodriguez Florez and published by . This book was released on 2010 with total page 170 pages. Available in PDF, EPUB and Kindle. Book excerpt: Advanced Driver Assistance Systems (ADAS) can improve road safety by supporting the driver through warnings in hazardous circumstances or triggering appropriate actions when facing imminent collision situations (e.g. airbags, emergency brake systems, etc). In this context, the knowledge of the location and the speed of the surrounding mobile objects constitute a key information. Consequently, in this work, we focus on object detection, localization and tracking in dynamic scenes. Noticing the increasing presence of embedded multi-camera systems on vehicles and recognizing the effectiveness of lidar automotive systems to detect obstacles, we investigate stereo vision systems contributions to multi-modal perception of the environment geometry. In order to fuse geometrical information between lidar and vision system, we propose a calibration process which determines the extrinsic parameters between the exteroceptive sensors and quantifies the uncertainties of this estimation. We present a real-time visual odometry method which estimates the vehicle ego-motion and simplifies dynamic object motion analysis. Then, the integrity of the lidar-based object detection and tracking is increased by the means of a visual confirmation method that exploits stereo-vision 3D dense reconstruction in focused areas. Finally, a complete full scale automotive system integrating the considered perception modalities was implemented and tested experimentally in open road situations with an experimental car.

Book Visual Object Tracking with Deep Neural Networks

Download or read book Visual Object Tracking with Deep Neural Networks written by Pier Luigi Mazzeo and published by BoD – Books on Demand. This book was released on 2019-12-18 with total page 208 pages. Available in PDF, EPUB and Kindle. Book excerpt: Visual object tracking (VOT) and face recognition (FR) are essential tasks in computer vision with various real-world applications including human-computer interaction, autonomous vehicles, robotics, motion-based recognition, video indexing, surveillance and security. This book presents the state-of-the-art and new algorithms, methods, and systems of these research fields by using deep learning. It is organized into nine chapters across three sections. Section I discusses object detection and tracking ideas and algorithms; Section II examines applications based on re-identification challenges; and Section III presents applications based on FR research.

Book Representations and Techniques for 3D Object Recognition and Scene Interpretation

Download or read book Representations and Techniques for 3D Object Recognition and Scene Interpretation written by Derek Hoiem and published by Morgan & Claypool Publishers. This book was released on 2011 with total page 172 pages. Available in PDF, EPUB and Kindle. Book excerpt: One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Book Object Detection in Unstructured 3D Data Sets Using Explicit Semantics

Download or read book Object Detection in Unstructured 3D Data Sets Using Explicit Semantics written by Jean-Jacques Ponciano and published by . This book was released on 2019 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: With the evolution of technologies and robotics, the possibilities offered by 3D acquisition systems have increased. Nowadays, these systems are used in different domains as for autonomous vehicles, rescue robots, cultural heritage, for example. These application fields often require to perform object recognition from acquired data. Therefore, various methodologies have been investigated to automatically process 3D point cloud data in order to detect contained objects. The best methodologiesdepend on the context, that means they are specific to the data to be processed and the objects to be recognized. They produce efficient recognition, which is essential whatever the application field. However, adapting methodologies to a particular application field or use case limits the flexibility to extend the use of a method to other fields. These observations highlight the importance of developing object recognition methodologies specific to a detection context, but also the limitation of existing methods to preserve their capacity within changing detection contexts. An excellent example of a high degree of flexibility to changing contexts is human intelligence and human's ability to design ad hoc methodologies. Humans can analyze the context according to their knowledge and combine different characteristics or strategies according to the objective to be achieved. It would, therefore, be helpful for Computer Vision tools to integrate elements of artificial intelligence, allowing to adapt to the context of an application fields and to guide the detection process in this respect. This Ph.D. thesis presents a knowledge-based approach for object recognition that can be used whatever the application field. Its architecture is based on semantic technologies to allow a knowledge management module to guide the objects detection process through a step by step procedure performing the selection, parameterization, and execution of algorithms. The detection process is performed thanks to an artificial intelligence approach that uses explicit knowledge to design a context-dependent object recognition solution. Its strength is its adaptability to the context, but also its capability to analyze and understand a scene and contained objects and the specificities of the data to be processed. This understanding capability is realized through a self-learning process able to define and validate hypotheses concerning the context, also enabling to enrich the knowledge base and to improve the objects recognition process. The efficiency of this adaptation capability will be demonstrated in four use cases from different application fields. The first use case is an indoor of a building. It is used for a monitoring purpose. The second use case is located in the field of Archaeology represented by ancient ruins containing a terrace house with a watermill. The third use case is an outdoor representing a part of the city of Freiburg in Germany. It is used for an industrial purpose. Finally, the last use case is an indoor acquired by Microsoft's Kinect. It is used for a robotic purpose.

Book Development of a Vision based Object Detection and Recognition System for Intelligent Vehicle

Download or read book Development of a Vision based Object Detection and Recognition System for Intelligent Vehicle written by Xianghong (Henry). Liu and published by . This book was released on 2000 with total page 154 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book Object Detection and Tracking for Intelligent Vehicle Systems

Download or read book Object Detection and Tracking for Intelligent Vehicle Systems written by Xiaochao Yao and published by . This book was released on 2006 with total page 192 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book Strong Geometric Context for Scene Understanding

Download or read book Strong Geometric Context for Scene Understanding written by Raul Diaz Garcia and published by . This book was released on 2016 with total page 104 pages. Available in PDF, EPUB and Kindle. Book excerpt: Humans are able to recognize objects in a scene almost effortlessly. Our visual system can easily handle ambiguous settings, like partial occlusions or large variations in viewpoint. One hypothesis that explains this ability is that we process the scene as a global instance. Using global contextual reasoning (e.g., a car sits on a road, but not on a building facade) can constrain interpretations of objects to plausible, coherent precepts. This type of reasoning has been explored in Computer Vision using weak 2D context, mostly extracted from monocular cues. In this thesis, we explore the benefits of strong 3D context extracted from multiple-view geometry. We demonstrate strong ties between geometric reasoning and object recognition, effectively bridging the gap between them to improve scene understanding.In the first part of this thesis, we describe the basic principles of structure from motion, which provide strong and reliable geometric models that can be used for contextual scene understanding. We present a novel algorithm for camera localization that leverages search space partitioning to allow a more aggressive filtering of potential correspondences. We exploit image covisibility using a coarse-to-fine, prioritized search approach that can recognize scene landmarks rapidly. This system achieves state of the art results in large-scale camera localization, especially in difficult scenes with frequently repeated structures.In the second part of this thesis, we study how to exploit these strong geometric models and localized cameras to improve recognition. We introduce an unsupervised training pipeline to generate scene-specific object detectors. These classifiers outperform state of the art and can be used when the rough camera location is known. When precise camera pose is available, we can inject additional geometric cues into novel re-scoring framework to further improve detection. We demonstrate the utility of background scene models for false positive pruning, akin to video-surveillance background subtraction strategies. Finally, we observe that the increasing availability of mapping data stored in Geographic Information Systems (GIS) provides strong geo-semantic information that can be used when cameras are located in world coordinates. We propose a novel contextual reasoning pipeline that uses lifted 2D GIS models to quickly retrieve precise geo-semantic priors. We use these cues to to improve object detection and image semantic segmentation, providing a successful trade-off of false positives that boosts average precision over baseline detection models.

Book Advanced Concepts for Intelligent Vision Systems

Download or read book Advanced Concepts for Intelligent Vision Systems written by Jaques Blanc-Talon and published by Springer. This book was released on 2011-09-06 with total page 777 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the 13th International Conference on Advanced Concepts for Intelligent Vision Systems, ACIVS 2011, held in Ghent, Belgium, in August 2011. The 66 revised full papers presented were carefully reviewed and selected from 124 submissions. The papers are organized in topical sections on classification recognition, and tracking, segmentation, images analysis, image processing, video surveillance and biometrics, algorithms and optimization; and 3D, depth and scene understanding.

Book Representations and Techniques for 3D Object Recognition and Scene Interpretation

Download or read book Representations and Techniques for 3D Object Recognition and Scene Interpretation written by Derek Santhanam and published by Springer. This book was released on 2011-08-18 with total page 147 pages. Available in PDF, EPUB and Kindle. Book excerpt: One of the grand challenges of artificial intelligence is to enable computers to interpret 3D scenes and objects from imagery. This book organizes and introduces major concepts in 3D scene and object representation and inference from still images, with a focus on recent efforts to fuse models of geometry and perspective with statistical machine learning. The book is organized into three sections: (1) Interpretation of Physical Space; (2) Recognition of 3D Objects; and (3) Integrated 3D Scene Interpretation. The first discusses representations of spatial layout and techniques to interpret physical scenes from images. The second section introduces representations for 3D object categories that account for the intrinsically 3D nature of objects and provide robustness to change in viewpoints. The third section discusses strategies to unite inference of scene geometry and object pose and identity into a coherent scene interpretation. Each section broadly surveys important ideas from cognitive science and artificial intelligence research, organizes and discusses key concepts and techniques from recent work in computer vision, and describes a few sample approaches in detail. Newcomers to computer vision will benefit from introductions to basic concepts, such as single-view geometry and image classification, while experts and novices alike may find inspiration from the book's organization and discussion of the most recent ideas in 3D scene understanding and 3D object recognition. Specific topics include: mathematics of perspective geometry; visual elements of the physical scene, structural 3D scene representations; techniques and features for image and region categorization; historical perspective, computational models, and datasets and machine learning techniques for 3D object recognition; inferences of geometrical attributes of objects, such as size and pose; and probabilistic and feature-passing approaches for contextual reasoning about 3D objects and scenes. Table of Contents: Background on 3D Scene Models / Single-view Geometry / Modeling the Physical Scene / Categorizing Images and Regions / Examples of 3D Scene Interpretation / Background on 3D Recognition / Modeling 3D Objects / Recognizing and Understanding 3D Objects / Examples of 2D 1/2 Layout Models / Reasoning about Objects and Scenes / Cascades of Classifiers / Conclusion and Future Directions

Book Feature Based Probabilistic Data Association for Video Based Multi Object Tracking

Download or read book Feature Based Probabilistic Data Association for Video Based Multi Object Tracking written by Grinberg, Michael and published by KIT Scientific Publishing. This book was released on 2018-08-10 with total page 296 pages. Available in PDF, EPUB and Kindle. Book excerpt: This work proposes a feature-based probabilistic data association and tracking approach (FBPDATA) for multi-object tracking. FBPDATA is based on re-identification and tracking of individual video image points (feature points) and aims at solving the problems of partial, split (fragmented), bloated or missed detections, which are due to sensory or algorithmic restrictions, limited field of view of the sensors, as well as occlusion situations.

Book 3D Object Detection and Tracking for Autonomous Vehicles

Download or read book 3D Object Detection and Tracking for Autonomous Vehicles written by Su Pang and published by . This book was released on 2022 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Autonomous driving systems require accurate 3D object detection and tracking to achieve reliable path planning and navigation. For object detection, there have been significant advances in neural networks for single-modality approaches. However, it has been surprisingly difficult to train networks to use multiple modalities in a way that demonstrates gain over single-modality networks. In this dissertation, we first propose three networks for Camera-LiDAR and Camera-Radar fusion. For Camera-LiDAR fusion, CLOCs (Camera-LiDAR Object Candidates fusion) and Fast-CLOCs are presented. CLOCs fusion provides a multi-modal fusion framework that significantly improves the performance of single-modality detectors. CLOCs operates on the combined output candidates before Non-Maximum Suppression (NMS) of any 2D and any 3D detector, and is trained to leverage their geometric and semantic consistencies to produce more accurate 3D detection results. Fast-CLOCs can run in near real-time with less computational requirements compared to CLOCs. Fast-CLOCs eliminates the separate heavy 2D detector, and instead uses a 3D detector-cued 2D image detector (3D-Q-2D) to reduce memory and computation. For Camera-Radar fusion, we propose TransCAR, a Transformer-based Camera-And-Radar fusion solution for 3D object detection. The cross-attention layer within the transformer decoder can adaptively learn the soft-association between the radar features and vision queries instead of hard-association based on sensor calibration only. Then, we propose to solve the 3D multiple object tracking (MOT) problem for autonomous driving applications using a random finite set-based (RFS) Multiple Measurement Models filter (RFS-M3). In particular, we propose multiple measurement models for a Poisson multi-Bernoulli mixture (PMBM) filter in support of different application scenarios. Our RFS-M3 filter can naturally model these uncertainties accurately and elegantly. We combine learning-based detections with our RFS-M3 tracker by incorporating the detection confidence score into the PMBM prediction and update step. We have evaluated our CLOCs, Fast-CLOCs and TransCAR fusion-based 3D detector and RFS-M3 3D tracker using challenging datasets including KITTI, nuScenes, Argoverse and Waymo that are released by academia and industry leaders. Superior experimental results demonstrated the effectiveness of the proposed approaches.

Book Advanced Machine Learning

Download or read book Advanced Machine Learning written by Dr. Amit Kumar Tyagi and published by BPB Publications. This book was released on 2024-06-29 with total page 612 pages. Available in PDF, EPUB and Kindle. Book excerpt: DESCRIPTION Our book is divided into several useful concepts and techniques of machine learning. This book serves as a valuable resource for individuals seeking to deepen their understanding of advanced topics in this field. Learn about various learning algorithms, including supervised, unsupervised, and reinforcement learning, and their mathematical foundations. Discover the significance of feature engineering and selection for enhancing model performance. Understand model evaluation metrics like accuracy, precision, recall, and F1-score, along with techniques like cross-validation and grid search for model selection. Explore ensemble learning methods along with deep learning, unsupervised learning, time series analysis, and reinforcement learning techniques. Lastly, uncover real-world applications of the machine and deep learning algorithms. After reading this book, readers will gain a comprehensive understanding of machine learning fundamentals and advanced techniques. With this knowledge, readers will be equipped to tackle real-world problems, make informed decisions, and develop innovative solutions using machine and deep learning algorithms. KEY FEATURES ● Basic understanding of machine learning algorithms via MATLAB, R, and Python. ● Inclusion of examples related to real-world problems, case studies, and questions related to futuristic technologies. ● Adding futuristic technologies related to machine learning and deep learning. WHAT YOU WILL LEARN ● Ability to tackle complex machine learning problems. ● Understanding of foundations, algorithms, ethical issues, and how to implement each learning algorithm for their own use/ with their data. ● Efficient data analysis for real-time data will be understood by researchers/ students. ● Using data analysis in near future topics and cutting-edge technologies. WHO THIS BOOK IS FOR This book is ideal for students, professors, and researchers. It equips industry experts and academics with the technical know-how and practical implementations of machine learning algorithms. TABLE OF CONTENTS 1. Introduction to Machine Learning 2. Statistical Analysis 3. Linear Regression 4. Logistic Regression 5. Decision Trees 6. Random Forest 7. Rule-Based Classifiers 8. Naïve Bayesian Classifier 9. K-Nearest Neighbors Classifiers 10. Support Vector Machine 11. K-Means Clustering 12. Dimensionality Reduction 13. Association Rules Mining and FP Growth 14. Reinforcement Learning 15. Applications of ML Algorithms 16. Applications of Deep Learning 17. Advance Topics and Future Directions

Book Exploiting Scene Context for On line Object Tracking in Unconstrained Environments

Download or read book Exploiting Scene Context for On line Object Tracking in Unconstrained Environments written by Salma Moujtahid and published by . This book was released on 2016 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: With the increasing need for automated video analysis, visual object tracking became an important task in computer vision. Object tracking is used in a wide range of applications such as surveillance, human-computer interaction, medical imaging or vehicle navigation. A tracking algorithm in unconstrained environments faces multiple challenges : potential changes in object shape and background, lighting, camera motion, and other adverse acquisition conditions. In this setting, classic methods of background subtraction are inadequate, and more discriminative methods of object detection are needed. Moreover, in generic tracking algorithms, the nature of the object is not known a priori. Thus, off-line learned appearance models for specific types of objects such as faces, or pedestrians can not be used. Further, the recent evolution of powerful machine learning techniques enabled the development of new tracking methods that learn the object appearance in an online manner and adapt to the varying constraints in real time, leading to very robust tracking algorithms that can operate in non-stationary environments to some extent. In this thesis, we start from the observation that different tracking algorithms have different strengths and weaknesses depending on the context. To overcome the varying challenges, we show that combining multiple modalities and tracking algorithms can considerably improve the overall tracking performance in unconstrained environments. More concretely, we first introduced a new tracker selection framework using a spatial and temporal coherence criterion. In this algorithm, multiple independent trackers are combined in a parallel manner, each of them using low-level features based on different complementary visual aspects like colour, texture and shape. By recurrently selecting the most suitable tracker, the overall system can switch rapidly between different tracking algorithms with specific appearance models depending on the changes in the video. In the second contribution, the scene context is introduced to the tracker selection. We designed effective visual features, extracted from the scene context to characterise the different image conditions and variations. At each point in time, a classifier is trained based on these features to predict the tracker that will perform best under the given scene conditions. We further improved this context-based framework and proposed an extended version, where the individual trackers are changed and the classifier training is optimised. Finally, we started exploring one interesting perspective that is the use of a Convolutional Neural Network to automatically learn to extract these scene features directly from the input image and predict the most suitable tracker.

Book Moving Objects Detection Using Machine Learning

Download or read book Moving Objects Detection Using Machine Learning written by Navneet Ghedia and published by Springer Nature. This book was released on 2022-01-01 with total page 91 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book shows how machine learning can detect moving objects in a digital video stream. The authors present different background subtraction approaches, foreground segmentation, and object tracking approaches to accomplish this. They also propose an algorithm that considers a multimodal background subtraction approach that can handle a dynamic background and different constraints. The authors show how the proposed algorithm is able to detect and track 2D & 3D objects in monocular sequences for both indoor and outdoor surveillance environments and at the same time, also able to work satisfactorily in a dynamic background and with challenging constraints. In addition, the shows how the proposed algorithm makes use of parameter optimization and adaptive threshold techniques as intrinsic improvements of the Gaussian Mixture Model. The presented system in the book is also able to handle partial occlusion during object detection and tracking. All the presented work and evaluations were carried out in offline processing with the computation done by a single laptop computer with MATLAB serving as software environment.

Book Exploiting Scene Context for On line Object Tracking in Unconstrained Environments

Download or read book Exploiting Scene Context for On line Object Tracking in Unconstrained Environments written by Salma Moujtahid and published by . This book was released on 2019 with total page 137 pages. Available in PDF, EPUB and Kindle. Book excerpt: With the increasing need for automated video analysis, visual object tracking became an important task in computer vision. Object tracking is used in a wide range of applications such as surveillance, human-computer interaction, medical imaging or vehicle navigation. A tracking algorithm in unconstrained environments faces multiple challenges : potential changes in object shape and background, lighting, camera motion, and other adverse acquisition conditions. In this setting, classic methods of background subtraction are inadequate, and more discriminative methods of object detection are needed. Moreover, in generic tracking algorithms, the nature of the object is not known a priori. Thus, off-line learned appearance models for specific types of objects such as faces, or pedestrians can not be used. Further, the recent evolution of powerful machine learning techniques enabled the development of new tracking methods that learn the object appearance in an online manner and adapt to the varying constraints in real time, leading to very robust tracking algorithms that can operate in non-stationary environments to some extent. In this thesis, we start from the observation that different tracking algorithms have different strengths and weaknesses depending on the context. To overcome the varying challenges, we show that combining multiple modalities and tracking algorithms can considerably improve the overall tracking performance in unconstrained environments. More concretely, we first introduced a new tracker selection framework using a spatial and temporal coherence criterion. In this algorithm, multiple independent trackers are combined in a parallel manner, each of them using low-level features based on different complementary visual aspects like colour, texture and shape. By recurrently selecting the most suitable tracker, the overall system can switch rapidly between different tracking algorithms with specific appearance models depending on the changes in the video. In the second contribution, the scene context is introduced to the tracker selection. We designed effective visual features, extracted from the scene context to characterise the different image conditions and variations. At each point in time, a classifier is trained based on these features to predict the tracker that will perform best under the given scene conditions. We further improved this context-based framework and proposed an extended version, where the individual trackers are changed and the classifier training is optimised. Finally, we started exploring one interesting perspective that is the use of a Convolutional Neural Network to automatically learn to extract these scene features directly from the input image and predict the most suitable tracker.

Book Video Analytics for Business Intelligence

Download or read book Video Analytics for Business Intelligence written by Caifeng Shan and published by Springer Science & Business Media. This book was released on 2012-04-07 with total page 374 pages. Available in PDF, EPUB and Kindle. Book excerpt: Closed Circuit TeleVision (CCTV) cameras have been increasingly deployed pervasively in public spaces including retail centres and shopping malls. Intelligent video analytics aims to automatically analyze content of massive amount of public space video data and has been one of the most active areas of computer vision research in the last two decades. Current focus of video analytics research has been largely on detecting alarm events and abnormal behaviours for public safety and security applications. However, increasingly CCTV installations have also been exploited for gathering and analyzing business intelligence information, in order to enhance marketing and operational efficiency. For example, in retail environments, surveillance cameras can be utilised to collect statistical information about shopping behaviour and preference for marketing (e.g., how many people entered a shop; how many females/males or which age groups of people showed interests to a particular product; how long did they stay in the shop; and what are the frequent paths), and to measure operational efficiency for improving customer experience. Video analytics has the enormous potential for non-security oriented commercial applications. This book presents the latest developments on video analytics for business intelligence applications. It provides both academic and commercial practitioners an understanding of the state-of-the-art and a resource for potential applications and successful practice.