Pedestrian Detection and Vehicle Classification using Deep Convolutional Neural Networks for Video Surveillance Applications

Today, we come across examples of artificial intelligence in many areas of our lives with the development of machine learning and deep learning networks. One of these areas is to combine surveillance cameras with artificial intelligence to perform a determined task without a personnel. To make its job description more understandable, we can say that using artificial intelligence to detect and classify objects in real-time surveillance camera images. Although this task may seem easy for us, it is extremely important to perform the detection task quickly and accurately when performed on the machine side. In this direction, an end-to-end system solution has been developed by using deep learning networks for the detection of intruders in 8 classes including people and vehicles in outdoor live surveillance cameras. First, the state-of-the-art technology algorithms capable of fast and accurate detection that can be used for object detection and classification task to be performed on the end device provided by Videotec company were examined and it was decided to use YOLO version 4 in the preliminary tests. The YOLO model is used with pre-trained weights to reduce time and power costs instead of being trained from scratch. In order to test the performance of the model in harsh conditions, a dataset in MSCOCO standard was created, in which images are taken from live surveillance camera images in addition to samples taken from known datasets such as MSCOCO, iLiDS, UA-DETRAC. The dataset that we call critical conditions includes various scenarios such as black/white, occlusion and bad weather. Afterwards, various tests were carried out using Tensorflow official tools and the values such as inference time and footprint of the model were compared with different models and tested on the specified hardware. Furthermore, some of the real-time tests were performed on Google Colab due to the lack of hardware and the results were recorded. As a result of all these tests, it is seen that the lack of GPU usage determined within the scope of the project is the biggest bottleneck and it is determined that the CPU was insufficient.

Pedestrian Detection and Vehicle Classification using Deep Convolutional Neural Networks for Video Surveillance Applications

ADAR, ANIL

2021/2022

Abstract

Today, we come across examples of artificial intelligence in many areas of our lives with the development of machine learning and deep learning networks. One of these areas is to combine surveillance cameras with artificial intelligence to perform a determined task without a personnel. To make its job description more understandable, we can say that using artificial intelligence to detect and classify objects in real-time surveillance camera images. Although this task may seem easy for us, it is extremely important to perform the detection task quickly and accurately when performed on the machine side. In this direction, an end-to-end system solution has been developed by using deep learning networks for the detection of intruders in 8 classes including people and vehicles in outdoor live surveillance cameras. First, the state-of-the-art technology algorithms capable of fast and accurate detection that can be used for object detection and classification task to be performed on the end device provided by Videotec company were examined and it was decided to use YOLO version 4 in the preliminary tests. The YOLO model is used with pre-trained weights to reduce time and power costs instead of being trained from scratch. In order to test the performance of the model in harsh conditions, a dataset in MSCOCO standard was created, in which images are taken from live surveillance camera images in addition to samples taken from known datasets such as MSCOCO, iLiDS, UA-DETRAC. The dataset that we call critical conditions includes various scenarios such as black/white, occlusion and bad weather. Afterwards, various tests were carried out using Tensorflow official tools and the values such as inference time and footprint of the model were compared with different models and tested on the specified hardware. Furthermore, some of the real-time tests were performed on Google Colab due to the lack of hardware and the results were recorded. As a result of all these tests, it is seen that the lack of GPU usage determined within the scope of the project is the biggest bottleneck and it is determined that the CPU was insufficient.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Ingegneria dell'Informazione - DEI
			
	Corso di studio
	
				ICT FOR INTERNET AND MULTIMEDIA - INGEGNERIA PER LE COMUNICAZIONI MULTIMEDIALI E INTERNET Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2021
			
	Titolo inglese
	
				Pedestrian Detection and Vehicle Classification using Deep Convolutional Neural Networks for Video Surveillance Applications
			
	Abstract in italiano
	
				Today, we come across examples of artificial intelligence in many areas of our lives with the development of machine learning and deep learning networks. One of these areas is to combine surveillance cameras with artificial intelligence to perform a determined task without a personnel. To make its job description more understandable, we can say that using artificial intelligence to detect and classify objects in real-time surveillance camera images. Although this task may seem easy for us, it is extremely important to perform the detection task quickly and accurately when performed on the machine side. In this direction, an end-to-end system solution has been developed by using deep learning networks for the detection of intruders in 8 classes including people and vehicles in outdoor live surveillance cameras. First, the state-of-the-art technology algorithms capable of fast and accurate detection that can be used for object detection and classification task to be performed on the end device provided by Videotec company were examined and it was decided to use YOLO version 4 in the preliminary tests. The YOLO model is used with pre-trained weights to reduce time and power costs instead of being trained from scratch. In order to test the performance of the model in harsh conditions, a dataset in MSCOCO standard was created, in which images are taken from live surveillance camera images in addition to samples taken from known datasets such as MSCOCO, iLiDS, UA-DETRAC. The dataset that we call critical conditions includes various scenarios such as black/white, occlusion and bad weather. Afterwards, various tests were carried out using Tensorflow official tools and the values ​​such as inference time and footprint of the model were compared with different models and tested on the specified hardware. Furthermore, some of the real-time tests were performed on Google Colab due to the lack of hardware and the results were recorded. As a result of all these tests, it is seen that the lack of GPU usage determined within the scope of the project is the biggest bottleneck and it is determined that the CPU was insufficient.
			
	Parola chiave
	
				Object Detection
Neural Networks
Video Surveillance
Image Classification
YOLO
			
	Relatore
	
				ZANELLA, ANDREA
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Adar_Anıl.pdf accesso riservato Dimensione 35.64 MB Formato Adobe PDF	35.64 MB	Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/29238