FINE-TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION

Caisse Amisse; Mario Ernesto Jijón-Palma; Jorge Antonio Silva Centeno

FINE-TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION

Authors

Caisse Amisse Universidade Rovuma, Departamento de Ciências Naturais, Nampula, Moçambique http://orcid.org/0000-0001-9458-5510
Mario Ernesto Jijón-Palma Universidade Federal do Paraná, Programa de Pós-graduação em Ciências Geodésicas, Curitiba - Paraná, Brasil http://orcid.org/0000-0003-4890-2997
Jorge Antonio Silva Centeno Universidade Federal do Paraná, Programa de Pós-graduação em Ciências Geodésicas, Curitiba - Paraná, Brasil. http://orcid.org/0000-0002-2669-7147

Keywords:

fine-tuning, pedestrian detection, training data, deep learning models.

Abstract

Object detection in high resolution images is a new challenge that the remote sensing community is facing thanks to introduction of unmanned aerial vehicles and monitoring cameras. One of the interests is to detect and trace persons in the images. Different from general objects, pedestrians can have different poses and are undergoing constant morphological changes while moving, this task needs an intelligent solution. Fine-tuning has woken up great interest among researchers due to its relevance for retraining convolutional networks for many and interesting applications. For object classification, detection, and segmentation fine-tuned models have shown state-of-the-art performance. In the present work, we evaluate the performance of fine-tuned models with a variation of training data by comparing Faster Region-based Convolutional Neural Network (Faster R-CNN) Inception v2, Single Shot MultiBox Detector (SSD) Inception v2, and SSD Mobilenet v2. To achieve the goal, the effect of varying training data on performance metrics such as accuracy, precision, F1-score, and recall are taken into account. After testing the detectors, it was identified that the precision and recall are more sensitive on the variation of the amount of training data. Under five variation of the amount of training data, we observe that the proportion of 60%-80% consistently achieve highly comparable performance, whereas in all variation of training data Faster R-CNN Inception v2 outperforms SSD Inception v2 and SSD Mobilenet v2 in evaluated metrics, but the SSD converges relatively quickly during the training phase. Overall, partitioning 80% of total data for fine-tuning trained models produces efficient detectors even with only 700 data samples.

Downloads

PDF (Português (Brasil))

Published

2022-07-06

How to Cite

Amisse, C., Jijón-Palma, M. E., & Centeno, J. A. S. (2022). FINE-TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION. Bulletin of Geodetic Sciences, 27(2). Retrieved from https://revistas.ufpr.br/bcg/article/view/82504

Download Citation

Issue

Vol. 27 No. 2 (2021)

Section

Article

License

Submission of an original manuscript to the Journal will be taken to mean that it represents original work not previously published, that is not being considered elsewhere for publication.

The BCG allows the author(s) to hold the copyright without restrictions and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.

The BCG also allows the authors to retain publishing rights without restrictions.

Biblioteca Digital
de Periódicos
da Universidade Federal do Paraná

FINE-TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information

Keywords

Language

Make a Submission

Biblioteca Digitalde Periódicosda Universidade Federal do Paraná

FINE-TUNING DEEP LEARNING MODELS FOR PEDESTRIAN DETECTION

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information

Keywords

Language

Make a Submission

Biblioteca Digital
de Periódicos
da Universidade Federal do Paraná