Research Article |
Object Detection and Recognition in Real Time Using Deep Learning for Visually Impaired People
Author(s) : M. I. Thariq Hussan1, D. Saidulu2, P. T. Anitha3, A. Manikandan4 and P. Naresh5
Published In : International Journal of Electrical and Electronics Research (IJEER) Volume 10, Issue 2 , Special Issue on IEEE-SD
Publisher : FOREX Publication
Published : 13 May 2022
e-ISSN : 2347-470X
Page(s) : 80-86
Abstract
This paper aims at combining object detection at real time and recognition with suitable deep learning methods in order to detect and recognize objects position as well as the names of multiple objects detected by the camera using an object detector algorithm. This is to aid the visually impaired user without the help of any other person. The image and video processing algorithms were designed to take real-time inputs from the camera, Deep Neural Networks were used to predict the objects and uses Google’s famous Text-To-Speech (GTTS) API module for the anticipated voice output precisely detecting and recognizing the category or class of objects and locations contained. Our best result shows that the system recognizes 91 categories of outdoor objects and produces the output in speech i.e. in an audio format even when a reduced amount of spectral information from the data is available.
Keywords: Deep Neural Networks
, GTTS
, Object detection
, Object recognition
, YOLO
M. I. Thariq Hussan, Prof. & Head, Department of IT, Guru Nanak Institutions Technical Campus, Hyderabad, India; Email: thariqhussain@rediffmail.com
D. Saidulu, Asso. Prof., Department of IT, Guru Nanak Institutions Technical Campus, Hyderabad, India; Email: fly2.sai@gmail.com
P. T. Anitha, Asso. Prof., Department of Computer Science, Wollega University, Nekemte, Ethiopia; Email: anithapt74@gmail.com
A. Manikandan, Principal, Muthayammal Memorial College of Arts & Science, Tamilnadu, India
P. Naresh, Asst. Prof., Department of IT, Guru Nanak Institutions Technical Campus, Hyderabad, India; Email: pannanginaresh@gmail.com
[1] Mayur Rahul, Namita Tiwari, Rati Shukla, Devvrat Tyagi and Vikash Yadav (2022), A New Hybrid Approach for Efficient Emotion Recognition using Deep Learning. IJEER 10(1), 18-22. DOI: 10.37391/IJEER.100103.[Cross Ref]
[2] World Health Organization, Blindness and Vision Impairment, World Health Report, Geneva, 2019. https://www.who.int/news-room/fact-sheets/detail/blindness-and-visual-impairment, October 2019.[Cross Ref]
[3] Matusiak, K., Skulimowski, P., Strurnillo, P., Object recognition in a mobile phone application for visually impaired users, International Conference on Human System Interactions, 2013.[Cross Ref]
[4] Dionisi, A., Sardini, E., Serpelloni, M., Wearable object detection system for the blind, IEEE International Instrumentation and Measurement Technology Conference Proceedings, pp. 1255-1258, 2012.
[5] Jack M. Loomis, Digital map and navigation system for the visually impaired, Department of Psychology, University of California, Santa Barbara, 1985.
[6] Muljowidodo, K., Mochammad A. Rasyid, SaptoAdi, N., Agus Budiyono, Vision based distance measurement system using single laser pointer design for under water vehicle, Indian Journal of Marine science, Vol. 38, No. 3 pp. 324-331, 2009.
[7] Neha Bari, Nilesh Kamble, Parnavi Tamhankar, Android based object recognition and motion detection to aid visually impaired, International Journal of Advances in Computer Science and Technology, Vol. 3, No.10, pp. 462-466, 2014.[Cross Ref]
[8] Jason Yip, Object Detection with Voice Feedback YOLO v3 + gTTS, https://towardsdatascience.com/object-detection-with-voice-feedback-yolo-v3-gtts-6ec732dca91.[Cross Ref]
[9] Samkit Shah, CNN based Auto-Assistance System as a Boon for Directing Visually Impaired Person, 3rd International Conference on Trends in Electronics and Informatics, 2019.[Cross Ref]
[10] Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once: Unified, Real-Time Object Detection, IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
[11] ODSC-Open Data Science, Overview of the YOLO Object Detection Algorithm, https://www.medium.com/@ODSC/overview-of-the-yolo-object-detection-algorithm-7b52a745d3e0.[Cross Ref]
[12] Joseph Redmon, Ali Farhadi, YOLO9000: Better, Faster, Stronger, IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, 2017.
[13] gTTS, https://pypi.org/project/gTTS/.[Cross Ref]
[14] Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollar, Lawrence Zitnick, C., Microsoft COCO: Common objects in Context, European conference on Computer Vision, pp. 740-755, 2014.
[15] Amikelive Technology Blog, What object categories/labels are in COCO dataset?, https://tech.amikelive.com/node-718/what-object-categories- labels-are-in-coco-dataset/, 2018.
[16] Abdul Vahab, Maruti S Naik, Prasanna G Raikar, Prasad S R, Applications of Object Detection System, International Research Journal of Engineering and Technology, Vol. 6, No. 4, pp. 4186-4192, 2019.[Cross Ref]
[17] Christian Szegedy, Alexander Toshev, Dumitru Erhan, Deep Neural Networks for object detection, Advances in Neural Information Processing Systems 26, 2013.
[18] Joseph Redmon, Ali Farhadi, Yolov3: An incremental improvement, https://arxiv.org/abs/1804.02767, 2018.
M. I. Thariq Hussan, D. Saidulu, P. T. Anitha, A. Manikandan and P. Naresh (2022), Object Detection and Recognition in Real Time Using Deep Learning for Visually Impaired People. IJEER 10(2), 80-86. DOI: 10.37391/IJEER.100205.