CGAN Facilitated Data Augmentation of Voice and Speech Parameters for Detecting Parkinson’s Disease in the Prodromal Phase

Sandhya Chandrabhanu - Karpagam Academy of Higher Education (IN), Shanmugam Hemalatha - Karpagam Academy of Higher Education (IN),

Abstract


Parkinson’s disease is a multi-faceted disease affecting the brain. The enormity of its recent rise is quite alarming. This calls for intense research to diagnose early to hasten the progress of diagnosis. Voice distortion is considered an early precursor for Parkinson’s disease. Though several studies in Machine Learning using voice parameters have provided useful information, none of them have been successful in evolving an efficient and generalized model to detect it.  Deep Learning techniques were applied to improve the performance of the model but its major limitation was the size of the dataset. Hence, a need arose to extend the dataset using an appropriate data augmentation method. At this juncture, the conditional generative adversarial network (CGAN) proved to be a useful technique because of its innate feature for generating synthetic data from input noise. The RNN-LSTM classifier could achieve a training accuracy of 87.32%, testing accuracy of 86.3%, training precision of 87.92 %, and testing precision of 89.94%. The results of the experimental study are compared with other state-of-the-art methods. This technique succeeded in reducing the problem of over-fitting and could elevate the performance of the RNN-LSTM classifier in the prediction of Parkinson’s disease.

Keywords


Parkinson’s disease; machine learning; deep learning; data augmentation; conditional generative adversarial network; recurrent neural networks; long short-term memory

Full Text:

PDF


DOI: http://dx.doi.org/10.70594/brain/15.3/16

(C) 2010-2024 EduSoft