DETECTION OF FILLERS IN THE SPEECH BY PEOPLE WHO STUTTER
Waldemar SUSZYŃSKI
w.suszynski@pollub.pl* Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science (Poland)
Małgorzata CHARYTANOWICZ
Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science (Poland)
Wojciech ROSA
Lublin University of Technology, Faculty of Technology Fundamentals (Poland)
Leopold KOCZAN
Lublin University of Technology, Faculty of Technology Fundamentals (Poland)
Rafał STĘGIERSKI
Lublin University of Technology, Faculty of Electrical Engineering and Computer Science (Poland)
Abstract
Stuttering is a speech impediment that is a very complex disorder. It is difficult to diagnose and treat, and is of unknown initiation, despite the large number of studies in this field. Stuttering can take many forms and varies from person to person, and it can change under the influence of external factors. Diagnosing and treating speech disorders such as stuttering requires from a speech therapist, not only good profes-sional preparation, but also experience gained through research and practice in the field. The use of acoustic methods in combination with elements of artificial intelligence makes it possible to objectively assess the disorder, as well as to control the effects of treatment. The main aim of the study was to present an algorithm for automatic recognition of fillers disfluency in the statements of people who stutter. This is done on the basis of their parameterized features in the amplitude-frequency space. The work provides as well, exemplary results demonstrating their possibility and effectiveness. In order to verify and optimize the procedures, the statements of seven stutterers with duration of 2 to 4 minutes were selected. Over 70% efficiency and predictability of automatic detection of these disfluencies was achieved. The use of an automatic method in conjunction with therapy for a stuttering person can give us the opportunity to objectively assess the disorder, as well as to evaluate the progress of therapy.
Keywords:
stuttering, fillers disfluency, automatic recognition, fillers detectionReferences
Alharbia, S., Hasana, M., Simonsa, A. J. H., Brumfitt, S., & Green, P. (2020). Sequence labeling to detect stuttering events in read speech. Computer Speech & Language, 62, 101052. http://doi.org/10.1016/j.csl.2019.101052
DOI: https://doi.org/10.1016/j.csl.2019.101052
Google Scholar
Bloodstein, O. (1995). A handbook on stuttering. Singular Publishing Group, Inc.
Google Scholar
Czyżewski, A., Kaczmarek, A., & Kostek, B. (2003). Intelligent processing of stuttered speech. Journal of Intelligent Inform. Systems, 143–171.
DOI: https://doi.org/10.1023/A:1024710532716
Google Scholar
Howell, P., & Sackin, S. J. (1995). Automatic recognition of repetitions and prolongations in stuttered speech, Stuttering. Proceedings of the First World Congress on Fluency Disorders (pp. 372–374). Munich.
Google Scholar
Howell, P., Sackin, S. J., Glenn, K., & Au-Yeung, J. (1997). Automatic stuttering frequency counts, Speech Motor Production and Fluency Disorders. Elsevier.
Google Scholar
Kuniszyk-Jóźkowiak, W., Dzieńkowski, M., Smołka E., & Suszyński, W. (2003). Computer Diagnosis and Therapy of Stuttering. Structures – Waves – Human Health, VIII(2), 133–144.
Google Scholar
Kuniszyk-Jóźkowiak, W., Smołka, E., & Suszyński, W. (2001). Acoustical characteristics alteration in persons who stutter resulting from therapy. Structures-Waves-Biomedical Engineering, X(2), 57–68.
Google Scholar
Kuniszyk-Jóźkowiak, W., Smołka, E., Dzieńkowski, M., & Suszyński W. (2004). Computer therapy of speech non-fluency with automatic adaptation of individual person's difficulties. Structures-Waves-Human Health, VIII(2), 63–70.
Google Scholar
Moore, B. C. J., & Glasberg, B. R. (1983). Suggested formulae for calculating auditory-filter banwidths and excitation patterns. The Journal of the Acoustical Society of America, 74, 750–753.
DOI: https://doi.org/10.1121/1.389861
Google Scholar
Moore, B. C. J., Peters, R. W., & Glasberg, B. R. (1990). Auditory filters shapes at low center frequencies. The Journal of the Acoustical Society of America, 88, 132–149.
DOI: https://doi.org/10.1121/1.399960
Google Scholar
Smołka, E., Kuniszyk-Jóźkowiak, W., Suszyński, W., & Dzieńkowski, M. (2003). Speech syllabic structure extraction with application of Kohonen network. Annales Informatica Universitatis Mariae CurieSkłodowska, AI 1,125–131.
Google Scholar
Stromsta, C. (1993). The nature and management of stuttering. Proceedings Abstracta, Congressus XVIII (pp. 16–18). Societatis Phoniatricae Europaeae, Praga.
Google Scholar
Suszyński, W., Kuniszyk-Józkowiak, W., Smolka, E., & Dzienkowski, M. (2003). Automatic Recognition of Nasals Prolongations in the Speech of Persons who Stutter. Structures-Waves-Human Health, XII(2), 175–184.
Google Scholar
Suszyński, W., Kuniszyk-Jóźkowiak, W., Smołka, E., & Dzieńkowski, M. (2003). Prolongation detection with application of fuzzy logic. Annales Informatica Universitatis Mariae Curie-Skłodowska, AI 1, 133–140.
Google Scholar
Suszyński, W., Kuniszyk-Jóźkowiak, W., Smołka, E., & Dzieńkowski, M. (2005). Speech disfluency detection with correlative method. Annales Informatica Universitatis Mariae Curie-Skłodowska, AI 3, 131–138.
Google Scholar
Świetlicka, I., Kuniszyk-Jóźkowiak, W., & Smołka, E. (2013). Hierarchical ANN system for stuttering identification. Computer Speech & Language, 27(1), 228–242. https://doi.org/10.1016/j.csl.2012.05.003
DOI: https://doi.org/10.1016/j.csl.2012.05.003
Google Scholar
Wingate, M. E. (2002). Foundation of stuttering. Academic Press.
DOI: https://doi.org/10.1163/9789004487192
Google Scholar
Wiśniewski, M, Kuniszyk-Jóźkowiak, W., Smołka, E., & Suszyński, W. (2010). Improved Approach to Automatic Detection of Speech Disorders Based the Hidden Markov Models Approach. Journal of Medical Informatics & Technologies, 15, 145–152. http://doi.org/10.1007/978-3-540-75175-5_56
DOI: https://doi.org/10.1007/978-3-540-75175-5_56
Google Scholar
Wiśniewski, M., & Kuniszyk-Jóźkowiak, W. (2015). Automatic detection of stuttering in a speech. Journal of Medical Informatics & Technologies, 24, 31–37.
Google Scholar
Authors
Waldemar SUSZYŃSKIw.suszynski@pollub.pl
* Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science Poland
Authors
Małgorzata CHARYTANOWICZLublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science Poland
Authors
Wojciech ROSALublin University of Technology, Faculty of Technology Fundamentals Poland
Authors
Leopold KOCZANLublin University of Technology, Faculty of Technology Fundamentals Poland
Authors
Rafał STĘGIERSKILublin University of Technology, Faculty of Electrical Engineering and Computer Science Poland
Statistics
Abstract views: 212PDF downloads: 38
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in Applied Computer Science are open-access and distributed under the terms of the Creative Commons Attribution 4.0 International License.
Similar Articles
- Qingyu Liu, Roben A. Juanatas, MASK FACE INPAINTING BASED ON IMPROVED GENERATIVE ADVERSARIAL NETWORK , Applied Computer Science: Vol. 19 No. 2 (2023)
- Mohanad ABDULHAMID, Deng PETER, REMOTE HEALTH MONITORING: FALL DETECTION , Applied Computer Science: Vol. 16 No. 1 (2020)
- Marcin BADUROWICZ, DETECTION OF SOURCE CODE IN INTERNET TEXTS USING AUTOMATICALLY GENERATED MACHINE LEARNING MODELS , Applied Computer Science: Vol. 18 No. 1 (2022)
- Nouhaila BOUALOULOU, Taoufiq BELHOUSSINE DRISSI, Benayad NSIRI, CNN AND LSTM FOR THE CLASSIFICATION OF PARKINSON'S DISEASE BASED ON THE GTCC AND MFCC , Applied Computer Science: Vol. 19 No. 2 (2023)
- Abdelrahman Halawa, Shehab Gamalel-Din, Abdurrahman Nasr, EXPLOITING BERT FOR MALFORMED SEGMENTATION DETECTION TO IMPROVE SCIENTIFIC WRITINGS , Applied Computer Science: Vol. 19 No. 2 (2023)
- Mahmoud BAKR, Sayed ABDEL-GABER, Mona NASR, Maryam HAZMAN, TOMATO DISEASE DETECTION MODEL BASED ON DENSENET AND TRANSFER LEARNING , Applied Computer Science: Vol. 18 No. 2 (2022)
- Baldemar ZURITA, Luís LUNA, José HERNÁNDEZ, Federico RAMÍREZ, BOVW FOR CLASSIFICATION IN GEOMETRICS SHAPES , Applied Computer Science: Vol. 14 No. 4 (2018)
- Katarzyna BARAN, APPLICATION OF THERMAL IMAGING CAMERAS FOR SMARTPHONE: SEEK THERMAL COMPACT PRO AND FLIR ONE PRO FOR HUMAN STRESS DETECTION – COMPARISON AND STUDY , Applied Computer Science: Vol. 20 No. 1 (2024)
- Alexis J. LOPEZ, Perfecto M. QUINTERO, Ana K. HERNANDEZ, ANALYTICS AND DATA SCIENCE APPLIED TO THE TRAJECTORY OUTLIER DETECTION , Applied Computer Science: Vol. 16 No. 2 (2020)
- Edyta ŁUKASIK, Wiktor FLIS, EFFICIENCY COMPARISON OF NETWORKS IN HANDWRITTEN LATIN CHARACTERS RECOGNITION WITH DIACRITICS , Applied Computer Science: Vol. 19 No. 4 (2023)
You may also start an advanced similarity search for this article.