An automatic speech recognition approach for controlled medications prescription with natural language processing

Luis Enrique COLMENARES-GUILLÉN; Angel Axel MÉNDEZ-MENESES

doi:10.35784/acs_8699

PDF

Published: Jun 30, 2026

DOI: https://doi.org/10.35784/acs_8699

Issue Vol. 22 No. 2 (2026)

Articles

Path planning in swarm robotics exploration using SARSA and ACO algorithms
Aicha HAFID, Riadh HOCINE, Lahcene GUEZOULI

1-15
Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN
Marco Manuel ARAGON PAUCAR, Kelvin Yhonson FERNANDEZ ACERO, Erasmo SULLA ESPINOZA

16-32
Assessing the effectiveness of one-stage and two-stage methods for identifying high-voltage power grid equipment in UAV imagery
Thi Thanh Tan NGUYEN, Thi Thu Nga VU

33-47
An automatic speech recognition approach for controlled medications prescription with natural language processing
Luis Enrique COLMENARES-GUILLÉN, Angel Axel MÉNDEZ-MENESES

48-66
Improving image retrieval using CNN with PCA and Optimized K-Means clustering
Mohsin Hasan HUSSEIN, Ali Mohsin Ahmed AL-SABAAWI, Zakaria A. Hamed ALNAISH

67-84
Numerical investigation into the hydrodynamic characteristics of water vortex turbines with varied blade angles
Sarwo EDHY SOFYAN, Zamzami, Akhyar AKHYAR, Suriadi, Agus SASMITO

85-104
Optimization of the corporate cluster structure using the Tabu Search method
Andrzej IMIEŁOWSKI, Łukasz BANAŚ, Bogusław TWARÓG, Janusz BYTNAR

105-116
Application controls audit framework in the context of ERP systems
Sakchai TANGPRASERT, Nalinpat BHUMPENPEIN

117-125
Autonomous AI agents in digital markets: Economic implications for competition, pricing, and regulation
Elmira KYDYRBAYEVA, Balhiya SHOMSHEKOVA, Asset ABZHAKOV, Ainur ASHIMOVA, Assel NURTAYEVA

126-137
Multi-criteria analysis of parameter impact in large-scale robotic 3D printing
Łukasz SOBASZEK, Ivan GAJDOŠ, Pavol ŠTEFČÁK

138-147
Designing cloud-based knowledge management systems to improve organizational innovation
Hayfaa Subhi MALALLAH, Sherzad Mohammad AJEEL

148-168
Data normalisation methods on microarray data
Inggih PERMANA, Shir Li WANG, Hoi Yeh LEE, Suliana SULAIMAN, Hasnatul Nazuha HASSAN

169-179
Log-based learning analytics of gamified Moodle activities: Quantifying student engagement
Iva GRUBJEŠIĆ, Tomislav IVANJKO, Vedran JURIČIĆ

180-192
SFAB-Net: Semantic segmentation network for railway track surface defects based on Spatial Fusion and Adaptive Bottleneck feature enhancement
Qike WU, Sharafiz ABDUL RAHIM, Sai Hong TANG, Muhammad Azim AZIZI, Li ZHANG

193-207
Machine learning approach to detect GAI-disguised academic programming plagiarism
Oscar KARNALIM, Yehezkiel David SETIAWAN, Maresha Caroline WIJANTO, Rossevine Artha NATHASYA

208-224

Authors

Luis Enrique COLMENARES-GUILLÉN

enrique.colmenares@correo.buap.mx

Benemérita Universidad Autónoma de Puebla, Mexico

https://orcid.org/0000-0002-9921-8813

Angel Axel MÉNDEZ-MENESES

angel.mendezmen@alumno.buap.mx

Benemérita Universidad Autónoma de Puebla, Mexico

https://orcid.org/0000-0001-2345-6789

Abstract

The prescription and documentation of controlled medications require strict regulatory compliance and high transcription accuracy to prevent medication errors and ensure traceability. In many hospitals, these processes are still performed manually, increasing the risk of transcription errors, administrative delays, and non-compliance with regulatory standards, particularly for medications classified under fractions II and III of the Mexican General Health Law. Addressing this challenge requires intelligent systems capable of accurately transcribing and structuring medical prescriptions from spoken language. This study presents the design and development of an Automatic Speech Recognition (ASR) system integrated with Natural Language Processing (NLP) to support the generation and transcription of controlled medication prescriptions. The system architecture was developed following an analysis of the clinical workflow for medication requests, management, prescription, and transcription, conducted in collaboration with healthcare professionals from the hospital's Pharmacovigilance Department in Puebla, Mexico, and aligned with hospital operational standards. The methodology involved evaluating and fine-tuning three ASR models to improve transcription accuracy for medication names, dosages, and prescription instructions. NLP techniques were subsequently applied to identify and structure key prescription entities, ensuring compliance with national health regulations. Among the evaluated models, the Wav2Vec2 architecture developed by Jonatas Grosman demonstrated the best performance and was selected for implementation. Experimental results show that the optimized ASR model achieved a Word Error Rate (WER) of 6.30%, a precision of 94.72%, a recall of 91.73%, and an F1-score of 93.22%. These results demonstrate the effectiveness of the proposed approach in improving transcription accuracy while reducing false positives in prescription generation. The proposed system highlights the potential of ASR–NLP integration to enhance efficiency, accuracy, and regulatory compliance in hospital pharmacovigilance processes.

Keywords:

speech recognition, controlled medications, healthcare professionals, pharmacovigilance

Sustainable Development Goals (SDG)

3 - Good health and well-being

References

Ayuzo del Valle, N., González Camid, E., Villegas Macedo, F., Flores Osorio, J., & Bosques Padilla, F. (2021). Impacto del Servicio de Farmacia en la disminución de errores en la medicación en pediatría. Revista de la OFIL, 31(2), 161–165.

Baevski, A., Zhou, H., Mohamed, A., & Auli, M. (2020). wav2vec 2.0: A framework for self-supervised learning of speech representations. ArXiv, abs/2006.11477. https://doi.org/10.48550/arXiv.2006.11477

Báez, P., Arancibia, A. P., Chaparro, M. I., Bucarey, T., Núñez, F., & Dunstan, J. (2022). Natural language processing for clinical text in Spanish: The case of waiting lists in Chile. Revista Medica Clinica Las Condes, 33(6), 576–582. https://doi.org/10.1016/j.rmclc.2022.10.002

Barranco Castañeda, G., Oropeza Cornejo, R., & Posada Galarza, M. E. (2020). Seguridad del paciente y uso de medicamentos, perspectiva del profesional farmacéutico en México enfocado en el macroproceso de la medicación. Latin American Journal of Clinical Sciences and Medical Technology.

Bates, D. W., Leape, L. L., Cullen, D. J., Laird, N., Petersen, L. A., Teich, J. M., Burdick, E., Hickey, M., Kleefield, S., Shea, B., Vander Vliet, M., & Seger, D. L. (1998). Effect of computerized physician order entry and a team intervention on prevention of serious medication errors. JAMA, 280(15), 1311–1316. https://doi.org/10.1001/jama.280.15.1311

Boulkroune, A., Boubellouta, A., Bouzeriba, A., & Zouari, F. (2025). Practical finite-time fuzzy synchronization of chaotic systems with non-integer orders: Two chattering-free approaches. Journal of Systems Science and Systems Engineering, 34(3), 334–359. https://doi.org/10.1007/s11518-024-5635-7

Boulkroune, A., Hamel, S., Zouari, F., Boukabou, A., & Ibeas, A. (2017). Output-feedback controller based projective lag-synchronization of uncertain chaotic systems in the presence of input nonlinearities. Mathematical Problems in Engineering, 2017, Article 8045803. https://doi.org/10.1155/2017/8045803

Cámara de Diputados del H. Congreso de la Unión. (2024). Ley general de salud.

Carchiolo, V., Longheu, A., Reitano, G., & Zagarella, L. (2019). Medical prescription classification: A NLP-based approach. In Proceedings of the 2019 Federated Conference on Computer Science and Information Systems (FedCSIS) (pp. 605–609). IEEE. https://doi.org/10.15439/2019F197

Chala, A. I. i Rebellón-Martínez, I. (2024). Evaluación de la experiencia de telemedicina en consulta de Cirugía de cabeza y cuello en un centro de referencia en Manizales. Revista Colombiana de Cirugía, 39, 386–395. https://doi.org/10.30944/20117582.2498

Comisión Federal para la Protección contra Riesgos Sanitarios. (2018). Guía para comercialización de medicamentos controlados en farmacias. Gobierno de México.

Consejo de Salubridad General. (2023). Modelo único de evaluación de la calidad: Anexo B. Criterios y estándares para hospitales. Dirección General de Calidad y Educación en Salud. https://www.ssaver.gob.mx/ccs/wp-content/uploads/sites/35/2024/01/Anexo_B._Criterios_y_Estxndares_Hospitales._V.20-07-2023.pdf

Fernández-Tapia, J. (2021). Avances y limitaciones en las políticas públicas de e-Salud en México. ComHumanitas: Revista Científica de Comunicación, 12(1), 152–178. https://doi.org/10.31207/rch.v12i1.303

Hodkinson, A., Tyler, N., Ashcroft, D. M., Keers, R. N., Khan, K., Phipps, D., Abuzour, A., Bower, P., Avery, A., Campbell, S., & Panagioti, M. (2020). Preventable medication harm across health care settings: A systematic review and meta-analysis. BMC Medicine, 18(1), Article 313. https://doi.org/10.1186/s12916-020-01774-9

Jeilani, A., & Hussein, A. (2025). Impact of digital health technologies adoption on healthcare workers’ performance and workload: Perspective with DOI and TOE models. BMC Health Services Research, 25(1), Article 142. https://doi.org/10.1186/s12913-025-12414-4

Junta Internacional de Fiscalización de Estupefacientes. (2023). Lista de sustancias sicotrópicas sometidas a fiscalización internacional.

Jurafsky, D., & Martin, J. H. (2026). Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition with language models (3rd ed. draft).

Martínez-Ruiz, M. G., Corona-Ruiz, F., Solís-Rivera, A. P., Sifuentes-Franco, S., Sánchez-López, V. A., Guevara-Martínez, S. J., & Huerta-Olvera, S. G. (2023). Potentially inappropriate prescriptions in geriatric patients hospitalized in the internal medicine department of a referral hospital in Mexico. Gaceta Medica de Mexico, 159(2), 150–156. https://doi.org/10.24875/GMM.22000376

Masoumi, S., Amirkhani, H., Sadeghian, N., & Shahraz, S. (2024). Natural language processing (NLP) to facilitate abstract review in medical research: The application of BioBERT to exploring the 20-year use of NLP in medical research. Systematic Reviews, 13(1), Article 13. https://doi.org/10.1186/s13643-024-02470-y

Mejía Vázquez, R., Delgado Cruz, F., Salgado Schoelly, H., & Kai Forzán, J. (2018). Manejo farmacológico de las complicaciones crónicas de la diabetes mellitus (DM). Secretaría de Salud de la Ciudad de México.

Moscoso Paredes, A. J., & Titto Beltran, O. M. (2015). Problemática de las drogas: Orientaciones generales.

Navarro, E. M., Ramos Álvarez, A. N., & Soler Anguiano, F. I. (2022). A new telesurgery generation supported by 5G technology: Benefits and future trends. Procedia Computer Science, 200, 31–38. https://doi.org/10.1016/j.procs.2022.01.202

Qiao, H., Chen, Y., Qian, C., & Guo, Y. (2024). Clinical data mining: Challenges, opportunities, and recommendations for translational applications. Journal of Translational Medicine, 22(1), Article 50. https://doi.org/10.1186/s12967-024-05005-0

Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). Robust speech recognition via large-scale weak supervision. arXiv. https://doi.org/10.48550/arXiv.2212.04356

Rai, V., & Singh, S. (2024). A review paper on pharmacovigilance: An overview. ResearchGate.

Rey-Pineda, E., & Estrada-Hernández, L. O. (2014). Errores de medicación en pacientes del Hospital Regional Lic. Adolfo López Mateos del ISSSTE.

Villena, F., & Dunstan, J. (2019). Obtención automática de palabras clave en textos clínicos: Una aplicación de procesamiento del lenguaje natural a datos masivos de sospecha diagnóstica en Chile. Revista Médica de Chile, 147(10).

World Health Organization. (2022). La OMS pide a los países que actúen urgentemente para lograr la medicación sin daño. https://www.who.int/es/news/item/16-09-2022-who-calls-for-urgent-action-by-countries-for-achieving-medication-without-harm

World Health Organization. (2023). Seguridad del paciente. https://www.who.int/es/news-room/events/detail/2023/09/17/default-calendar/world-patient-safety-day-2023--engaging-patients-for-patient-safety

Yang, R., Zeng, Q., You, K., Qiao, Y., Huang, L., Hsieh, C. C., Rosand, B., Goldwasser, J., Dave, A., Keenan, T., Ke, Y., Hong, C., Liu, N., Chew, E., Radev, D., Lu, Z., Xu, H., Chen, Q., & Li, I. (2024). Ascle—A Python natural language processing toolkit for medical text generation: Development and evaluation study. Journal of Medical Internet Research, 26, Article e60601. https://doi.org/10.2196/60601

COLMENARES-GUILLÉN, L. E., & MÉNDEZ-MENESES, A. A. (2026). An automatic speech recognition approach for controlled medications prescription with natural language processing. Applied Computer Science, 22(2), 48–66. https://doi.org/10.35784/acs_8699

An automatic speech recognition approach for controlled medications prescription with natural language processing

Issue Vol. 22 No. 2 (2026)

Archives

Authors

Abstract

Keywords:

Sustainable Development Goals (SDG)

References

License

Article Sidebar

Issue Vol. 22 No. 2 (2026)

Archives

Main Article Content

Authors

Abstract

Keywords:

Sustainable Development Goals (SDG)

References

Article Details

License