An automatic speech recognition approach for controlled medications prescription with natural language processing
Article Sidebar
Issue Vol. 22 No. 2 (2026)
-
Path planning in swarm robotics exploration using SARSA and ACO algorithms
Aicha HAFID, Riadh HOCINE, Lahcene GUEZOULI1-15
-
Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN
Marco Manuel ARAGON PAUCAR, Kelvin Yhonson FERNANDEZ ACERO, Erasmo SULLA ESPINOZA16-32
-
Assessing the effectiveness of one-stage and two-stage methods for identifying high-voltage power grid equipment in UAV imagery
Thi Thanh Tan NGUYEN, Thi Thu Nga VU33-47
-
An automatic speech recognition approach for controlled medications prescription with natural language processing
Luis Enrique COLMENARES-GUILLÉN, Angel Axel MÉNDEZ-MENESES48-66
-
Improving image retrieval using CNN with PCA and Optimized K-Means clustering
Mohsin Hasan HUSSEIN, Ali Mohsin Ahmed AL-SABAAWI, Zakaria A. Hamed ALNAISH67-84
-
Numerical investigation into the hydrodynamic characteristics of water vortex turbines with varied blade angles
Sarwo EDHY SOFYAN, Zamzami, Akhyar AKHYAR, Suriadi, Agus SASMITO85-104
-
Optimization of the corporate cluster structure using the Tabu Search method
Andrzej IMIEŁOWSKI, Łukasz BANAŚ, Bogusław TWARÓG, Janusz BYTNAR105-116
-
Application controls audit framework in the context of ERP systems
Sakchai TANGPRASERT, Nalinpat BHUMPENPEIN117-125
-
Autonomous AI agents in digital markets: Economic implications for competition, pricing, and regulation
Elmira KYDYRBAYEVA, Balhiya SHOMSHEKOVA, Asset ABZHAKOV, Ainur ASHIMOVA, Assel NURTAYEVA126-137
-
Multi-criteria analysis of parameter impact in large-scale robotic 3D printing
Łukasz SOBASZEK, Ivan GAJDOŠ, Pavol ŠTEFČÁK138-147
-
Designing cloud-based knowledge management systems to improve organizational innovation
Hayfaa Subhi MALALLAH, Sherzad Mohammad AJEEL148-168
-
Data normalisation methods on microarray data
Inggih PERNAMA, Shir Li WANG, Hoi Yeh LEE, Suliana SULAIMAN, Hasnatul Nazuha HASSAN169-179
-
Log-based learning analytics of gamified Moodle activities: Quantifying student engagement
Iva GRUBJEŠIĆ, Tomislav IVANJKO, Vedran JURIČIĆ180-192
-
SFAB-Net: Semantic segmentation network for railway track surface defects based on Spatial Fusion and Adaptive Bottleneck feature enhancement
Qike WU, Sharafiz ABDUL RAHIM, Sai Hong TANG, Muhammad Azim AZIZI, Li ZHANG193-207
-
Machine learning approach to detect GAI-disguised academic programming plagiarism
Oscar KARNALIM, Yehezkiel David SETIAWAN, Maresha Caroline WIJANTO, Rossevine Artha NATHASYA208-224
Archives
-
Vol. 22 No. 2
2026-06-30 15
-
Vol. 22 No. 1
2026-03-31 15
-
Vol. 21 No. 4
2025-12-31 12
-
Vol. 21 No. 3
2025-09-30 12
-
Vol. 21 No. 2
2025-06-30 12
-
Vol. 21 No. 1
2025-03-31 12
-
Vol. 20 No. 4
2024-12-31 12
-
Vol. 20 No. 3
2024-09-30 12
-
Vol. 20 No. 2
2024-06-30 12
-
Vol. 20 No. 1
2024-03-30 12
-
Vol. 19 No. 4
2023-12-31 10
-
Vol. 19 No. 3
2023-09-30 10
-
Vol. 19 No. 2
2023-06-30 10
-
Vol. 19 No. 1
2023-03-31 10
-
Vol. 18 No. 4
2022-12-30 8
-
Vol. 18 No. 3
2022-09-30 8
-
Vol. 18 No. 2
2022-06-30 8
-
Vol. 18 No. 1
2022-03-31 8
Main Article Content
Authors
enrique.colmenares@correo.buap.mx
angel.mendezmen@alumno.buap.mx
Abstract
The prescription and documentation of controlled medications require strict regulatory compliance and high transcription accuracy to prevent medication errors and ensure traceability. In many hospitals, these processes are still performed manually, increasing the risk of transcription errors, administrative delays, and non-compliance with regulatory standards, particularly for medications classified under fractions II and III of the Mexican General Health Law. Addressing this challenge requires intelligent systems capable of accurately transcribing and structuring medical prescriptions from spoken language. This study presents the design and development of an Automatic Speech Recognition (ASR) system integrated with Natural Language Processing (NLP) to support the generation and transcription of controlled medication prescriptions. The system architecture was developed following an analysis of the clinical workflow for medication requests, management, prescription, and transcription, conducted in collaboration with healthcare professionals from the hospital's Pharmacovigilance Department in Puebla, Mexico, and aligned with hospital operational standards. The methodology involved evaluating and fine-tuning three ASR models to improve transcription accuracy for medication names, dosages, and prescription instructions. NLP techniques were subsequently applied to identify and structure key prescription entities, ensuring compliance with national health regulations. Among the evaluated models, the Wav2Vec2 architecture developed by Jonatas Grosman demonstrated the best performance and was selected for implementation. Experimental results show that the optimized ASR model achieved a Word Error Rate (WER) of 6.30%, a precision of 94.72%, a recall of 91.73%, and an F1-score of 93.22%. These results demonstrate the effectiveness of the proposed approach in improving transcription accuracy while reducing false positives in prescription generation. The proposed system highlights the potential of ASR–NLP integration to enhance efficiency, accuracy, and regulatory compliance in hospital pharmacovigilance processes.
Keywords:
Sustainable Development Goals (SDG)
- 3 - Good health and well-being
References
Ayuzo del Valle, N., González Camid, E., Villegas Macedo, F., Flores Osorio, J., & Bosques Padilla, F. (2021). Impacto del Servicio de Farmacia en la disminución de errores en la medicación en pediatría. Revista de la OFIL, 31(2), 161–165.
Baevski, A., Zhou, H., Mohamed, A., & Auli, M. (2020). wav2vec 2.0: A framework for self-supervised learning of speech representations. ArXiv, abs/2006.11477. https://doi.org/10.48550/arXiv.2006.11477
Báez, P., Arancibia, A. P., Chaparro, M. I., Bucarey, T., Núñez, F., & Dunstan, J. (2022). Natural language processing for clinical text in Spanish: The case of waiting lists in Chile. Revista Medica Clinica Las Condes, 33(6), 576–582. https://doi.org/10.1016/j.rmclc.2022.10.002
Barranco Castañeda, G., Oropeza Cornejo, R., & Posada Galarza, M. E. (2020). Seguridad del paciente y uso de medicamentos, perspectiva del profesional farmacéutico en México enfocado en el macroproceso de la medicación. Latin American Journal of Clinical Sciences and Medical Technology.
Bates, D. W., Leape, L. L., Cullen, D. J., Laird, N., Petersen, L. A., Teich, J. M., Burdick, E., Hickey, M., Kleefield, S., Shea, B., Vander Vliet, M., & Seger, D. L. (1998). Effect of computerized physician order entry and a team intervention on prevention of serious medication errors. JAMA, 280(15), 1311–1316. https://doi.org/10.1001/jama.280.15.1311
Boulkroune, A., Boubellouta, A., Bouzeriba, A., & Zouari, F. (2025). Practical finite-time fuzzy synchronization of chaotic systems with non-integer orders: Two chattering-free approaches. Journal of Systems Science and Systems Engineering, 34(3), 334–359. https://doi.org/10.1007/s11518-024-5635-7
Boulkroune, A., Hamel, S., Zouari, F., Boukabou, A., & Ibeas, A. (2017). Output-feedback controller based projective lag-synchronization of uncertain chaotic systems in the presence of input nonlinearities. Mathematical Problems in Engineering, 2017, Article 8045803. https://doi.org/10.1155/2017/8045803
Cámara de Diputados del H. Congreso de la Unión. (2024). Ley general de salud.
Carchiolo, V., Longheu, A., Reitano, G., & Zagarella, L. (2019). Medical prescription classification: A NLP-based approach. In Proceedings of the 2019 Federated Conference on Computer Science and Information Systems (FedCSIS) (pp. 605–609). IEEE. https://doi.org/10.15439/2019F197
Chala, A. I. i Rebellón-Martínez, I. (2024). Evaluación de la experiencia de telemedicina en consulta de Cirugía de cabeza y cuello en un centro de referencia en Manizales. Revista Colombiana de Cirugía, 39, 386–395. https://doi.org/10.30944/20117582.2498
Comisión Federal para la Protección contra Riesgos Sanitarios. (2018). Guía para comercialización de medicamentos controlados en farmacias. Gobierno de México.
Consejo de Salubridad General. (2023). Modelo único de evaluación de la calidad: Anexo B. Criterios y estándares para hospitales. Dirección General de Calidad y Educación en Salud. https://www.ssaver.gob.mx/ccs/wp-content/uploads/sites/35/2024/01/Anexo_B._Criterios_y_Estxndares_Hospitales._V.20-07-2023.pdf
Fernández-Tapia, J. (2021). Avances y limitaciones en las políticas públicas de e-Salud en México. ComHumanitas: Revista Científica de Comunicación, 12(1), 152–178. https://doi.org/10.31207/rch.v12i1.303
Hodkinson, A., Tyler, N., Ashcroft, D. M., Keers, R. N., Khan, K., Phipps, D., Abuzour, A., Bower, P., Avery, A., Campbell, S., & Panagioti, M. (2020). Preventable medication harm across health care settings: A systematic review and meta-analysis. BMC Medicine, 18(1), Article 313. https://doi.org/10.1186/s12916-020-01774-9
Jeilani, A., & Hussein, A. (2025). Impact of digital health technologies adoption on healthcare workers’ performance and workload: Perspective with DOI and TOE models. BMC Health Services Research, 25(1), Article 142. https://doi.org/10.1186/s12913-025-12414-4
Junta Internacional de Fiscalización de Estupefacientes. (2023). Lista de sustancias sicotrópicas sometidas a fiscalización internacional.
Jurafsky, D., & Martin, J. H. (2026). Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition with language models (3rd ed. draft).
Martínez-Ruiz, M. G., Corona-Ruiz, F., Solís-Rivera, A. P., Sifuentes-Franco, S., Sánchez-López, V. A., Guevara-Martínez, S. J., & Huerta-Olvera, S. G. (2023). Potentially inappropriate prescriptions in geriatric patients hospitalized in the internal medicine department of a referral hospital in Mexico. Gaceta Medica de Mexico, 159(2), 150–156. https://doi.org/10.24875/GMM.22000376
Masoumi, S., Amirkhani, H., Sadeghian, N., & Shahraz, S. (2024). Natural language processing (NLP) to facilitate abstract review in medical research: The application of BioBERT to exploring the 20-year use of NLP in medical research. Systematic Reviews, 13(1), Article 13. https://doi.org/10.1186/s13643-024-02470-y
Mejía Vázquez, R., Delgado Cruz, F., Salgado Schoelly, H., & Kai Forzán, J. (2018). Manejo farmacológico de las complicaciones crónicas de la diabetes mellitus (DM). Secretaría de Salud de la Ciudad de México.
Moscoso Paredes, A. J., & Titto Beltran, O. M. (2015). Problemática de las drogas: Orientaciones generales.
Navarro, E. M., Ramos Álvarez, A. N., & Soler Anguiano, F. I. (2022). A new telesurgery generation supported by 5G technology: Benefits and future trends. Procedia Computer Science, 200, 31–38. https://doi.org/10.1016/j.procs.2022.01.202
Qiao, H., Chen, Y., Qian, C., & Guo, Y. (2024). Clinical data mining: Challenges, opportunities, and recommendations for translational applications. Journal of Translational Medicine, 22(1), Article 50. https://doi.org/10.1186/s12967-024-05005-0
Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). Robust speech recognition via large-scale weak supervision. arXiv. https://doi.org/10.48550/arXiv.2212.04356
Rai, V., & Singh, S. (2024). A review paper on pharmacovigilance: An overview. ResearchGate.
Rey-Pineda, E., & Estrada-Hernández, L. O. (2014). Errores de medicación en pacientes del Hospital Regional Lic. Adolfo López Mateos del ISSSTE.
Villena, F., & Dunstan, J. (2019). Obtención automática de palabras clave en textos clínicos: Una aplicación de procesamiento del lenguaje natural a datos masivos de sospecha diagnóstica en Chile. Revista Médica de Chile, 147(10).
World Health Organization. (2022). La OMS pide a los países que actúen urgentemente para lograr la medicación sin daño. https://www.who.int/es/news/item/16-09-2022-who-calls-for-urgent-action-by-countries-for-achieving-medication-without-harm
World Health Organization. (2023). Seguridad del paciente. https://www.who.int/es/news-room/events/detail/2023/09/17/default-calendar/world-patient-safety-day-2023--engaging-patients-for-patient-safety
Yang, R., Zeng, Q., You, K., Qiao, Y., Huang, L., Hsieh, C. C., Rosand, B., Goldwasser, J., Dave, A., Keenan, T., Ke, Y., Hong, C., Liu, N., Chew, E., Radev, D., Lu, Z., Xu, H., Chen, Q., & Li, I. (2024). Ascle—A Python natural language processing toolkit for medical text generation: Development and evaluation study. Journal of Medical Internet Research, 26, Article e60601. https://doi.org/10.2196/60601
Article Details
Abstract views: 6
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in Applied Computer Science are open-access and distributed under the terms of the Creative Commons Attribution 4.0 International License.
