Analysis of the impact of machine learning algorithms on the quality of generated sounds
Article Sidebar
Issue Vol. 38 (2026)
-
Comparative analysis of Next.js and Astro frameworks
Patryk Gieda, Marek Miłosz1-5
-
Behavioral analysis of ransomware threats to ESXi Hypervisors: a machine learning-based predictive model
Upakar Bhatta6-10
-
Performance and usability evaluation of a VR virtual museum application
Kamil Gabrysiewicz, Krzysztof Dziedzic11-18
-
Comparative analysis of selected data visualization methods
Damian Węcławski, Radosław Tomczyk, Paweł Powroźnik19-25
-
A performance comparison of web programming interfaces: GraphQL, gRPC and Thrift
Piotr Rożek, Mariusz Dzieńkowski26-31
-
A review of security mechanisms in electronic payment systems
Omniah ALibrahim, Suhair Alshehri32-42
-
Clustering methods in machine learning
Bartłomiej Głuszczak, Paweł Powroźnik43-50
-
Comparative analysis of interpretable artificial intelligence methods
Aleksandra Kuszewska, Małgorzata Charytanowicz51-58
-
Comparative analysis of machine learning classifiers
Łukasz Krukowski, Grzegorz Kozieł59-65
-
Analysis of the impact of machine learning algorithms on the quality of generated sounds
Krzysztof Pedrycz, Mateusz Pikula66-72
-
Comparative analysis of the functionalities of applications supporting the self-control process of anticoagulation therapy
Marcin Furmaga, Vitalii Baida73-80
-
Application of machine learning for predicting Formula 1 race results
Sylwia Krzysztoń, Jakub Smołka81-86
-
Analysis of latency reduction and performance improvement methods in selected VR applications
Mateusz Czapczyński, Krzysztof Dziedzic87-94
-
Integrating deep learning image analysis into Web GIS applications: A Hybrid Flask - Spring Boot architecture for automated place detection
Medjon HYSENAJ95-101
Main Article Content
DOI
Authors
Abstract
Music generation using broadly understood AI is an evolving field with many challenges and opportunities. This thesis explores the use of generative adversarial networks for this endeavour, focusing on and comparing variety of different solutions that are already developed. Various architectures were tested and evaluated, in order to find the most effective approach to generating music. The results demonstrate that, although there are many solutions that can generate music that is both coherent and creative, there is still place for improvement in terms of model stability and created music quality. This work contributes to the understanding of generative adversarial networks in music generation and provides a foundation for future research in this area.
Keywords:
References
[1] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio, Generative Adversarial Nets, Advances in Neural Information Processing Systems 27 (2014) 2672-2680, https://doi.org/10.48550/arXiv.1406.2661.
[2] H.-W. Dong, W.-Y. Hsiao, L.-C. Yang, Y.-H. Yang, MuseGAN: Multitrack sequential generative adversarial networks for symbolic music generation and accompaniment, arXiv preprint arXiv:1709.06298 (2017), https://doi.org/10.48550/arXiv.1709.06298. DOI: https://doi.org/10.1609/aaai.v32i1.11312
[3] M. Mirza, S. Osindero, Conditional Generative Adversarial Nets, arXiv preprint arXiv:1411.1784 (2014), https://doi.org/10.48550/arXiv.1411.1784.
[4] A. Radford, L. Metz, S. Chintala, Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, arXiv preprint arXiv:1511.06434 (2016), https://doi.org/10.48550/arXiv.1511.06434.
[5] E. Denton, S. Chintala, A. Szlam, R. Fergus, Deep Generative Image Models Using a Laplacian Pyramid of Adversarial Networks, Advances in Neural Information Processing Systems 28 (2015) 1486–1494, https://doi.org/10.48550/arXiv.1506.05751.
[6] J.-Y. Zhu, T. Park, P. Isola, A. A. Efros, Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks, arXiv preprint arXiv:1703.10593 (2017), https://doi.org/10.48550/arXiv.1703.10593. DOI: https://doi.org/10.1109/ICCV.2017.244
[7] H. Zhang, I. Goodfellow, D. Metaxas, A. Odena, Self-Attention Generative Adversarial Networks, Proceedings of the 36th International Conference on Machine Learning (2019) 7354–7363, https://doi.org/10.48550/arXiv.1805.08318.
[8] N. Kodali, J. Hays, J. Abernethy, Z. Kira, On Convergence and Stability of GANs, arXiv preprint arXiv:1705.07215 (2017), https://doi.org/10.48550/arXiv.1705.07215.
[9] L. Barillaro, Deep Learning Platforms: Keras, PyTorch, Tensorflow, Reference Module in Life Sciences, Elsevier, 2024, B9780323955027001676, B9780323955027000932, B9780323955027000920, https://doi.org/10.1016/B978-0-323-95502-7.00092-0, https://doi.org/10.1016/B978-0-323-95502-7.00093-2, https://doi.org/10.1016/B978-0-323-95502-7.00167-6. DOI: https://doi.org/10.1016/B978-0-323-95502-7.00167-6
[10] M. Saito, E. Matsumoto, S. Saito, Temporal Generative Adversarial Nets with Singular Value Clipping, arXiv preprint arXiv:1611.06624 (2016), https://doi.org/10.48550/arXiv.1611.06624. DOI: https://doi.org/10.1109/ICCV.2017.308
[11] A. Schnepf, F. Vasile, U. Tanielian, 3DGEN: A GAN-Based Approach for Generating Novel 3D Models from Image Data, arXiv preprint arXiv:2312.08094 (2023), https://doi.org/10.48550/arXiv.2312.08094.
[12] C. Vondrick, H. Pirsiavash, A. Torralba, Generating Videos with Scene Dynamics, arXiv preprint arXiv:1609.02612 (2016), https://doi.org/10.48550/arXiv.1609.02612.
[13] H. Gao, X. Yang, Y. Hu, B. Wang, H. Xu, Z. Liang, H. Mu, Y. Wang, Y. Chen, GANs-generated Synthetic Datasets for Face Alignment Algorithms in Complex Environments, Applied Soft Computing 151 (2024) 112260, https://doi.org/10.1016/j.asoc.2024.112260. DOI: https://doi.org/10.1016/j.asoc.2024.112260
[14] D. Efimov, D. Xu, L. Kong, A. Nefedov, A. Anandakrishnan, Using Generative Adversarial Networks to Synthesize Artificial Financial Datasets, arXiv preprint arXiv:2002.02271 (2020), https://doi.org/10.48550/arXiv.2002.02271.
[15] J. Yu, Z. Lin, J. Yang, X. Shen, X. Lu, T. S. Huang, Generative Image Inpainting with Contextual Attention, arXiv preprint arXiv:1801.07892 (2018), https://doi.org/10.48550/arXiv.1801.07892. DOI: https://doi.org/10.1109/CVPR.2018.00577
[16] K. Schawinski, C. Zhang, H. Zhang, L. Fowler, G. K. Santhanam, Generative Adversarial Networks Recover Features in Astrophysical Images of Galaxies Beyond the Deconvolution Limit, Monthly Notices of the Royal Astronomical Society Letters 467(1) (2017) L110–L114, https://doi.org/10.1093/mnrasl/slx008. DOI: https://doi.org/10.1093/mnrasl/slx008
[17] X. Wang, K. Yu, S. Wu, J. Gu, Y. Liu, C. Dong, C. C. Loy, Y. Qiao, X. Tang, ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks, arXiv preprint arXiv:1809.00219 (2018), https://doi.org/10.48550/arXiv.1809.00219. DOI: https://doi.org/10.1007/978-3-030-11021-5_5
[18] R. V. Bisneto, A. O. Carvalho Filho, D. M. V. Magalhães, Generative Adversarial Network and Texture Features Applied to Automatic Glaucoma Detection, Applied Soft Computing 91 (2020) 106165, https://doi.org/10.1016/j.asoc.2020.106165. DOI: https://doi.org/10.1016/j.asoc.2020.106165
[19] Y. Mirsky, T. Mahler, I. Shelef, Y. Elovici, CT-GAN: Malicious Tampering of 3D Medical Imagery Using Deep Learning, arXiv preprint arXiv:1901.03597 (2019), https://doi.org/10.48550/arXiv.1901.03597.
[20] J. Rothstein, MIDI: A Comprehensive Introduction, A-R Editions, Madison, 1995.
[21] Multimedia Programming Interface and Data Specifications 1.0, TEX–basics, https://www.tactilemedia.com/info/MCI_Control_Info.html, [12.01.2025].
[22] L. Zheng, C. Li, Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN), Proceedings of the IEEE (2024), https://ieeexplore.ieee.org/document/10557596. DOI: https://doi.org/10.1109/ACCESS.2024.3414673
[23] K. van den Broek, MP3net: Coherent, Minute-Long Music Generation from Raw Audio with a Simple Convolutional GAN, arXiv preprint arXiv:2101.04785 (2021), https://doi.org/10.48550/arXiv.2101.04785.
[24] M. Zheng, P. Bai, X. Shi, X. Zhou, Y. Yan, FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis, Proceedings of the AAAI Conference on Artificial Intelligence 38(17) (2024) 29943, https://doi.org/10.1609/aaai.v38i17.29943. DOI: https://doi.org/10.1609/aaai.v38i17.29943
[25] H.-T. Hung, J. Ching, S. Doh, N. Kim, J. Nam, Y.-H. Yang, EMOPIA: A Multi-Modal Pop Piano Dataset for Emotion Recognition and Emotion-Based Music Generation, arXiv preprint arXiv:2108.01374 (2021), https://doi.org/10.48550/arXiv.2108.01374.
[26] F. Schneider, O. Kamal, Z. Jin, B. Schölkopf, Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion, arXiv preprint arXiv:2301.11757 (2023), https://doi.org/10.48550/arXiv.2301.11757.
[27] N. Trieu, R. Keller, JazzGAN: Improvising with Generative Adversarial Networks, Proceedings of the 21st International Conference on Principles and Practice of Constraint Programming (2018), https://www.semanticscholar.org/paper/JazzGAN-%3A-Improvising-with-Generative-Adversarial-Trieu-Keller/1ccaf621cf9a9d25ee4f81dd287af3b0fd2c7989.
[28] M. Wiatrak, S. V. Albrecht, Stabilizing Generative Adversarial Network Training: A Survey, arXiv preprint arXiv:1910.00927 (2019), https://doi.org/10.48550/arXiv.1910.00927.
[29] B. Yilmaz, R. Korn, A Comprehensive Guide to Generative Adversarial Networks (GANs) and Application to Individual Electricity Demand, Expert Systems with Applications 247 (2024) 123851, https://doi.org/10.1016/j.eswa.2024.123851. DOI: https://doi.org/10.1016/j.eswa.2024.123851
[30] Y. Yazici, C.-S. Foo, S. Winkler, K.-H. Yap, V. Chandrasekhar, Empirical Analysis of Overfitting and Mode Drop in GAN Training, arXiv preprint arXiv:2006.14265 (2020), https://doi.org/10.48550/arXiv.2006.14265. DOI: https://doi.org/10.1109/ICIP40778.2020.9191083
[31] L.-C. Yang, S.-Y. Chou, Y.-H. Yang, MidiNet: A Convolutional Generative Adversarial Network for Symbolic-Domain Music Generation, arXiv preprint arXiv:1703.10847 (2017), https://doi.org/10.48550/arXiv.1703.10847.
[32] A. van den Oord, S. Dieleman, H. Zen, K. Simonyan, O. Vinyals, A. Graves, N. Kalchbrenner, A. Senior, K. Kavukcuoglu, WaveNet: A Generative Model for Raw Audio, arXiv preprint arXiv:1609.03499 (2016), https://doi.org/10.48550/arXiv.1609.03499.
[33] L. Yu, W. Zhang, J. Wang, Y. Yu, SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient, Proceedings of the AAAI Conference on Artificial Intelligence 31(1) (2017) 2852-2858, https://doi.org/10.48550/arXiv.1609.05473. DOI: https://doi.org/10.1609/aaai.v31i1.10804
Article Details
Abstract views: 4

