Acceso abierto

Research on Teaching Innovation Strategies in Music Theory Courses in Colleges and Universities under Digital Technology Support

  
19 mar 2025

Cite
Descargar portada

Introduction

Music has a pivotal position in the aesthetic education in school, it has its unique functions, such as the melodic beauty of the music, the language of the lyrics, the beauty of the mood, etc. to cultivate people’s sentiments, purify people’s minds, so as to form the beauty of the behavior in practice [1-3]. Traditional music education activities only stay in the role of students’ sense of hearing, while ignoring the students’ way of thinking is actually based on concrete image thinking and gradually transformed into abstract thinking. The music image is not easy to directly perceive the abstract image, so to the music education activities in colleges and universities has increased the difficulty, affecting the quality of music education [4-7].

With the popularization of modernization of education, information technology will be more widely used in music teaching, and its advanced technology, superiority, intelligence will be fully reflected. Information technology with its unique audio-visual association of technology, with sound and image as a whole, graphic and text, realistic image, large amount of information and rich resources [8-11], in the classroom teaching, you can get a strong interaction between teachers and students, immediate response, colorful teaching content and lively teaching forms and other teaching effects [12-13]. In addition, due to the organic integration of information technology and music teaching, it breaks through the restrictions of traditional music teaching in time, space and geography, and provides a most direct and fastest way for students to understand the rich cultural heritage behind the music, which is more conducive to the students’ learning and understanding of music [14-17].

Literature [18] examined the innovation of music courses in colleges and universities in the context of digital multimedia VR technology. Taking questionnaires, interviews and considering the implemented digital multimedia VR art course as the object of the study, the results point out that a large percentage of students are positive about this technology, while teachers only occasionally apply it in their teaching. Literature [19] describes the benefits of the use of innovative information technology in teaching and learning that can enhance learning and improve the organization of teaching activities, while information technology in the music classroom is manifested in the form of music players, music training programs, etc., and emphasizes that the introduction of innovative information technology in the system of music education expands the opportunities for learning. Literature [20] points out that the ways to improve the quality of music education in an intelligent educational environment include optimizing the teaching process and restructuring the form and content of music education. It reveals the shortcomings of music education and proposes an innovative model for teaching the integration of information technology and music education. Literature [21] aims to assess the effectiveness of digital technology in music education and its impact on teaching quality. Using statistics, modeling, and other aspects of the study, the results show that the integration of music education and modern digital technology teaching can be conducive to improving teaching effectiveness, stimulating students’ interest in learning and music resources preservation. Literature [22] conducted a study on music information technology with the help of the TPACK integration framework and PBL approach. The results emphasized the important role of this framework in teaching and indicated that music teachers should have the ability to use technology proficiently to enhance teaching and learning, in addition to mastering mathematical content and knowledge. Literature [23] explored intelligent instructional design with artificial intelligence technology. A scientific intelligent music instructional design model was created using big data, artificial intelligence and other information technologies, which is conducive to guiding teachers to carry out intelligent teaching, improving students’ independent learning, and promoting the overall intelligent transformation of teaching, which plays an important role in cultivating high-quality musical talents.

Literature [24] combines machine learning technology with IoT audio technology, aiming to improve the music audio recognition algorithm. It also created the layout and software system architecture of the smart music classroom, and the experimental results specified that IoT audio technology has an important role in music teaching. Literature [25] aims to study the application of smart technology in teaching music and aesthetics. A comparative experiment conducted with students concluded that the application of smart technologies can increase students’ motivation. Literature [26] examined the impact of digital technology on the motivation and professional skill development of Chinese students in learning music, and the findings revealed that the majority of Chinese students are still using the traditional way of teaching music, which suggests that there is a need to develop a training program in digital technology to improve students’ motivation and professional skills. Literature [27] emphasized the reform and innovation of vocal music teaching in the new era. It is pointed out that improving the teaching quality and level of vocal music education requires a reconstruction of its teaching model to adapt to the digital era. The significance of the reform and innovation of vocal music teaching in colleges and universities and the corresponding strategies are analyzed. Literature [28] explores the transformation of music education and teachers’ thinking in the context of technological development. A survey of eight teachers concluded that experienced teachers have re-shifted their teaching strategies in order to transform their views on teaching music technology, while younger teachers are more likely to embrace and utilize technology in music teaching in order to improve teaching effectiveness. Literature [29] reveals the relationship between big data and music teaching, identifies the problems of music teaching in the context of big data and its reform. It also describes the path of diversified design of music education reform, which provides reference for the research on the development and evaluation of music teaching mode. Literature [30] outlines the use of interactive whiteboards, apps, and multimedia materials in the music classroom and analyzes the advantages based on actual cases of various methods. The importance of integrating music teaching and innovative technologies in improving learning outcomes is emphasized.

This paper integrates digital technology and music theory courses and proposes a music teaching model based on the computer music software Cubase12. Using the audio recognition technology in the Cubase12 music software plug-in, the time domain and frequency domain features of the audio are analyzed so as to judge the accuracy of the player’s performance and help teachers adjust their strategies instantly. Comparative analysis of the use of computer music software Cubase12 for teaching and traditional teaching methods. Organize and summarize the results of the two teaching methods in terms of performance, pitch, time domain, frequency, and other factors. Through empirical research, the specific application and effect of music software Cubase12 in these areas are explored.

Pedagogical innovations based on music software
Basics of the music software Cubase12

In the learning process of music majors in higher vocational colleges and universities, music theory course is their compulsory course, which is the foundation and the key to learning, providing strong support for students when they engage in music-related activities, and is very meaningful to ensure the smooth implementation of students’ learning activities and promote the improvement of students’ learning quality and efficiency. The music theory course should be committed to enabling students to master the basic elements of music, common structures and genres, including pitch, intervals, chords, rhythm, melody, etc. Students need to understand the important role of these basic elements in music, and be able to skillfully use them to analyze and create music. Through an in-depth study of music theory, students will be able to understand and analyze different styles of music, as well as improve their performance and compositional skills in practice. Through the study of composition and harmony, students will be able to better understand the construction of musical works and thus better express the music when performing and composing. For example, students should familiarize themselves with the composition and naming of scales and levels, and understand the use of basic scales and levels such as major, minor, and pentatonic scales. At the same time, students also need to master the composition and naming of rhythms and beats, and understand the use of different rhythmic patterns such as duple, triple, and quadruple beats.

Computer music is a product of the perfect fusion of digital technology and musical art, which has caused great changes in the history of music, especially in the field of music composition, promoting unprecedented development. The combination of software and hardware and the emergence of new things, even for the natural world, which cannot naturally produce sounds, have unlimited creative possibilities. Since ancient times, the development of science and technology has been accompanied by the development of music and art, such as the melting and manufacturing technology of bronzes created the birth of chimes, and the maturity of architectural technology realized the integration of musical instruments into the building of large pipe organs, etc., all of which are the progress and integration of science and technology with music and art, and the same is true for computer music. The development of computer music is shown in Figure 1: science and technology will continue to develop with the development of the times, music education will be the same, science and technology to bring the convenience of many aspects, we need to be based on their own actual situation to the reasonable application and development, in order to improve their own at the same time keep pace with the times.

Figure 1.

Computer music development process

Innovative models of teaching and learning

In music teaching, the traditional teaching mode is mostly narrated by the teacher, and students passively accept the knowledge, which can easily cause students’ interest in learning to decrease. Multimedia teaching involves using the computer as a platform, with the help of multimedia technology, to facilitate interactive communication between teachers, students, and computers, which is conducive to improving classroom efficiency. Through multimedia courseware, a good learning environment can be created that stimulates students’ interest and enthusiasm in learning music, resulting in better teaching results. The emergence of this model has given new life to traditional music teaching. In traditional teaching, teachers need to spend a lot of time and energy to prepare lessons, and in order to make students more familiar with the music, teachers need to spend a long time to analyze and study the teaching materials, which will waste a lot of time. Students cannot participate effectively in learning music, and they can only accept the knowledge passively. Nowadays, teaching through computerized music software can make it easier for teachers to prepare lessons and students can participate actively. After the new curriculum reform, the music curriculum also has a new content. Teachers can use computer music software when teaching music, so that students can have a richer experience in learning. Music software is also very helpful to music teaching. For example, when teaching music, teachers can use computer music software to let students experience the different sounds made by different instruments, so that they can better understand the different sounds made by different instruments.

Audio Recognizer Plug-in Platform

Audio Recognizer plug-in platform, also known as the sound library, belongs to one of the plug-ins in the host software, because the host software is equivalent to just an “empty shell”, which provides a creative platform, and many of the tools in the “empty shell” does not, so it is necessary to add other creators want to add material to the platform for creation. Cubase12 itself contains the HALionSonic library of sound source materials, which are both numerous and powerful.

Audio Recognition Technology
Audio Recognition System Framework

Audio recognition technology is a technology that allows computers and other devices to parse and understand human speech through the use of various algorithms. The core of this technology lies in converting sound signals into digital signals that can be processed by machines, and then analyzing these digital signals through complex algorithms to convert them into text or execute the appropriate commands. The process of sound recognition involves sound acquisition, feature extraction, pattern matching, and final interpretation and execution, a sequence of operations that relies on digital signal processing.

The framework of the recognition system is shown in Fig. 2, and the technique consists of the following five main steps: Step 1: Audio Signal Acquisition This operation is the initial stage of pitch detection, which involves the use of a suitable hardware device, such as a microphone or an audio interface, to capture the sound signals and convert them into a numerical model for subsequent data processing. Step 2: Audio Preprocessing Before the audio signal enters the pitch detection algorithm, pre-processing, such as noise reduction, removing noise, adjusting the sampling rate, etc., is usually required to improve the accuracy of the pitch detection. Step 3: Feature Extraction This step involves extracting pitch-related features from the preprocessed audio signal. This usually involves the analysis of time and frequency domain features such as the Short Time Fourier Transform (STFT) or the Mel Frequency Cepstrum Coefficient (MFCC). Step 4: Pitch Estimation After extracting the relevant features, the pitch estimation algorithm estimates the pitch of the audio signal based on these features. This usually involves the application of pattern recognition or machine learning algorithms. Step 5: Result Output Finally, the pitch detection system outputs the result of the pitch estimation, which is usually presented in terms of notes, Hertz (Hz), or other pitch representations. Through these five steps, pitch detection technology is able to accurately identify and measure pitch from an audio signal.

Figure 2.

Voice recognition system framework

Cubase 12 can store and analyze large amounts of data, thus helping teachers better understand student progress and needs. For example, by comparing students’ rhythmic analysis results at different points in time, teachers can clearly see the trajectory of their students’ progress and provide them with more targeted instruction.

Research on Audio Recognition Technology

Each sound in the audio can be considered as a short-time energy pulse signal, so the endpoint of the musical signal can be determined by obtaining the moment of occurrence of the short-time pulse. The energy of the musical signal is continuous in the time dimension, and the original audio signal is divided into frames and windows and its short-time energy is calculated: Ei=n=0L1|x(n)|2

where x(n) denotes the amplitude of the n rd point in the i nd frame of the signal, and L denotes the window length, which takes the value of 2% of the sampling frequency value Fs. The short-time energy difference ΔEi between the two frames before and after is calculated with the following equation: ΔEi=EiEi1

Each note corresponding to the starting point of the band segmentation has a distinct pulse mutation. However, the results also show that there are several smaller peaks around the peak corresponding to the starting point of each note, and thus the range of the segment needs to be filtered so that only one peak is retained for each note. This was achieved by setting two thresholds, minimum peak height and minimum peak spacing, where the minimum peak height was used to identify the maximum energy mutation corresponding to the starting point of the note, and the minimum peak spacing was used to filter out the pseudo-starting point of the note, i.e., to filter out the influence of several smaller wave peaks around it. All peaks obtained from the short-time energy difference are filtered by these two threshold thresholds to initially obtain the starting point for notes.

Then, according to each starting point, its corresponding end point is searched. By setting two thresholds, short-time energy and short-time over-zero rate, when both parameters of the signal are below the thresholds, the point is determined to be the preliminary obtained note endpoint corresponding to the current starting point. In order to reduce the influence of the algorithm by artificial thresholds and to improve the generalization ability and accuracy of the algorithm, two determination means are used. Judgement one is directed towards the preliminary endpoint obtained. If the endpoint corresponding to the current start point is located after the start point of its next note, the endpoint is judged to be found incorrectly, and the first n frame of the start point of the next note is taken as the endpoint corresponding to that start point. Judgment and for the difference of each pair of starting and ending points, set the shortest note length for the shortest duration of each note, calculate the difference of each pair of starting and ending points, if the difference is less than the shortest duration of the note, then the pair of starting and ending points is judged as noise and removed from the set.

Pitch recognition of musical notes

After the audio signal is segmented, the pitch recognition task is carried out for each note obtained from the segmentation. From the frequency point of view, a musical signal is composed of fundamental and overtones. The fundamental is the lowest frequency in the sound wave formed by the vibration of an object. The overtones, which are integer multiples of the fundamental frequency, are distributed differently, resulting in different timbres.

Correlation functions, including the correlation function and the autocorrelation function, describe the degree of similarity between two signals. The correlation function describes the correlation between two signals, while the autocorrelation function describes the synchronization and periodicity of the signals themselves. Consider a discrete periodic signal x(n), whose autocorrelation function is expressed as follows: R(k)=limN12N+1n=NNx(n)x(n+k) where N is the frame length and k is the number of delay sample points.

Generally speaking, the short-time autocorrelation function has the following properties:

The short-time autocorrelation function of a periodic signal has the same period as it.

The short-time autocorrelation function is an even function.

The maximum value of the short-time autocorrelation function occurs at k=0, i.e., the signal and the signal itself have the highest correlation.

The above properties show that the signal period is T, and the maximum value of its short-time autocorrelation function must be at an integer multiple of the period, then the fundamental frequency can be found by the distance between the peaks of the short-time autocorrelation function. If a frame of the signal, its short-time autocorrelation function between the neighboring peaks of the number of sampling points for k, then the fundamental frequency of f=Fsk $f = {\raise0.5ex\hbox{$\scriptstyle {{F_s}}$} \kern-0.1em/\kern-0.15em \lower0.25ex\hbox{$\scriptstyle k$}}$, of which Fs is the sampling frequency, this paper takes Fs = 44.1kHz.

Take the actual piano playing note A5 as an example, seek its short-time autocorrelation function, the curve is as follows, according to the nature of the autocorrelation function, obtain the position between the two wave peaks, can be calculated as the fundamental frequency of 888.89Hz, the fundamental frequency frequency of A5 in the piano pitch frequency control table is 880Hz, with an error of 1%, to verify the effectiveness of the method.

The short-time amplitude difference function adopts the difference operation, retaining the short-time autocorrelation function of similar nature, which can reduce the amount of calculation. Consider the discrete periodic signal x(n), whose amplitude difference function is expressed as follows: γn(k)=m=nn+Nk1|xω(m+k)xω(m)|

Where N is the frame length and k is the number of delayed sample points.

Similarly, the fundamental frequency of the current frame is calculated to be 888,89 Hz, which is an effective method and faster calculation.

The fundamental component is not always the highest intensity component, and the harmonic components can have a significant impact on the actual waveform, which can have a significant impact on the fundamental frequency solution. It should be noted that when using the above two methods to extract the fundamental frequency, the calculation result may be half or twice the actual fundamental frequency, because most of the peaks or valleys are generated by the resonance characteristics, resulting in a large number of redundant calculations, affecting the efficiency and accuracy.

Fundamental frequency extraction based on cepstrum is one of the frequency domain methods. Cepstrum analysis is a homomorphic signal analysis method, and its calculation process is as follows:

Find the Fourier transform of the signal.

Find the logarithm of the result of the previous step and take the absolute value.

Take the Fourier inverse transform.

For continuous signal x(n), its complex inverse spectrum is: x^(n)=12πππln|X^(ejω)|ejωndω

X^(ejω) is generally complex, and if an inverse transformation is performed on its real part, then the real inverse spectrum c(n) can be obtained, calculated as follows: c(n)=12πππln|X(ejω)|ejωndω

The calculated fundamental frequency is 872.73 Hz with an error of 1%. It should be noted that the cepstrum method has the disadvantages of large computational volume, high complexity, and poor immunity to noise.

Analysis of the effectiveness of the application of digitization technologies
Teaching Practices for Music Recognition

In this section, Cubase12 captures a piece of music in .wav format from a music learner’s performance of the piano work “Ode to Joy”, and the system reads the performance information and automatically extracts the main (important) musical features of the performance. The system reads the performance information, automatically extracts the main (important) musical features of the performance, extracts the basic musical features of the performance, and outputs the feature recognition results of the performance. In order to make the results more intuitive and to make it easier for the system user to analyze the music played, the feature analysis results are presented in the form of images. Of course, in the future, with further development work, the system will provide textual evaluation of the performer’s performance and suggest improvements accordingly.

The pitch and timing information is shown in Figure 3. The result of the music feature recognition system shows quantitative information such as pitch, time value, number of key touches, etc. The system also displays the number of keys touched by the player. Among them, “Ode to Joy” has the highest pitch during the frequency range of 200~400Hz.

Figure 3.

Frequency domain diagram

The results of the “Ode to Joy” analysis are shown in Figure 4, where the standard requirements for each note can be clearly seen along with the playing pitch and time value, and the differences can be seen. In the figure, the horizontal coordinate indicates the number of beats, the vertical coordinate is the key number, and the black horizontal line indicates the standard pitch and its time value (i.e., the standard pitch and its time value). The purple horizontal line indicates the played pitch and its time value (i.e., the actual pitch and its time value) From the image, the user of the software can well see the information of the time value of the pitch played by the player, and the requirement of the musical score. The playing accuracy is 98.2% and there are 21 keys. In conclusion, music performance recognition software extracts features and analyzes a recording file of an actual performance. Users of the software can now run and use the entire software with the help of Matlab to obtain the music-related information and evaluation they desire. On the one hand, the software can be used as an aid to music teaching, helping music performance learners to analyze whether they are playing the right music or not, and to continuously improve their performance. On the other hand, piano learners can also use this software to analyze the known correct performance of music, and get the corresponding information, especially the information of the tuning tonality, this software can help them to understand the tuning tonality of the music, and solve the problem that many people don’t understand the tuning tonality of the piano music in the learning of piano music.

Figure 4.

Identification system processing result interface diagram

Analysis of the effect of music software application

This study mainly utilizes the educational action research method with the music students of University X as its research subjects. The study used a quiz method where the experimental class, i.e., Group A (using Cubase12 music teaching software) and the control class, i.e., Group B (traditional teaching media), responded to the quiz questions to illustrate whether the Cubase12 music software has a relevant impact on the teaching and learning of music in junior high schools.

Three factors were taken into account in the design of the quizzes. First, the accuracy of the quiz should be high, i.e., the validity of the quiz. According to the teaching content of the eight lessons, the order of compilation is from easy to difficult, starting from the students’ existing music level, combining with the teaching materials, and taking into account the characteristics of Cubase12 music software and traditional teaching media. Secondly, the reliability level of the quiz should be high. The design of the test questions includes fill-in-the-blank, multiple-choice, connect-the-dots, and listening questions. The fill-in-the-blank questions are mainly about basic knowledge. The multiple-choice questions focus on a wide range of topics. The connecting questions examine the correspondence between musical works and real life as well as the close association between music and society. Listening and analyzing questions are unique to music exams, and the two classes will use different teaching methods to play and answer the questions. Third, the design of the test paper should be based on the current music curriculum standards for emotional attitude and values, process and method, and knowledge and skills of music teaching objectives classification, the design of the questions need to have a clear goal orientation, that is, for traditional music teaching and the use of Cubase12 music software teaching methods made by the analysis of the topic.

The analysis of the quiz data in this study was mainly made using EXCEL tables, the key to an overview of the overall class performance, as well as the analysis of the relevant types of questions designed.

Overall test data

A total of 60 test papers were recovered, and after analyzing the data as a whole, the distribution of grades was as shown in Figure 5, with an average score of 76.35 for Group A and 68.84 for Group B. We can see that the grades of the two classes were mostly distributed at the level of 70-90, with fewer scores below 50.

Figure 5.

Test data

Data situation of different question types

The whole paper includes four types of questions: fill in the blanks (1~4), multiple choice questions (5~8), connecting questions (9~12) and listening and distinguishing questions (13~15), and from the data collected, the data situation of different types of questions is as shown in Fig. 6: This is a graphical representation of the distribution of the correct rate of the four major types of questions, and according to the above, it can be seen that for the listening and distinguishing questions, the students of the two groups are relatively weak, and in general, students from the class of group A are In general, the students of Group A class were relatively outstanding in some aspects, and the two groups were almost in the same proportion in terms of grasping the basic music theory. However, the listening questions in Group A were mainly played through the Cubase 12 music software, and in Group B, the traditional MP3 playback or video playback was chosen, and the questions were exactly the same. We found that the similarity between Group B and Group A is mainly for listening to the song title and writing the instrumentation, but in Group A the transformation of the score to the pentatonic score is very good. From the final results, in the 2nd, 4th and 9th questions, Group A did not have much obvious advantage, but in the 13th question about “the descending sign is located in the (direction) of the note” Group A, who had used Cubase12 music software, had a correct rate of 83%, which shows that for music teaching, sound and image should be synchronized, or mainly based on perceptual understanding.

Figure 6.

Different questions

Overall both classes had the highest error rate for the final multiple choice questions in terms of the distribution of error rates. From the correct rate, the highest percentage of questions were connecting, listening, and recognizing. However, from the perspective of Group A and Group B, the Cubase 12 music software class had a higher percentage of correct listening questions than Group B. There was also a higher percentage of fill-in-the-blanks questions on note writing than Group B. There was no significant difference between multiple choice and linking questions. In terms of the overall distribution of errors, the highest error rate was in the multiple-choice questions, and most of the errors were due to details, such as choosing the wrong instrument or getting the wrong time value for rhythmic notes. In the listening questions, the lowest error rate was found when listening to popular songs, which shows that popular music genres other than books are very popular among students when selecting music courses.

Survey and Analysis of Music Theory Programs

A statistical survey was conducted on the music majors of College X who used Cubase 12. From the point of view of learning interest, there was a significant increase in students’ motivation to participate in the music theory course, a significant increase in students’ concentration time while studying, and a significant increase in the quality of classroom assignments as well as student work. At the end of the course, a post-study survey was conducted with the same students, and the results of the survey are shown in Figure 7. The survey showed that 31.2% of the students found the difficulty of the course to be easy, 58.25% found it to be moderate, and 10.55% found it to be difficult. The survey on the interestingness of the course design, 67.5% thought it was interesting, 20.8% thought it was okay, and 11.7% thought it was boring. The difficulty of coursework was considered easy by 36.1%, moderate by 47.2%, and difficult by 16.7%. The data shows that the overall course design of the music theory course was acceptable to most of the students.

Figure 7.

Use the music software course questionnaire

Conclusion

This paper utilizes computer music software, an emerging model that integrates digital technology and music art, to improve the quality of music teaching and assist music education. Through questionnaire surveys, comparative experiments, and other methods of music teaching, we aim to explore the effectiveness of music software as a teaching mode. The main research’s findings are as follows:

Through the audio recognition plug-in in the software, the learner’s Ode to Joy is compared and analyzed with the standard audio, and the pitch, timing and frequency information of the Ode to Joy is shown in detail, and the accuracy rate of the performance reaches 98.2%.

The average performance of the Cubase 12 music software was higher than that of the traditional music teaching method in the test of Group B of Combination A. Most students believed that the music software program was more challenging than the traditional music teaching method. Most of the students thought that the music software was moderately difficult, and that the course design was interesting, but the coursework was moderately difficult.

Funding:

This research was supported by the Ministry of Education Humanities and Social Sciences Planning Fund Project: Historical Exchange and development of Silk Road Music and dance culture around Altay (23YJA760063).