decoding k-pop: a computational analysis of musical elements

k-pop has taken the western pop music scene by a storm in recent years. i wondered why, beyond the vibrant visuals, extravagant outfits, and unrivaled choreography - what makes k-pop different?

background

Korean Pop (K-Pop) is a geographically-characterized genre of music that originated from South Korea, drawing from diverse genres such as Hip-Hop, rap, new jack swing, EDM, rock, dance, and even 90s pop. K-Pop has risen to be a global phenomenon thanks to increased broadcasting and exposure to Korean culture. There is a general understanding that K-pop places a high emphasis on choreography, production values, and attractive idol branding creating a fan frenzy phenomenon that generates a dedicated following (Romano, 2018). It has been described as “a maximalist dreamland full of color” with “a plethora of performers and unrivaled choreography” (Sherman, 2020). However, despite the popularity of K-pop, most of the research on its impact has been discussed and analyzed through the sociocultural lens without much musical analysis beyond general characteristic descriptions. But is it possible to understand the popularity of K-Pop solely through its musical elements? This study seeks to delve deeper into the auditory intricacies of K-pop by analyzing Spotify's Audio Features dataset. This thesis addresses the following questions: What makes K-Pop “K-Pop”? Are there fundamental stylistic differences between Korean Pop music and Western Pop music? If so, what musical elements, including key modulation, tonality, formal structures, and timbre, define K-Pop?

These questions will be addressed using a comparative corpus study between two curated data sets consisting of 165 songs to represent K-Pop and Western music. Firstly, a key-finding algorithm will be used to visualize key modulations and key profiles of each song within the corpus. Then, a K-Pop classifier will be used to address the specific audio features that are pertinent to the classification of K-Pop. A comparison between various statistical models will be conducted to identify the accuracy and validity of each model. A regression analysis and odds ratio will also be conducted to better understand how the model deciphers K-Pop. Lastly, an analysis of the correlations between audio features will be conducted to visualize how each variable interacts with one another.

example of key-finding plot - dynamite by bts

figure 1. key-finding plot (dynamite by bts)

classifier analysis

regression analysis

Odds Ratio Graph:

figure 2. key-finding plot (as it was by harry styles)

methods

Constructing a Corpus

In order to better understand the potential differences between Western and Korean pop music, I aimed to create a fair depiction of what is considered to be the most “popular” in both genres from a data analytics perspective. Many musical platforms such as Billboard and Apple Music provide musical rankings in popularity, but I found Spotify to be the most accurate in depicting popularity in terms of geographic and personalized streaming preferences. Thus, popularity was operationalized by stream count from Spotify’s API data, including track and artist popularity provided by the End-of-Year Wrapped from Spotify Newsroom.

The study conducts a comparative corpus study on two datasets: the Top Hits in the USA (2020-2024) and the Top K-Pop Hits (2020-2024) on Spotify. Initially, the scope was restricted to the Top 50 Hits in the USA and the Top 50 K-Pop Hits in 2022; however, this was expanded as the initial corpus proved insufficient for discerning significant insights. The current corpus consists of 165 songs in each corpus, made up of the top 10 to 50 songs from each year, and the most popular songs in both categories as of April 2024 (See Appendix A). While recognizing the limitations of defining Western music as “American,” I decided to not utilize the Top Global Hits as the Western corpus since there were K-Pop songs in those playlists that could potentially confound the results.

2. Computational Methods

To address my research question, I programmed Python scripts to extract the track ID, track key, key confidence, and mode confidence for the Top 50 songs in the USA and K-Pop (2022) corpus. Based on the track IDs, I plotted key plots using Albrecht and Shanahan’s key-finding algorithm with humdrumR and compmus R packages (2013). The original graphs were created using Krumhansl-Schmuckler Key-Finding Weightings to serve as the basis for comparison. Then, I generated a K-Pop Major and Minor Key Profile that was trained on the K-Pop corpus, and a Western Major and Minor Key Profile trained on the Western corpus. The three key-finding weightings were used to visualize selected tracks and compare how successful each key profile was at predicting their keys. Then, I created a K-Pop classifier, employing a neural net trained on the features generally associated with K-Pop, and then tested the accuracy of that model using R scripts. The audio features I chose to focus on were ***acousticness, valence, loudness, liveness, danceability, speechiness,* and *mode confidence.***

To better understand the model fit for each audio feature of interest, I used the Akaike Information Criterion (AIC) measure to gauge which specific group of audio features is most relevant to predicting K-Pop. The AIC is defined as a summary statistic used in comparing the relative goodness of fit of two or more models for a given set of data, while taking into account the number of parameters in each model. The model with the lowest AIC is considered the best among all models specified (APA, 2018).

Based on the findings of AIC, I conducted a regression tree analysis to assess how the model distinguishes what is considered “K-Pop”. I compared the original neural network model against a logistic regression (glm), K-nearest neighbor (knn), learning vector quantization (LVQ), gradient boosted machine (GBM), support vector machine (SVM), and random forest (rf). The model then produced a summary of its resamples for each dataset. In addition to the regression tree, I also conducted a logarithmic odds ratio analysis to understand each audio feature variable’s predictive value in classifying a song as K-Pop. The increase in the corresponding predictor variable is associated with an increase in the log odds of the song being classified as K-pop, while a negative coefficient indicates the opposite.

key-finding algorithm

results

This research utilized a variety of computational methods to investigate how different musical elements, such as key and timbre, demarcate the unique sound of K-Pop. I aimed to address the following research questions:

What makes K-Pop “K-Pop”?
Are there fundamental stylistic differences between Korean Pop music and Western Pop?
If so, what musical elements, including key modulation, chord progressions, formal structures, and timbre, define K-Pop?
What do K-Pop’s audio features reveal about K-Pop’s stylistic fingerprint?

Based on the results from the various statistical tests we conducted, it is obvious that there are musical elements, specifically the understanding of tonality and timbre that sets K-Pop apart from Western pop music. The key-finding weightings and key-finding plots illustrate that there are variations in how Western pop music, Western classical music and K-Pop differ in the way they understand tonality and the evaluation of key and modality. The fact that differences were able to be observed based on the corpus-trained algorithms suggest that there are stylistic differences between the two genres.

Looking further into timbre, operationalized as audio features, we consistently found that high speechiness, valence, mode confidence, and acousticness play pivotal roles. Our regression analysis indicated that speechiness and valence are critical in distinguishing K-Pop, along with mode confidence and acousticness also contributing significantly. The odds ratio analysis revealed that increases in mode confidence, valence, and liveliness increased the likelihood of a song being classified as K-Pop. Conversely, an increase in acousticness and key confidence increased the likelihood of a song being classified as not K-Pop. Interestingly, our classifier analysis showed that different predictors emerged as significant depending on the combinations of features. When mode confidence and acousticness were low, loudness, valence, and speechiness were prominent predictors. Conversely, when loudness and speechiness were low, mode confidence, acousticness, and valence were most influential. These findings highlight the nuances of timbral qualities that define K-Pop's unique sound.

In conclusion, this study demonstrates that K-Pop's unique sound is characterized by specific tonal and timbral elements that distinguishes itself from Western pop music. These insights into K-Pop's stylistic fingerprint provide a deeper understanding of the genre's musical identity and contribute to the broader discourse on global pop music differentiation.

decoding k-pop: a computational analysis of musical elements

k-pop has taken the western pop music scene by a storm in recent years. i wondered why, beyond the vibrant visuals, extravagant outfits, and unrivaled choreography - what makes k-pop different?

background

classifier analysis

regression analysis

Odds Ratio Graph:

methods

Constructing a Corpus

2. Computational Methods

key-finding algorithm

results

This research utilized a variety of computational methods to investigate how different musical elements, such as key and timbre, demarcate the unique sound of K-Pop. I aimed to address the following research questions:

What makes K-Pop “K-Pop”?

Are there fundamental stylistic differences between Korean Pop music and Western Pop?

If so, what musical elements, including key modulation, chord progressions, formal structures, and timbre, define K-Pop?

What do K-Pop’s audio features reveal about K-Pop’s stylistic fingerprint?

beyond legal reformation: trans community mutual aid through tiktok