Eurovision: Love Thy Neighbour

26th September 2021 / Andrew Gustar / Leave a comment

In the annual Eurovision Song Contest, it is well known that countries often tend to be generous in their votes for the songs of their neighbours. This article looks at the evidence for this in the 2021 competition, and uses it as an opportunity to illustrate the technique of “bootstrapping” to assess statistical significance.

Continue reading →

British Music Plaques

21st July 2020 / Andrew Gustar / Leave a comment

Many British buildings are adorned with plaques, marking the birthplace or residence of a famous person, or the site of a significant event. Details of these plaques are available in an online database, and I thought it would be interesting to see how many of them have a musical connection.

Continue reading →

The Teacher-Student Network

11th May 2020 / Andrew Gustar / Leave a comment

I recently stumbled across this page on Wikipedia, listing music students and their teachers. This is an ideal dataset to explore as a network diagram, or “graph”, in which a set of points (or “nodes”) are connected by lines (or “edges”). Here, the nodes are individuals, and there is an edge between them if one taught the other.

Continue reading →

Song Lyrics 8: Parts of Speech Tagging

23rd November 2019 / Andrew Gustar / Leave a comment

In this previous post in the series, we used capitalisation to identify proper nouns (names, places, etc) in our dataset of song lyrics. Other parts of speech – verbs, adjectives, etc – are not so easy to identify, although software exists to do just that.

Continue reading →

Song Lyrics 7: Rhyme Time

28th October 2019 / Andrew Gustar

Previously, we have looked at repetition in our dataset of song lyrics. This seventh article in the series considers a related issue – rhyming patterns. We are only interested here in the last word of each line – i.e. the string of characters between the last space and the end-of-line character \n.

Continue reading →

Song Lyrics 6: Principal Components

9th October 2019 / Andrew Gustar / Leave a comment

This is the sixth in a series of articles looking at different ways of analysing a dataset of song lyrics. In this article we will be venturing into hyperspace to explore the differences and similarities between artists, in terms of the words they use in their songs.

Continue reading →

Song Lyrics 5: Sunday in New York with Mary and John

29th August 2019 / Andrew Gustar / Leave a comment

In a later post in this series of articles analysing a dataset of song lyrics, I will consider the more general question of identifying parts-of-speech (nouns, verbs, adjectives, etc.), which can greatly expand what can be learned from statistical text analysis. However, in this article, I will focus on a particular part of speech: proper nouns.

Continue reading →

Song Lyrics 4: Sentiment Analysis

19th August 2019 / Andrew Gustar / Leave a comment

In this fourth article in the series looking at our song lyrics dataset, we will begin to consider the meaning of the lyrics, rather than just treating the words as abstract objects. A simple technique for quantifying the meaning of texts is known as ‘sentiment analysis’.

Continue reading →

Song Lyrics 3: Repetition and Compression

9th August 2019 / Andrew Gustar / Leave a comment

We all know that a good song depends on repetition – both of the tune and the lyrics. Too much repetition and it is just boring; too little, and it can lack structure. This article looks at different aspects of repetition in song lyrics.

Continue reading →

Song Lyrics 2: n-grams

3rd August 2019 / Andrew Gustar / Leave a comment

In the previous article in this series we looked at counting the frequency of words in a dataset of song lyrics. This time we will look at combinations of words – or n-grams.

Continue reading →

Statistics in Historical Musicology

Category: Techniques

Eurovision: Love Thy Neighbour

British Music Plaques

The Teacher-Student Network

Song Lyrics 8: Parts of Speech Tagging

Song Lyrics 7: Rhyme Time

Song Lyrics 6: Principal Components

Song Lyrics 5: Sunday in New York with Mary and John

Song Lyrics 4: Sentiment Analysis

Song Lyrics 3: Repetition and Compression

Song Lyrics 2: n-grams