Eighteenth Century London Concerts: 6 – Repertoire

This article looks at the types of repertoire included in eighteenth-century London concerts. As discussed in the first article of this series, information on the works performed is encoded, in a complicated way, in the “programme” field of the dataset.

The data is based on concert advertisements in newspapers, so there is considerable variation in the detail provided. Some advertisements spell out details of all of the works and who will perform them, but it is more typical for the focus to be on the performers, with the works often vaguely specified, such as “a concerto by Handel” (if you are lucky, it will say what instrument it is for).

Continue reading →

Eighteenth Century London Concerts: 1 – The Data

Ranelagh Gardens 1754

This is the first in a series of articles looking at Simon McVeigh’s fascinating dataset “Calendar of London Concerts 1750-1800“. In this article I will describe the data and consider how it can be put into a form suitable for statistical analysis. A second article will look at finding the locations of the concert venues, and I will then move on to some analysis of the dataset.

Continue reading →

Desert Island Discs

“Desert Island Discs” is the UK’s longest running radio show, still going strong on BBC Radio 4 over 78 years after the first episode on the BBC Forces Programme in January 1942. Each episode, an interviewed “castaway” chooses the eight pieces of music they would take with them to a desert island. They also select their favourite of these tracks, plus a book and a luxury item.

Continue reading →

What’s in a name?

Many datasets of composers tell us relatively little about them, so we sometimes have to guess details from the information available – such as the composer’s name. Forenames, for example, are often a good indicator of gender, as described in this previous article. Titles – associated with the church, aristocracy or royalty – can also reveal gender, and tell us about occupation or social class. This article looks at what names can tell us about nationality – based on a recent attempt to identify Italian composers among the many obscure and unknown names listed in the British Library’s music catalogue.

Continue reading →

Reading the Musical Times

A few weeks ago I noticed that those nice people at JSTOR have a scheme whereby researchers can apply to access large chunks of their data in order to carry out quantitative research projects.1 I sent off an application to see if they would let me have all copies of The Musical Times and Singing Class Circular in order to carry out a statistical analysis of the text (a technique which I will cover at some point in a future article). Lo and behold, after a couple of emails and a few days, I received a link from JSTOR to download the data I had asked for.

Continue reading →

Deduplication

Deduplication is an important, though often messy and time-consuming, part of many statistical investigations. It is usually required when data comes from several different sources, to identify all of the records that actually refer to the same thing. For example, I have recently been deduplicating the names appearing in the ‘women composers’ sources listed in this previous article. Deduplication may also be needed where several publications of the same work are described in different ways in a library catalogue. Continue reading →

Reading a scanned book

I have recently been working on extracting data on women composers from the various sources listed in this previous article. The first source on that list is a scanned copy of a French translation of a book – Les femmes compositeurs de musique – compiled in 1910 by Otto Ebel. It is available at archive.org here. Although I’ve not had great success in the past in extracting usable data from scanned books, this appears to be a reasonably tidy scan of Ebel, which looks like a useful source on women composers, so I thought I would give it a go. Continue reading →

Triangulation

Surveyor 2Triangulation is a research technique that involves looking at the same thing from two different perspectives. In surveying, it enables positions and distances to be calculated by measuring angles from two locations. In the social sciences, it can increase the reliability of conclusions if they are found by two (or more) different methods. And in statistical historical musicology, looking for the same works or composers in two or more datasets can tell us a lot about the characteristics of the datasets, and about the works’ patterns of survival or dissemination. Continue reading →

The Semantic Web vs Tidy Data

I have recently been trying to collect data from the Listening Experience Database (LED) in order to put together a proposal for a conference paper. The LED is a nicely constructed database using linked open data and a structure based on something called the ‘Semantic Web’. Rather than traditional databases that have a hierarchical ‘tree’ structure, the Semantic Web concept is a true ‘network’, where anything can be linked to anything else. The LED, for example, includes links to data on a number of other databases. Have a look at the LED and follow a few links and you will see what this means – a very rich and flexible means of linking data together. Continue reading →