HDS logo (sidebar) titlebartitlebar-curveHomeSite MapSearch CatalogueSite SearchContact Usend-menu
About Us
Accessing Data
Depositing Data
Creating Data
Projects
News and Events
Advice and Training
Staff
Great Britain Historical Database Online
Website email
Address
AHDS logo/link
 

Great Britain Historical Database Introduction

Scope

As explained below, the first version of this database was constructed with a fairly specific focus: measures of localised economic distress in the period 1850-1914. Even that quite limited theme, which began with research into Trade Union-run Unemployment Insurance schemes, means that the database also includes:

  • Official unemployment statistics, both pre-WW1 and inter-war
  • Poor Law Statistics, again pre-WW1 and inter-war
  • Records of Small Debt Cases
  • Data from the Registrar General's Reports on numbers of marriages
  • Wage Rates in individual towns
  • Some limited information on Short-Time Working

Contextualising economic distress meant that we also gathered Employment Statistics from the census, while a separate project to create an Atlas of Industrial Protest (Charlesworth et al, 1996) led to a major database of strikes being assembled.

More recently, new collaborations and new interests have led to the addition of:

  • Demographic statistics from the census, especially on age and sex structure
  • Mortality statistics
  • Electoral statistics

Geographical Coverage

The geographical coverage of tables in the database depends on the original source. Where the same report covered Scotland or Ireland, we have included them, but so far we have not actively sought out Scottish or Irish sources to supplement data on England and Wales. In general, data gathered by trade unions or the pre-1914 Board of Trade covers the entire British Isles; data gathered by the inter-war Ministry of Labour covers Great Britain; and data gathered by the linked census, vital registration and poor law systems covers England and Wales. In the longer term, we aim to add systematic data for Scotland but we have no intention of duplicating the work of the Database of Irish Historical Statistics at Queen's University, Belfast.

History

Until 1996 the system was referred to as the Labour Markets Database and was originally developed as part of research on the regional economies of pre-1914 Britain funded by the Leverhulme Trust between 1989 and 1991, which built on Humphrey Southall's doctoral research. The first version of the database was constructed using INGRES on the QMW Geography department's MicroVAX by David Gilbert, and was named simply 'gilbert'. Due to the limited power of the MicroVAX, each original report from the source material was held as a separate table in the database, leading to considerable complexity in the construction of time series and to the database containing over 600 tables. When the MicroVAX was closed down in the summer of 1992, this database was moved to beta, an ICL DRS-6000 Unix mini-computer operated by QMW Computing Services, and reconstituted as the 'qmwlmdb' database without modifying its structure. Note that the closure of the MicroVAX meant that we could no longer support the Hypercard-based data access method described in D.Gilbert and H.R.Southall, 'Data Glasnost: a user-friendly system for access to research databases across wide-area networks', History and Computing, Vol.3 (1991), pp.119-28.

The 'lmdb' database was created by Humphrey Southall in December 1993 as a supplement to and eventual replacement for the 'qmwlmdb' database. There were three reasons for its construction:

  • The number of tables in the 'qmwlmdb' database made it difficult to manage, and difficult to interrogate; in particular, the construction of time series was complex because values of the same variable for different dates were generally in many separate tables. In general, the current database stores all records of a given type in a single table, with additional fields indicating the year and sometimes month to which the data refer. It is still possible to retrieve data in 'raw' form by selecting the year and month and requesting fields in the order they appeared in the original.
  • The gazetteer system used in the original database has proved too crude, and a more sophisticated system is being implemented. This is described elsewhere.
  • In the course of the transfer from the MicroVAX to beta, a number of errors and in particular missing observations became apparent; this is much more of a problem when concern is with time series for single towns rather than with cross-sectional data for regions or counties. Given that checking such an extensive system is inevitably a protracted process, the safest solution was to create a wholly new system which contains only checked data and whose tables were fully documented as they were created.

From 1995 onwards, new funding permitted the addition of substantial new bodies of data: a range of economic statistics from the inter-war period, funded by a 1995 grant to us from the Nuffield Foundation; a variety of demographic data, much of it provided by other researchers; and various tables linked to the construction of the GIS.

In the summer of 1996, the database moved again, from INGRES running on 'beta' to the Oracle SQL database running on a Sun workstation in the QMW Geography department. This move was made necessary by changes in hardware and software availability, but should permit closer integration with the GIS. As part of the move the database was renamed to reflect the wider range of data now included, and particular the addition of electoral statistics and cause of death data which took it clearly beyond a focus on labour markets. At the same time, copies of many of the tables were supplied to the History Data Service for incorporation into the Great Britain Historical Database Online, a new on-line information system being created under their JISC-funded 'History Special Collections' programme.

What's missing from the database

  • Almost all of the data input in the original Leverhulme-funded project on the origins of the depressed areas is now held in 'lmdb'. The principle exception is unemployment data for the United Society of Boilermakers. We also have a long run of photocopies of unemployment statistics for the Friendly Society of Ironfounders which have yet to be input.
  • A substantial body of data concerning trade unions and strikes was input as part of research for the Atlas of Industrial Protest, using funding from the Nuffield Foundation. This has still to be added to 'lmdb'.
  • We expect to be adding additional census data for 1861 and 1871 supplied by David Gatley, and 1951 census data supplied by Daniel Dorling.
  • Data from the Steam Engine Makers' Society reports, and related sources, are held in a separate database, 'sem'. There is also a large collection of census enumerators' book data for Bolton and the Isle of Dogs which it is not intended to add to 'lmdb'.

What's not described here

This document aims to describe all tables in the database which contain original data, or meta-data such as gazetteers. It does not, in general, describe tables created entirely from data held in other tables, many of which have been created in the course of generating results from the data.

The QMW Historical Geographical Information System

Since October 1994, the ESRC have funded the construction of a GIS closely associated with the Great Britain Historical Database. Existing historical GISs generally relate to a specific date, or to a series of discrete dates such as census years. However, the database contains statistics which were recorded for large numbers of separate geographical units once or twice a year, and we have therefore had to construct a system which continuously tracked changes in the network of reporting units. In doing this, we have relied heavily on reports of changes in the boundaries of Registration Districts which appeared in the Registrar General's reports, and have been greatly aided by a Wellcome Trust-funded project at the ESRC Cambridge Group which input many of these lists of changes. This information, specifying changes to the precise day, forms part of the GIS and permits it to construct appropriate base maps for any date within the period covered.

More documentation of the GIS is provided elsewhere.

Making use of the database

We want to work with other researchers interested in the regional economies of 19th and early 20th century Britain and related topics which require access to our data for the whole country, not just a particular locality. We will be happy to either supply data or arrange for access to our system over the Internet once we have arranged some basis for collaboration:

  • If you wish to use our data to provide divisors, explanatory variables or suchlike in research whose central concern is with data you currently hold, we would expect that any resulting publication would mention the Great Britain Historical Database as the source of the information and acknowledge both ourselves, Humphrey Southall, Ian Gregory and David Gilbert, and the relevant funding body (the Leverhulme Trust for pre-1914 data, the Nuffield Foundation for inter-war data and the ESRC for the GIS). We would also expect to be sent copies of any such publications - this matters a great deal to our attempts to obtain further funding to expand the system.
  • If it appears to us that a principal focus of your proposed research would be the data we were supplying, we may ask to be included as co-authors of any resulting publications. In return for this, we would be supplying not just our data but also our expertise on what the data mean: most of the data we hold is full of pitfalls for the unwary, only a few of which can be explained here.

In either case, we would expect that any related data you hold would be made available to us on similar conditions, directly or through the History Data Service. This applies in particular to data from printed census reports, the Registrar General's Annual Reports and Decennial Supplements, reports of the poor law system, labour market data from trade unions or the Board of Trade/Ministry of Labour, and electoral statistics. A wealth of geographically-located data is available for Britain for the 19th and 20th centuries, and many important lines of research can only be followed by combining data from different sources. Potential uses of machine-readable data are often not apparent to the researcher who originally organised input; for example, two of our future research plans require data on Registration District boundaries to study voting behaviour, and mortality data for research on migration. Given the very limited resources available for historical research in the U.K., the repeated inputting of the same large datasets by different researchers is an unacceptable waste, even if funding bodies will pay for it; for further discussion see the introduction to our paper 'Data Glasnost', cited above. However, fears that others will simply plunder datasets which have taken years or decades to assemble are understandable, and for this reason we seek to establish a 'circle of co-operation' within which researchers will feel able to more freely exchange data. The Great Britain Historical Database and GIS, implemented using 'industrial strength' software running on Internet-accessible multi-user systems, can hopefully make a large contribution to promoting such a collaboration.

Anyone wishing to discuss possible collaborations should contact Humphrey Southall.

Funding

Parts of the database date back to Humphrey Southall's work as a research student, funded by the Social Science Research Council. Most pre-1914 statistics were gathered through a project on 'the Origins of the Depressed Areas' funded by the Leverhulme Trust. Inter-war unemployment and poor law statistics were input using funding from the Nuffield Foundation. The Economic and Social Research Council funded construction of the GIS.

Other Acknowledgments

The existing database and GIS have already benefited from many people's efforts. We are particularly grateful to Daniel Dorling (Newcastle), Eilidh Garrett, Alice Reid and Jenny Wood (ESRC Cambridge Group), David Gatley (Staffordshire), Paul Johnson (LSE), and Clive Lee (Aberdeen) who have generously provided data, in most cases without imposing any restrictions on subsequent use. Most data entry at QMW has been the work of Pat Furniss and Maureen Watters, and the size of tables such as 'ase_mr' and 'lui' is a tribute to the speed of their fingers. The GIS is largely the creation of Ian Gregory, who has done a remarkable job of constructing a coherent system from a strange array of historical sources.


(c) Humphrey Southall, David Gilbert and Ian Gregory, 1996.


GBHD Online Documentation Home Page

 
Copyright & Disclaimer