Correlations

STAGE I

Putting the Census Records in a Computer Database

A team of volunteers used the "Warren County, Tennessee: Census of 1850" transcription (published in 1958 by Womack Printing Co., McMinnville, Tennessee) as a starting point. Spot checks comparing this transcription with original census records on microfilm showed a high level of reliability.

Each person in the 1850 census was entered as a separate record in a PAF database (Personal Ancestral File version 2.31, a genealogy program).

Members of a household were linked into a family wherever it seemed "plausible." NOTE: THESE LINKS CONSTITUTE A STARTING POINT ONLY! The PAF database will be constantly modified in Stage II as new information comes to light.

Second marriages are especially likely to be problematic, as illustrated by the following family:

    Jacob Herndon   60
    Charlott [sic]  22
    Nancy           19
    Ruth            15
    Robert          13
    Joseph          11
    Susan           78

We interpreted this as Jacob, a widower with 5 children living at home plus his mother. I received a Herndon database, and it turns out that Charlotte is Jacob's second wife. Jacob's first wife died in 1840, after bearing him 12 children. Jacob and Charlotte married in 1850 and had another 14 children! Nonetheless, we can expect a good percentage of family links will later prove to be correct.

Records were entered in the same order as the census listings. No spellings were changed, even when it seemed likely that the census taker was mistaken. Certain conventions were followed (the "field" and "level" terminology below is from PAF, but most genealogy programs will have equivalents):

Certain fields of the PAF database were then extracted (using a custom program I wrote) into a comma-delimited record format for importing into a general purpose database program. No family links are present in this file; it is essentially a census database. The first line contains field names, in more or less the same order as the Womack transcription:

The first field, RIN, is provided as a convenience in cross-referencing the general purpose database with the genealogy database. It may be stripped without any adverse effects, since the records will sort the same way by using the ID field as a key.

Some advantages of a database program are the ability to search and sort on various fields, such as

Various statistical summaries can also be performed with a database program, such as the number of children in school at various ages or the ratio of lawyers to farmers. If any readers would like to prepare such reports for the web site, I would be pleased to add them here or link to your home page.

Click here to download the database WCTN1850.ZIP (104K).

The author gratefully acknowledges the assistance of Kensey Alsman, Jim Davis, Doris Good, James Matthews, Sam Powell, Howard Powers, and Cynthia Whitlock in preparing Stage I. Together they entered and proofed the 8468 names in the 1850 census.

Home