1
2
3
4
5
#

Name:

Description:

Data on age, smoking, and mortality from a one-in-six survey of the electoral roll in Whickham, a mixed urban and rural district near Newcastle upon Tyne, in the UK. The survey was conducted in 1972-1974 to study heart disease and thyroid disease. A follow-up on those in the survey was conducted twenty years later.

Variables:

A data frame with 1314 observations on women for the following variables.

  • outcome survival status after 20 years: a factor with levels Alive Dead

  • smoker smoking status at baseline: a factor with levels No Yes

  • age age (in years) at the time of the first survey

Details

This dataset contains a subset of the survey sample: women who were classified as current smokers or as never having smoked. The data were synthesized from the summary description tables given in the Appleton et al al paper.

Link To Google Sheets:

Rows:

Columns:

License Type:

References/Notes/Attributions:

References

DR Appleton, JM French, MPJ Vanderpump. "Ignoring a covariate: an example of Simpson's paradox". (1996) American Statistician, 50(4):340-341.

R Dataset Upload:

Use the following R code to directly access this dataset in R.

d <- read.csv("https://www.key2stats.com/Data_from_the_Whickham_survey_1296_31.csv")

R Coding Interface:


Datasets Tag Questions & Instructional Blocks

NumberContentType
No results found.