Peter Ralph
3 November – Advanced Biological Statistics
developed the correlation coefficient and the chi-squared test; early use of the \(p\)-value
head of the first Department of Applied Statistics, as the Galton chair of Eugenics
focus on “biometry”: statistical theory of biological measurements, especially evolution and inheritance.
had a lengthy feud with Udny Yule over the correct definition of the correlation coefficient for a \(2\times 2\) table
developed ANOVA, F-distribution, methods for maximum likelihood, lots of evolutionary theory
Galton chair of Eugenics, succeeding K. Pearson
What’s up with this connection to eugenics?
How can we reconcile these people being very smart about some things but not about others?
Has this history had an impact on how we do statistics?
Now, if you are going to take Darwinism as your theory of life and apply it to human problems, you must not only believe it to be true, but you must set to, and demonstrate that it actually applies. […] It was not a light task, but it gave for many years the raison d’etre of my statistical work.
upper-class Nordic eugenicists thought that everyone except upper-class Nordic people were dangerously degenerate
they opposed curing tuberculosis and justified the genocide of the Americas
this inspired forced sterilization and anti-miscengenation laws in the US, targeting poor and people of color
and this directly inspired the Nazis
PSA: eugenics is a bunch of bullshit
Bystanders:
William Gosset (\(t\)-test): focused on practical applications
Egon Pearson (hypothesis testing): pushed to separate statistics and eugenics departments
Opponents:
Lancelot Hogben: biologist, opponent of eugenics
Thomas Morgan: biologist, opponent of eugenics
caused by Mycobacterium tuberculosis
occurs more often among relatives
major risk factors: silicosis (30x), malnutrition, smoking, alcoholism, crowding
minor: genetics (mostly immune-related)
Pearson: it’s hereditary!
niacin deficiency
was common among poor Americans in the early 1900s
… since corn lacks niacin unless nixtamalized.
Davenport: it’s hereditary!
Hill & Doll: retrospectively compare cancer rates between matched smoking/nonsmoking pairs
Hill & Doll: prospectively compare cancer rates between smoking/nonsmoking doctors
etcetera
Fisher: Correlation is not causation. Let’s calmly study this a while longer.
what is being “smart”, anyways?
well, numbers aren’t racist?
but how we use them might be
\(p\)-values and hypothesis tests focus on whether there’s a difference between groups,
… ignoring whether any difference is important.
This is a major problem with reflexive reporting of “statistical significance”.
Is this because of “eugenics thinking”?
A small methodological fix:
the confidence interval
What questions are being asked?
What data is being collected? (and how)
What assumptions are being made?
What are the important conclusions?
Pearson & Fisher had a big impact in part because (they were pushy and) they were working towards something.
Today we have many problems that we need quantitative work to solve.
Where do you want to have an impact?
It is a false view of human solidarity, a weak humanitarianism, not a true humanism, which regrets that a capable and stalwart race of white men should replace a dark-skinned tribe which can neither utilize its land for the full benefit of mankind, nor contribute its quota to the common stock of human knowledge.