This is the assumption that because two things are related, or co-occur, that one causes the other.
Do the names of NFL quarterbacks affect the Miami Dolphins' success?
In 2021-2022, this team won every game where the quarterback in the opposing team had a last name containing the letter "o", and lost every other game (source: Reddit). This is an example of coincidence—these things are completely unrelated.
Does eating icecream cause drowning?
A famous study on how to use and interpret correlations in academic research used the the strong correlation of icecream sales to number of drownings to demonstrate how correlations can hide other relationships (in this case, weather; hotter days cause both more icecream consumption and more swimming).
We could use this relationship to make rough predictions ("If the local icecream shop has sold this much icecream by 10am, put more lifeguards out on the beach to watch for people in trouble!"), but we couldn't prevent someone from drowning by telling them not to buy icecream.
Getting from correlation to causation
Statisticians and scientists have developed sophisticated ways to test correlations further, so that we can pick apart what's pure chance (like the NRL games), what's a non-causal relationship (like ice cream sales and drownings), and what's actually causal. Briefly:
- Replication. Scientists will copy each others' studies to make sure the original findings aren't just a fluke. The more studies find the same relationship, the more confident we can be that there's something real.
- Sampling and probability calculations. Scientists will try to do experiments with lots of participants, and then use special statistical calculations to figure out how likely the results they got were assuming it was all just due to chance. The more unlikely their results are, the more confident they can be that they found something real.
- Controlling variables. Scientists will try to limit the amount of noise and uncertainty in their experiments by testing as few variables at a time. They'll try to have groups of participants that are as similar to each other as possible, and then only change one thing, and see what effect that has. If there's an effect, that indicates a possible causal relationship! (But only if other scientists can replicate it and we're confident it's not just random chance.)