Submitted by beingsubmitted t3_z4el7h in dataisbeautiful
beingsubmitted OP t1_ixqekpt wrote
Data from data.gov: the 100% sample of Social Security application names, 1910-2020*.
Made with Python (w/ plotly, PIL)
*There were no names with 'eigh' in 1910
EDIT: Forget Leighanne. Leighanne sucks. The first use of EIGH not at the end was way back in 1914: 6 boys named LEIGHTON, born in, you guessed it, NY.
JillHasSkills t1_ixr2g22 wrote
You called out Creighton and then said that Leighanne was the first use of “eigh” not at the end. Otherwise super interesting!
beingsubmitted OP t1_ixr5zew wrote
Oh that's weird. I'll go back and check my data. Obviously one of those is wrong.
Yeah, I was wrong on Leighanne - not sure what happened. Ran it again and it's Leighton in NY, 1914.
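For anyone who wants to rerun that check, here's a rough sketch of the kind of query involved. It's not my exact code. It assumes rows shaped like `state,sex,year,name,count`, and the function name and every sample row except the Leighton one are made up for illustration.

```python
import csv
import io

def first_non_final_use(rows, fragment="eigh"):
    """Return the earliest (year, state, sex, name, count) row whose name
    contains `fragment` with at least one letter after it, or None."""
    hits = []
    for state, sex, year, name, count in rows:
        lowered = name.lower()
        idx = lowered.find(fragment)
        # Keep the row only if the fragment is present and not the name's ending.
        if idx != -1 and idx + len(fragment) < len(lowered):
            hits.append((int(year), state, sex, name, int(count)))
    # Tuples sort by year first, so min() gives the earliest match.
    return min(hits) if hits else None

# Illustrative rows only; the column layout is an assumption.
sample = """NY,M,1914,Leighton,6
NC,F,1925,Ashleigh,5
GA,F,1950,Leighanne,7
"""
print(first_non_final_use(csv.reader(io.StringIO(sample))))
```

Names like Ashleigh are skipped because the fragment is the very end of the name, which is what tripped up the Leighanne claim.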
royalpatch t1_ixrwt4u wrote
Just FYI that's not 100%.
The SSA data only includes names for which there are at least 5 people with that name that year.
beingsubmitted OP t1_ixs7nfb wrote
That's good to know. It does say it's a 100% sample, but that would explain why the lowest count I can find is 5. Do you have a better source?
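For reference, the "lowest count" check is roughly this. It's a minimal sketch assuming each row ends with the count column; `count_floor` is a hypothetical helper name, not something from the dataset or a library.

```python
def count_floor(rows):
    """Return (smallest count, number of rows at that count)."""
    # Assumes the count is the last field of every row.
    counts = [int(count) for *_, count in rows]
    lowest = min(counts)
    return lowest, counts.count(lowest)
```

If the smallest value across every year and state is exactly 5 and lots of rows sit right at it, that points to a reporting cutoff rather than a coincidence.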