Show HN: I made a static site for exploring names
namex.lyall.coI've been trying to find some good baby names, so naturally I've been playing around with the US Social Security Administration's name data[0].
I've made various visualizations, tried to analyze names that are going to be popular in the future, etc. I recently tried making a name recommendation system that lets you rate names, then recommends names based on your inferred preferences. It worked OK, but along the way I ended up making namex[1], which has turned out fun and maybe useful enough to be worth sharing with a wider audience. Give it a try!
Basic overview:
- Corpus of ~24,000 names taken from SSA data
- Any name with 15+ registrations attributed to it from 2022-24 was included
- The names are scored by an LLM (Claude Sonnet 4.5) across ~40 subjective dimensions, such as "toughness", "trendiness", "easy to spell"
- A further ~15 dimensions are computed based off of the (LLM inferred) pronunciation of the name
- E.g. "vowel rich", "ends nasal", "syllable count"
- 3 dimensions are computed from the raw SSA data related to popularity and gender distribution
- Names are then represented by a 60-dimensional vector
- User selections create a weight vector
- Names are ranked against the weight vector using fancy linear algebra (or, uh, dot products)
- Static data is loaded from server, everything else is run client-side
Disclaimer: there is some potential for offense to be taken at the characterization of names. The LLM was instructed to score the subjective dimensions according to the American cultural context. So what is considered e.g. easy to spell, or associations with certain cultures or religions, is based on the LLM's interpretation of that. So there are probably biases that stem from the LLM's training or American culture, or likely both.
0: https://www.ssa.gov/oact/babynames/
1: https://namex.lyall.co/ One of my friends was searching for a name that is easy to say and spell in Spanish and Japanese. The intersection is quite small. I’m going through something similar but with English instead of Spanish. That’s why I added the “kana readable” and “kana faithful” dimensions. The UI is pretty good.You could add pagination instead of load more for better accessibility :) Nice work nice i dont think it has my name lol If you’ve got a name that is very rare in the US then it’s possible! Any name that had at least 15 registered births over the last three years in the SSA data is included. oh sick how do i check? I don’t have a way to check built in right now. But you can look at the 2020 data here: https://raw.githubusercontent.com/hackerb9/ssa-baby-names/re... If your name is on there it’s probably included in namex. I could also manually check the data of course but that would require knowing your name :) my username alhwyn