A Crisis comes to Wordle: Reusing old words

forkingmad.blog

90 points by cyanbane 15 hours ago · 100 comments

thxg an hour ago

The original Wordle came with a pre-baked ordered list of 2315 "secret" words, from which the daily secret word was looked up (I think based on local time). The list was right there in the JavaScript code of the game (alongside the list of 12972 allowed guess words). It covered dates from 2021-06-19 to 2027-10-20.
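A minimal sketch of the date-indexed lookup described here, assuming the index is simply the number of days since 2021-06-19. The names `EPOCH` and `secret_index` and the wrap-around at the end of the list are illustrative assumptions, not taken from the game's code:

```python
from datetime import date

EPOCH = date(2021, 6, 19)   # first date covered, per the comment
LIST_LENGTH = 2315          # size of the original secret-word list

def secret_index(day: date, list_length: int = LIST_LENGTH) -> int:
    """Return the position in the ordered secret list for a local date."""
    # Assumption: the index wraps around once the list is exhausted.
    return (day - EPOCH).days % list_length

print(secret_index(date(2021, 6, 19)))   # 0
print(secret_index(date(2027, 10, 20)))  # 2314, the last entry
```

With 2315 entries starting on 2021-06-19, the last index does land on 2027-10-20, which matches the date range quoted in the comment.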

Then in January 2022, the NYT bought Wordle, and started tweaking both lists, first shrinking the secret word list to 2309 entries, but leaving the logic otherwise intact. Fast forward to today, I looked up the current code [1], and it seems that there are now 14855 allowed words. The first 12546 are ordered alphabetically (0: "aahed", 12545: "zymic"), and the next 2309 are not. This may suggest that the latter are the secret words, but the logic for picking them has changed: I found no obvious sequence, when compared to the last few days' secret words. So it's either a more complex sequence, or the secret word is picked server-side.
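One way to reproduce the "sorted prefix, unsorted tail" observation is to scan for the first place where alphabetical order breaks. This is only a sketch on toy data: `split_sorted_prefix` is a hypothetical helper, and the six sample words stand in for the real 14855-entry list.

```python
def split_sorted_prefix(words: list[str]) -> tuple[list[str], list[str]]:
    """Split a list at the first position where alphabetical order breaks."""
    cut = len(words)
    for i in range(1, len(words)):
        if words[i] < words[i - 1]:
            cut = i
            break
    return words[:cut], words[cut:]

# Toy data: a sorted block followed by an unsorted block.
sample = ["aahed", "abaci", "zymic", "cigar", "rebut", "sissy"]
guess_only, candidates = split_sorted_prefix(sample)
print(guess_only)   # ['aahed', 'abaci', 'zymic']
print(candidates)   # ['cigar', 'rebut', 'sissy']
```

On the real list this would only suggest where the boundary is; it says nothing about how the daily word is chosen from the tail.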

In any case, I guess they decided to re-shuffle the list now at day 1689 / 2309 in order to avoid giving particularly assiduous players an additional bit of information: they can exclude all previous secret words. (To be accurate, I think this would be 1.897 bits, but my information theory is rusty.)
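The 1.897 figure can be checked with a quick calculation, assuming every word on the list is equally likely a priori, so that ruling out the used words shrinks the candidate set from 2309 to 620:

```python
import math

total_words = 2309   # secret list size after the NYT's trimming
used_words = 1689    # day count quoted in the comment

remaining = total_words - used_words
# Information gained by shrinking a uniform candidate set.
bits_gained = math.log2(total_words / remaining)

print(remaining)              # 620
print(round(bits_gained, 3))  # 1.897
```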

[1] https://www.nytimes.com/games-assets/v2/9003.896ec900f2a1ce8...

trothamel 13 hours ago

If I remember correctly, the original version of wordle used a word list that was run past the creator's wife, who had learned English later in life. The result was a really accessible game - none of the words felt like ones you wouldn't know. It probably makes more sense to reuse words than to risk losing that accessibility.

(I kept a copy of original wordle, and it seems to have 2,315 words that are possible answers.)

  • hyperbovine 11 hours ago

    It’s this. There are many five letter words that are not “wordley”. Words such as, idk, bokeh, are technically part of the lexicon but would never appear as a solution. The wordle bot will even tell you this if you guess them — “good guess, but unlikely to appear as a solution”. The crossword has a similar sort of unwritten rule, maybe not as strict, but really hard technical words seldom appear.

    • gretch 11 hours ago

      > The crossword has a similar sort of unwritten rule, maybe not as strict, but really hard technical words seldom appear.

      Not my experience at all.

      Ask me how I know what an EPEE is

      • rhplus 8 hours ago

        EPEE is a common fill word from a lexicon informally known as crosswordese.

        https://en.wikipedia.org/wiki/Crosswordese

        Really no harder than memorizing all the 2 and 3 letter words in Scrabble and many players will pick most up in a few months.

        • cyode 5 hours ago

          I didn’t know it was called crosswordese! I wonder what the most common term used is. As a very occasional player, for some reason ARIA, IBIS, and VENI/VIDI/VICI stick out, but I’m sure it’s actually one with an E.

          • all_factz 4 hours ago

            VENI/VIDI/VICI are easy for anyone who studied Latin (as indeed used to be common), and ARIA is similarly easy for anyone who knows about opera. Basically, the crossword is for snobs.

            • swores 2 hours ago

              I agree that crosswords often include cultural references that lean towards certain demographics / assuming particular education, and that can feel exclusionary if you don’t share that background - and there's even an argument to suggest snobbery might be behind those choices.

              But I disagree that that makes it for snobs. Snobbery is more about an attitude of looking down on others or their tastes, whereas knowing Latin or being a fan of opera is really just about exposure.

              Sure, there exist some (too many) opera fans who would say something like "it's real art compared to pop or hip hop being low class trash", but that's not a defining part of liking opera and plenty of people who like opera aren't snobs. Ironically it's a different form of snobbery (sometimes called reverse snobbery though personally I hate that term), to dismiss anyone who learned Latin or who likes opera as being a snob!

      • busyant 10 hours ago

        > EPEE

        They love that one.

        • enlyth 7 minutes ago

          And any 4 letter instrument is usually OBOE and a fish related clue is EELS

        • wombatpm 9 hours ago

          If you took fencing at an Ivy League school for your PR requirement you would know all about foil, saber, and epee fencing. Not everyone gets to row crew.

          • hvb2 2 hours ago

            Wholly offtopic but just posting because I thought it was awesome...

            During Covid I saw an ad for a fencing school saying it was the best sport during Covid.

            You wear a mask

            You keep your distance

            And if someone doesn't, you stick em with the pointy end

            :)

      • TimorousBestie 6 hours ago

        > Ask me how I know what an EPEE is

        That’s when you’re like, only tangentially involved with the making of a movie or tv show, but too famous to go without a credit?

      • ted_bunny 8 hours ago

        Ah yes, good old ARA Parseghian. That guy.

    • groggo 6 hours ago

      IMO scrabble would be improved by a similar limitation. There's too many nonsense words.

      • ameliaquining 5 hours ago

        Scrabble is a competitive game, not a puzzle, and therefore subject to a different set of constraints. (Players in a competitive game are trying to win; a puzzle author, if they're any good at their job, is ultimately trying to lose.)

        In particular, you have to consider the equilibrium. If you only allow a subset of words in Scrabble, this replaces the competitive advantage from knowing lots of words that no one uses in real life, with a competitive advantage from knowing the exact contours of the border between acceptable and unacceptable words. I would argue that this is even worse; at least if you learn lots of Scrabble words you're learning something about the real world.

        By contrast, Wordle can self-impose whatever constraints they want on solutions, and people don't have to know what those constraints are in order to solve the puzzle. (It can help a little on the margin, which in a perfect world would not be the case, but it's much less of a problem for the puzzle-solving experience than the Scrabble equivalent would be.)

        • enlyth 4 minutes ago

          Will Anderson has an excellent Scrabble related channel on YouTube, would recommend to anyone who is interested

        • groggo 5 hours ago

          Ya that's a good point for competitive scrabble. However today I think a lot of people's main exposure to Scrabble comes from WordsWithFriends (and recently, the new NYT games version). In those games, there's no penalty for getting a wrong word, it just won't let you play it. In that context, I at least think it would be nice to have a setting with a more limited list... it could be like Chess timed variants.

          It's obviously an impossible challenge to draw those contours in language. Wordle did pretty well though! And going the other direction, just allowing everything that could possibly be a word just starts getting ridiculous.

    • badgersnake an hour ago

      Caulk is in there, I would say that’s fairly technical. My wife didn’t know it.

  • knuckleheads 11 hours ago

    Yes, that's correct! Took her about a year off and on, he had made a little app for her to go through and categorize everything.

    As an aside, for about $200, you can ask a true/false question of every word in the English language with a frontier LLM, and get mostly good answers. I make word games in my free time and was sort of shocked when I realized how cheap intelligence has been getting.

    • amluto 8 hours ago

      $200? Does this use reasoning? Does it involve forgetting to use KV caching?

      This should cost well under $1. Process the prompt. Then, for each word, input that word and then the end of prompt token, get your one token of output (maybe two if your favorite model wants to start with a start-of-reply token), and that’s it.

  • jonwinstanley 13 hours ago

    Yes, there’s no point using technically correct words if hardly anyone knows them.

    • sobkas 13 hours ago

      Language or the way we use it is often used to exclude "undesired", so there is a point in using them. Not a very nice point, but a point nevertheless.

    • hyperbovine 11 hours ago

      Sure there is, as long as your audience does.

  • NewJazz 6 hours ago

    Also they seem to never use vulgar words like my opener, penis.

  • BurningFrog 11 hours ago

    This may well be why the game became such a hit among everyone.

hombre_fatal 13 hours ago

1. Wordle's word list is going to be a lot more curated than TFA's word list because people want to guess words they use or have heard of, not "aahed".

2. Only a tiny group of people care to "card count" Wordle to rule out words that have already been played because they think that sort of min/maxing is fun. Most people don't even think about that, so whether Wordle reuses words every few years is trivial to them.

  • deanputney 12 hours ago

    I will say that having used the same starter word the whole time that has not come up yet, it's a little disappointing that it may now take even longer to appear.

  • tzs 11 hours ago

    > Wordle's word list is going to be a lot more curated than TFA's word list because people want to guess words they use or have heard of, not "aahed"

    The Times sure doesn't think that about the people who do Letter Boxed. One LB had "polymethylmethacrylate" in its dictionary.

    I've saved the daily dictionaries since 2024-03-30, and that's the longest word out of the 93 393 total distinct words in the 674 dictionaries I've saved. They average 1199.47 words per dictionary.

    They have some truly ridiculous words, such as "troughgeng". WTF is a troughgeng? Googling that gives a couple of pages in Chinese (or a similar looking language) and a Scottish dictionary entry for "Throu", which in one of the examples of "throu" as an adverb lists a bunch of phrases it is used in, including:

    > (8) througang, throw-, throoging, trough-geng, -geong (Sh., Ork.), (i) a going over or through; a passage (I.Sc. 1972); specif. (ii) a narration, a recital (of a story); (iii) a full rotation of crops, a shift; (iv) a thoroughfare, lane, passageway, corridor open at either end (Sc. 1808 Jam.; Sh. 1908 Jak. (1928); Rxb. 1923 Watson W.-B.; Ork., w.Lth., wm.Sc. 1972). Also attrib.; (v) = (5); (vi) energy, drive (Bnff. 1866 Gregor D. Bnff. 192);

  • pseudosavant 11 hours ago

    The Wordle list is available here (in addition to many other places): https://github.com/pseudosavant/ps-web-tools/blob/main/wordl...

    • venusenvy47 7 hours ago

      Has anyone confirmed if they still use only this original list? I would think the NY Times could change the word list however they choose.

      • pseudosavant 38 minutes ago

        I believe these lists are more like what is described in the blog post. Dictionary words, filtered to 5-letter words, no plurals, etc. It most likely has 99%+ of the words, but maybe some they don't actually use in Wordle.

      • Rebelgecko 6 hours ago

        They changed some words pretty much right after the acquisition. There was some controversy when they started doing "themed" words (like Christmas stuff in December) vs more "random" words. Some words were also removed for having negative vibes/political liability

  • thaumasiotes 8 hours ago

    > people want to guess words they use or have heard of, not "aahed"

    That isn't a correct diagnosis; people have heard of aahed. You'll find it naturally in the expression "[someone] oohed and aahed".

    People don't want aahed, and their instinct that it shouldn't count is reasonable, but unfamiliarity isn't the problem with it.

    • zarzavat 8 hours ago

      Ooh and aah aren't words, they're sounds (onomatopoeia). A sound is just a sequence of letters used for their phonological values.

      You can spell the sound "ah" however you like: ah, ahh, aah, aahh, there's no wrong way to spell it.

      If you write "the washing machine tringged when it finished", 'tring' is not a word, even though it's following the rules of English morphology, you could have written any sequence of letters that most faithfully reproduces the sound of the washing machine. You could have written katrigged or puh-tringged.

      • paulcole 5 hours ago

        Ooh and aah most certainly are words. Is meow not a word? Can I spell it miough and sit smugly correct?

      • thaumasiotes 8 hours ago

        That is false; the fact that you can conjugate aah (or tring) into the past tense is sufficient to prove it's a word.

oliwary 37 minutes ago

In https://squareword.org (2D variant) I was also running into this problem. It's a bit different though, since I need to find valid 5x5 squares, with 5 words down and 5 across. Surprisingly, there is quite a limited number of such squares.

I've been able to solve it by slowly injecting more challenging words over time, which has the side effect of also introducing a difficulty gradient. Players seem happy so far :)
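For readers unfamiliar with the constraint, a minimal sketch of the validity check for such a square: every row and every column must be a dictionary word. The helper name `is_word_square` and the tiny five-word dictionary are illustrative assumptions and have nothing to do with squareword.org's actual generator.

```python
def is_word_square(rows: list[str], dictionary: set[str]) -> bool:
    """True if every row and every column of the grid is a known word."""
    n = len(rows)
    if any(len(row) != n for row in rows):
        return False
    columns = ["".join(row[i] for row in rows) for i in range(n)]
    return all(word in dictionary for word in rows + columns)

# A classic symmetric 5x5 square: rows and columns read the same.
words = {"heart", "ember", "abuse", "resin", "trend"}
square = ["heart", "ember", "abuse", "resin", "trend"]
print(is_word_square(square, words))  # True
```

Finding squares is the hard part; checking one is cheap, which is why a brute-force search over a small word list quickly shows how few valid grids exist.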

furyofantares 12 hours ago

"Crisis" is a massively overblown word for this. And the "wordle community" is a drop in the bucket of regular players, and not remotely representative.

I did have a similar reaction personally to the "exciting news" framing but I'm not actually sure it's wrong. The original list of words was an excellent list, and it's been over 4 years.

  • teeray 11 hours ago

    > "Crisis" is a massively overblown word for this.

    Given that it is Wordle, “panic” would be a far more appropriate word.

    • alexfoo 11 hours ago

      Alarm, dread, scare, shock, start, worry.

      • hyperbovine 11 hours ago

        Alarm is a good guess. On average I can solve a wordle in 3.6 turns when I start with this guess.

        • maxkfranz 4 hours ago

          I used to use “stare” or “stale” as the starting guess when I played Wordle, thinking you’d want to start off with the most common letters, like R-S-T-L-N-E from Wheel of Fortune.

          • manarth 2 hours ago

            "stale" was used a while back - since then I've been starting with "slate"

        • ted_bunny 8 hours ago

          Repeated letters are wasted utility. Wouldn't that make it suboptimal for a first move?

          • muti 7 hours ago

            Suboptimal - likely. There is some utility: a green letter is more useful than a yellow. Checking for a in two locations when a is a very commonly used letter is useful. Still likely much more useful to check for the presence of a fifth letter than a chance at knowing more precisely the location of an a.

          • kQq9oHeAz6wLLS 6 hours ago

            I used to use alert, until it was the word one day (got it in one!). Then I switched.

            Apparently I should switch back, since it could be the word again.

            • CamperBob2 3 hours ago

              I always used the previous day's word as the starting word. IMHO that should have been how the game worked all along.

sowbug 13 hours ago

It seems about right. They reshuffled the deck about three-quarters of the way through (1689 ÷ 2315 ≈ 73.0%). Blackjack shoes are typically shuffled around the same point. Different games, but similar considerations in this respect.

A1kmm 2 hours ago

They just need more bits of entropy - going from IPv4 to IPv6 involved quadrupling it, but this transition is much more minor. They could just go to 6 characters for now, and go to 7 later.

  • jedberg an hour ago

    6 characters would be vastly harder. You'd need more than six rows for sure.

brikym 13 hours ago

For my game redactle.net, I blacklist the Wikipedia article for 2 years. I figure there is a tradeoff between novelty and allowing the pool of articles to shrink. The Wikipedia vital level 4 category has 10k articles and probably half of them actually meet the criteria (length, number of languages etc) for making the cut.

huhtenberg 11 hours ago

Seems like a good post to plug a recent find and my new favourite -

https://puzzlist.com/stackdown

It's from the person who made https://wafflegame.net if you are familiar with it, one of many that came on the tails of the original Wordle.

In comparison, the Stackdown is less rushed and way more rewarding when solved. Also, more interesting in structure.

  • bmalicoat 2 hours ago

    That's cool to see. I made a mobile game, Downwordly, that has the same mechanic in its puzzle mode. It came out almost five years ago and still has a decent set of versus players.

    I'm more proud of a later word game that you can play free at https://wellwordgame.com/en If you give it a try, let me know what you think!

  • timenotwasted 8 hours ago

    Hopefully this is an ok place to plug my own word game, https://spellrush.com/. It's very different from Wordle but that was a conscious decision since there are so many clones out there these days. Really wanted to put a fresh spin on word games.

  • mastermedo 10 hours ago

    Stackdown seems very hard. Took me over 10 min for today's puzzle.

werdnapk 9 hours ago

I've been waiting years for my word to be my first guess and still nothing... it's been my opener for years. I know my word hasn't been used as I've checked the list of used words.

So for me, reusing words is not what I want to hear.

pseudosavant 11 hours ago

I've used my own tool (https://pseudosavant.github.io/ps-web-tools/wordle-solver/) for understanding how many words are left after each guess. It'll show hints if you want them too, but they are disabled by default. I like understanding how my guesses reduce the word space well (or not).

It uses the list of all of the words that can be in Wordle, and there are so many words I can't imagine anyone guessing. And I come from a family with large vocabularies.
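The "how many words are left after each guess" idea can be sketched as follows. This is not the linked tool's code: `feedback` and `remaining` are hypothetical helpers, the pattern uses 'g' for green, 'y' for yellow and '-' for grey, and the five-word candidate list is a toy stand-in for the real one.

```python
from collections import Counter

def feedback(guess: str, secret: str) -> str:
    """Return the colour pattern Wordle would show for a guess."""
    result = ["-"] * len(guess)
    # Secret letters that were not matched exactly stay available for yellows.
    unmatched = Counter(s for g, s in zip(guess, secret) if g != s)
    for i, (g, s) in enumerate(zip(guess, secret)):
        if g == s:
            result[i] = "g"
    for i, g in enumerate(guess):
        if result[i] == "-" and unmatched[g] > 0:
            result[i] = "y"
            unmatched[g] -= 1
    return "".join(result)

def remaining(candidates: list[str], guess: str, pattern: str) -> list[str]:
    """Keep only the candidates that would have produced this pattern."""
    return [w for w in candidates if feedback(guess, w) == pattern]

words = ["alert", "alarm", "stale", "slate", "stare"]
pattern = feedback("stale", "slate")
print(pattern)                             # gygyg
print(remaining(words, "stale", pattern))  # ['slate']
```

Handling greens before yellows matters for repeated letters: guessing "alarm" against "alert" marks the second "a" grey, because the only "a" in the secret is already accounted for by the green.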

fercircularbuf 11 hours ago

My friend and I labored over the word lists for our word game subletters.fun. We wanted the word pairs and at least one optimal path for each word pair to be from words on one list, which were simpler words that we would expect everyone to be familiar with. But players could use their own more advanced vocabulary to solve the puzzles on their own without feeling restricted. Then we bundled literally 10 years of unique word pairs into the game and shipped it.

iambateman 6 hours ago

This is lame. The original creator of Wordle would’ve been more Spiny.

geophile 12 hours ago

The analysis misses a point. Wordle uses two lists of five letter words: words that are in the dictionary, and can be used in a guess; and those that can be used as the daily secret word. The latter list is smaller, and sticks to more common words. Wordle has been around for 1550 days, so they have used 67% of the possible words. In another couple of years, they have to either start using uncommon words, or recycle. There's no rush, so it's unclear why this is happening now.

jackgavigan an hour ago

Yet the current word list apparently doesn't include "Irish" (even though Welsh, Scots and Brits are all valid). ¯\_(ツ)_/¯

windowshopping 7 hours ago

If anyone's looking for new word games, I built The Daily Baffle which might appeal to some of you. Check it out at dailybaffle.com!

angry_octet 11 hours ago

At the risk of being accused of obscurantism, I would like to know more of the words on the 5-letter list that are excluded by Microsoft Word.

BurningFrog 11 hours ago

I'm surprised they weren't reusing words already.

Obviously a finite resource will run out after a while.

tuwtuwtuwtuw 13 hours ago

I am guessing a high percentage of wordle players prefer a wordle version which uses common words, and the New York Times would prefer to cater to those, rather than a smaller group of enthusiasts.

znkynz 13 hours ago

Connections is better anyway.

  • AndrewDucker 13 hours ago

    It's a very different kind of game. I don't think it's at all comparable.

  • dobladov 12 hours ago

    My favorite right now is https://tiledwords.com/, not affiliated to it in any way, I just enjoy it.

    • paulhebert 12 hours ago

      Hey, thanks! I’m glad you’re enjoying it! (I’m the creator)

      • abosherid 7 hours ago

        It’s become tradition in my house to play tiled words with my wife just before bed. It’s the last thing we do together before falling asleep each night! Thanks for bringing us together with a bit of joy!

  • AnotherGoodName 13 hours ago

    I recommend anything at https://www.merriam-webster.com/games for these sorts of games. Lots of wordle variations and all ad-free.

    • BurningFrog 11 hours ago

      I find Quordle a much better game than Wordle, since there is some real strategy involved, but still not overly much.

  • Lerc 12 hours ago

    Connections is infuriating.

    Not only are they using regional specific knowledge, but they use regional relative concepts.

    Many people do not agree that ant rhymes with aunt.

    The recent Homophones of words meaning brutal.

    Gorey, Grimm, Grizzly, Scarry.

    I am guessing that Grimm is an eponym, which makes it nebulous at best; eponyms take a lot of use to be regarded in objective terms rather than as invoking an arbitrary property of the name holder. Kafkaesque rises to that use. I don't think Grimm does.

    I have no idea if Scarry is supposed to be a homonym for scary. Which it neither sounds like nor means brutal.

    Perhaps there is another word that means brutal that sounds like however the person who makes connections thinks Scarry is pronounced.

    In which case it would be a homonym of a synonym of brutal.

    I also do not live in the same country as only connect, yet do not have such issues with their walls.

    The real problem is that while you might be wrong about an answer, once you lose faith that the puzzle setter is right, you can never be sure if your guess is wrong or they are wrong. It is no longer a puzzle and you are playing 'what have I got in my pocket?'.

    • wedog6 8 hours ago

      'Grimm' is a homophone of 'grim', 'Grizzly' is a homophone of 'grisly', 'Scarry' is a homophone in US English of 'scary', 'Gorey' is a homophone of 'gory'.

      'Gory', 'grisly', 'grim' and 'scary' do all roughly mean brutal.

      'Grimm' as the name of the brothers is a red herring connection, with Gorey and Scarry also names of children's authors.

      • Lerc 8 hours ago

        Gory, grisly and grim can be seen as synonymous on an axis maybe close to brutal. They refer to the appearance. Brutal evokes the action that happened. The other words are about how things ended up.

        An autopsy can be gory, grisly and depending on circumstances, grim. It is not brutal.

        Scary is about a state of mind.

        So you have appearance, appearance, appearance, and state-of-mind being considered similar to an action descriptor.

    • jeffgreco 10 hours ago

      Isn't the point of homophones that they sound like the equivalent word, thus gory, grim, grisly, scary?

      • extra88 9 hours ago

        I think the confusion is about what "Gorey, Grimm, Scarry" mean. They, along with "Silverstein" in that game, are last names of children's authors.

        • Lerc 7 hours ago

          And that would be OK as a clue if Silverstein was a red herring, Grizzly was also a children's author and Scarry sounded like scary (and also meant something in the same ballpark as Gory, Grim, and Grisly)

          • quuxplusone 4 hours ago

            Richard Scarry's surname is indeed pronounced "scary," rather than (as I assumed for many years) "scarr-ry."

            That is, it rhymes with Harry, Larry, carry, parry, tarry, and marry, rather than... uh, starry, I guess?

            • quesera 3 hours ago

              Where I come from, Scarry rhymes with Harry, but Harry does not rhyme with scary.

                Harry does not rhyme with hairy
                Scarry does not rhyme with scary
                Marry does not rhyme with Mary. Nor with merry!
              
              You can probably triangulate my childhood home with that information. :)

fortran77 4 hours ago

I start with the same word every day. I hate to change it, because I want the joy of getting the wordle in 1 someday.

croisillon 11 hours ago

is "valew" related to the Brazilian "valeu", expressing gratitude/satisfaction?

  • thaumasiotes 8 hours ago

    Depends on your point of view.

    The most direct thing we can say is "no, because there is no such word as valew". It's not in Merriam-Webster, it's not in Samuel Johnson's 18th-century dictionary, it's not in the Collins dictionary (for British English).

    It is in the Oxford English Dictionary, where it is noted as a "[spelling] variant of value" from the 14th century. It has never been a word with any other meaning than that of value, and using it now would be a pure error if someone used it, which obviously nobody will ever do. Accepting it in Wordle makes as much sense as accepting vvest on the theory that that was an acceptable spelling of west in the past.

    There is an etymological connection between Portuguese valeu and English value, in that they both descend from Latin valeo, but value has no sense of gratitude or satisfaction. (I'm guessing the blog author was misled by https://en.wiktionary.org/wiki/valew#Portuguese , which says that valew is Portuguese internet slang for valeu.)

Aardwolf 13 hours ago

Every now and then I play quordle, octordle, and once a thousand-word variation (which breaks down gameplaywise to just getting every letter at every spot).

A bit of reuse of the same word in the one-word version can't hurt I think

arcfour 11 hours ago

It doesn't beg the question, it raises it. Begging the question is a type of logical fallacy in which you assume the truth of your conclusion. It doesn't mean something "begs for the question to be asked."

I have no idea why this incorrect use of the term drives me so nuts; however, you'd think a blog post about English words and Wordle wouldn't make this mistake.

  • slibhb 11 hours ago

    I agree with you. But it's clear that "begging the question" is going the way of "literally," and there's (sadly) nothing we can do about it.

    I suppose some time in the future, someone will invent a new phrase meaning "assuming your conclusion".

    • viccis 3 hours ago

      At what point did dictionaries providing descriptive views of the English language turn into a prescriptive one that emboldens people to just point to repeated wrong usage rather than admit they were wrong?

    • fsckboy 7 hours ago

      assuming your conclusion, why would we need a new phrase?

    • arcfour 11 hours ago

      Well, I for one won't be party to it. I think informing everyone I can is my drop in the bucket in the fight against the incorrect usage of words. :-)

      • mauvehaus 8 hours ago

        When you win that battle, would you please fight iOS predictive text vs proper apostrophe use next?

  • Ericson2314 11 hours ago

    I think the idea was NYT was trying to imply they were running out.

    To me, "begging the question" doesn't mean assuming the conclusion in particular, it just means that some of the premises used are less obvious than they are being passed off as. Assuming the conclusion is merely an especially egregious form of that.

    • arcfour 11 hours ago

      I was objecting to the incorrect use of the phrase at the end of the article.
