Show HN: BBC Recipe search engine. Open source, super fast

auntiesrecipes.co.uk

12 points by user24 10 years ago · 7 comments

sheraz 10 years ago

In other recipe news, some devs at the NYTimes recently released some VERY interesting code that handles unstructured data from the NYTimes recipe archives:

CRF Ingredient Phrase Tagger https://github.com/NYTimes/ingredient-phrase-tagger

I used to work with a lot of recipe data in multiple languages, so this topic remains close to my heart.

This could be an interesting project to add to the trained data.

  • user24OP 10 years ago

    Very cool. I was thinking of expanding mine to work with other recipe sites, so this may come in handy, thanks!

user24OP 10 years ago

On Tuesday the Guardian reported that the BBC (fondly aka Auntie) would be archiving their recipes, so I quickly scraped the site and wrote a search engine for it.

Not sure how the site will evolve, if at all, but it was a fun side project!

Code is here if you want to play: https://github.com/user24/auntiesrecipes

  • rajington 10 years ago

    I thought archiving meant it would no longer be on their website. It would be awesome if they just released it all under something like the GPL. Could do some fun machine learning stuff with it...

    • user24OP 10 years ago

      I'd love them to release it as open data so that I'm not in murky waters.

      If they move the recipes, I can update the links. If they take the recipes down (which they've now said they won't), I have all the data, so I could rehost them.

      edit: but, if you want to do some fun ML work, my scraper should help get you started!

  • spacecowboy_lon 10 years ago

    Cool - you might want to look at using the Levenshtein distance algorithm to improve the search.

    • user24OP 10 years ago

      Nice idea. Eventually, if I add more recipes, I'll need to move the search server-side too.
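
For anyone curious what the Levenshtein suggestion could look like in practice, here is a minimal sketch in Python. It is not taken from the auntiesrecipes code: the function names `levenshtein` and `fuzzy_search`, the `max_distance` threshold of 2, and the sample titles are all assumptions made for illustration. The idea is to compare the query against each word of a recipe title and keep titles that contain a near match, so a typo like "lasanga" can still find "lasagna".

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,          # delete ca
                current[j - 1] + 1,       # insert cb
                previous[j - 1] + cost,   # substitute ca with cb (or match)
            ))
        previous = current
    return previous[-1]


def fuzzy_search(query, titles, max_distance=2):
    """Return titles with a word within max_distance edits of query, closest first.

    Hypothetical helper: the threshold and word-level matching are assumptions,
    not how the original site ranks results.
    """
    query = query.lower()
    scored = []
    for title in titles:
        words = title.lower().split()
        if not words:
            continue
        best = min(levenshtein(query, word) for word in words)
        if best <= max_distance:
            scored.append((best, title))
    scored.sort(key=lambda pair: pair[0])  # stable sort keeps input order on ties
    return [title for _, title in scored]


if __name__ == "__main__":
    sample_titles = ["Classic lasagne", "Lemon drizzle cake", "Vegetable lasagna"]
    print(fuzzy_search("lasanga", sample_titles))
```

This compares the query against every word of every title, which is fine for a few thousand recipes searched in the browser but would probably want an index or a server-side search engine as the collection grows.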
