Ask HN: CLI tool to download webpages and convert to Markdown?
Is there a CLI tool that downloads webpages and converts them to Markdown?
I've just started using Obsidian https://obsidian.md/ and would like a way to save interesting blog posts and articles.
I'd like to add that Calibre (ebook-convert on the CLI) has a mode for outputting a .txt file with Markdown formatting. It can take HTML as input.
I tried that on a Hacker News page and the content of the output file foo.md was in HTML.
Sorry, had only tested it on my own site, which worked as expected:
I find Joplin https://joplinapp.org does a good job of producing Markdown from web pages and already has sync capability built in.
Looks like it would be ideal for working with Obsidian.
Exported my Joplin Markdown and opened it up in Obsidian. Works like a dream, and the best part is Joplin already has its own web clipper. Seems like a superb match.
I made this a couple of weeks ago: https://www.npmjs.com/package/@dougskinner/markdowner
Thank you, Anand. Despite being 8 years old, Aaron's html2text.py worked perfectly to convert the HN homepage to Markdown. His memory (and code) continues to be a blessing!
From "Converting HTML to Markdown using Pandoc" (http://www.cantoni.org/2019/01/27/converting-html-markdown-u...):
curl --silent https://example.com/foo.html | pandoc --from html --to markdown_strict -o foo.md
curl --silent https://tinyapps.org/ | pandoc --from html --to markdown_strict -o index.md
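For saving more than one page, the curl | pandoc pipeline above could be wrapped in a small script. The following is only a rough sketch, not something posted in the thread: it assumes pandoc is installed and on PATH, and the function names (output_name, pandoc_args, save_as_markdown) and the rule for deriving the output filename are made up for illustration. The pandoc flags are the same ones used in the commands above.

```python
import subprocess
import urllib.request
from pathlib import PurePosixPath
from urllib.parse import urlparse


def output_name(url: str) -> str:
    """Derive a Markdown filename from the last URL path segment.

    Hypothetical naming rule: fall back to "index" when the path is empty.
    """
    stem = PurePosixPath(urlparse(url).path).stem or "index"
    return f"{stem}.md"


def pandoc_args(out_path: str) -> list[str]:
    """Build the pandoc argument list with the flags quoted in the thread."""
    return ["pandoc", "--from", "html", "--to", "markdown_strict", "-o", out_path]


def save_as_markdown(url: str) -> str:
    """Fetch a page and pipe its HTML through pandoc (assumed to be on PATH)."""
    out_path = output_name(url)
    with urllib.request.urlopen(url) as response:
        html = response.read()
    # pandoc reads the HTML from stdin, like the right-hand side of the pipe.
    # check=True raises CalledProcessError if pandoc exits with an error.
    subprocess.run(pandoc_args(out_path), input=html, check=True)
    return out_path
```

Calling save_as_markdown("https://tinyapps.org/") would write index.md, mirroring the second command above. pandoc expects UTF-8 input, so pages served in another encoding may need to be decoded and re-encoded first.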