Settings

Theme

Download Coursera videos in batch

github.com

73 points by jplehmann 14 years ago · 27 comments

Reader

jplehmannOP 14 years ago

http://coursera.org is creating some fantastic, free educational videos (algorithms, machine learning, natural language processing, SaaS).

This script allows one to batch download videos for a Coursera class. Given a class name and related cookie file, it scrapes the course listing page to get the week and class names, and then downloads the related videos into appropriately named files and directories.

Why is this helpful? Before I was using wget, but I had the following problems:

  1. Video names have a number in them, but this does not correspond to the
     actual order.  Manually renaming them is a pain.
  2. Using names from the syllabus page provides more informative names.
  3. Using a wget in a forloop picks up extra videos which are not posted/linked,
     and these are sometimes duplicates.
Naming is intentionally verbose, so that it will display and sort properly using MX Video on my Andriod phone.

Inspired in part by youtube-dl (http://rg3.github.com/youtube-dl) by which I've downloaded many other good videos such those from Khan Academy.

Let me know if you like it.

  • alexsb92 14 years ago

    Awesome! I was actually planning on writing such a script over the weekend. I haven't take a look at this semester's courses, but I know last semester the quizzes and tests were quite useful for someone with no previous practice in the subject at hand. I can see your script doesn't try to get all that right?

    In that case i'll still have a weekend project.

    • jeremyarussell 14 years ago

      In the NLP class there are programming assignments with special formatting, headers, etc. I kind of want to write a script that uses NLP to snag NLP's programming instruction pages (as well as example code, etc.) Seems like that would be fun to do.

      • alexsb92 14 years ago

        But in that case wouldn't you be looking to get the essence, the plain text useful stuff of an HTML document, in which case wouldn't parsing using regular expressionism or something be better than NLP? I haven't really done scraping and parsing of documents/text so I'm not too sure.

        • jeremyarussell 14 years ago

          It's possible yeah, though I like the formatting and highlighting and borders etc, it groups the different sections of the instructions together.

          I see what you mean though, it's not really full NLP either way, I just used that term in place of regular expressions because it was in the NLP class that I learned about them (first homework is a phone and email scraper.) Probably my fault for using semantics wrong.

    • jplehmannOP 14 years ago

      Only support for videos right now.

      • jplehmannOP 14 years ago

        Update -- now downloads all lecture materials on the videos page (pptx, pdf, etc).

nzmsv 14 years ago

Some shameless self-promotion: I wrote a Chrome extension for downloading Udacity videos (http://nzmsv.github.com/udacity-dl/). If there's any interest in a batch version I could look into it. Alternatively, feel free to write it and let me know :)

dmn001 14 years ago

I use the downthemall firefox extension and to keep the videos in order I add a number to the renaming mask:

  *num*_*name*.*ext*
I like how jplehmann's tool can rename them using the titles on the page.
fvieira 14 years ago

You can also check my script over here: https://github.com/fvieira/coursera_resources_downloader It has the advantage of not requiring a cookies file, it can authenticate with your user and password. Otherwise, it does pretty much the same as jplehmann's script, although with some minor changes which you might or might not like.

By the way, congratulations on your script, jplehmann! Wish I had found yours before losing time doing mine...

jeremyarussell 14 years ago

I'm already using it, after sometime I got a connection forcibly closed by remote host error. I can't access the Coursera website either, not sure why though. (mayhaps a bunch of people suddenly using this script crashed their servers? or they blocked us)

It's back up, must have been a small glitch. Might I add that I love the fact the script picked up on the video I dropped earlier.

christiangenco 14 years ago

Nice! I actually found your project last week through google but wrote my own in js (https://gist.github.com/2225519) after struggling with the python dependencies.

I think coursera really needs to come out with a native solution and a standard way of numbering/organizing videos.

tuxguy 14 years ago

I have tried all of the projects mentioned in the comments here.

To me the most simple & quick was this bookmarklet.

https://github.com/christiangenco/Coursera-Video-Downloader-...

radicade 14 years ago

Another version that downloads coursera videos, and also optionally downloads slides and subtitles:

https://github.com/LoganDing/Coursera.org-Downloader

jwr 14 years ago

Thank heavens! Er, I mean thank you (the OP) for this tool. I already wrote scripts that renamed files to something sane, but this will make my life so much easier.

  • jplehmannOP 14 years ago

    That's great -- I am happy if just a couple of other people get use out of it as well.

checkraiser 14 years ago

Do you know which technologies to create the interactive video lectures (subtitles + quizzes ) like those in Coursera courses ?

themonk 14 years ago

Thanks a lot, does it skip downloaded videos in next run?

floggit 14 years ago

Can someone create an iOS app for this please ?

A_A 14 years ago

this looks sweet. Thanks. I had been manually downloading the vids on my PC, but I hope this tool will now reduce the pain.

pimentel 14 years ago

How does this work with in-video quizzes?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection