BeautifulSoup for Coursera
Siddhant Shrivastava
February 17, 2015
Filed under “Hacker Spirit”
In a significant bandwidth-saving move, the MOOC organization Coursera removed the downloadable links from its course videos page. Here are the implications:
Earlier
Use the DownThemAll Firefox plugin/coursera-dl/wget to get all links to all course videos and start downloading.
Current Scenario
The student has to take a scattershot approach through the UI maze to reach the online video player. There is no single page with all downloadable links.
How does this affect the MOOC user?
- Courses will be chosen wisely
- Watch only what you need.
- Salvaged Bandwidth - content loads faster
Workaround
I am trying to learn and use the Python-based parsing library BeautifulSoup to parse the html pages and extract the video links at one place so that the tools like DownThemAll become instrumental to this end again.
It’s only a matter of time that this impending inconvenience be resolved - await the nifty hacks!
Till then, Ta!