887 Commits (0b63aed8dfd36a5a7f5ae6518b9c385d9a43e735)

Author SHA1 Message Date
Jaime Marquínez Ferrándiz 78fb87b283 Don't accept '>' inside the content attribute in OpenGraph regexes 11 years ago
Jaime Marquínez Ferrándiz ab2d524780 Improve the OpenGraph regex
* Do not accept '>' between the property and content attributes.
* Recognize the properties if the content attribute is before the property attribute using two regexes (fixes the extraction of the description for SlideshareIE).
11 years ago
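A minimal sketch of the two-regex idea from the commit above, assuming simplified patterns rather than the ones actually committed. The names `og_regexes` and `og_search` are hypothetical. `[^>]` keeps a match inside a single tag, and the second pattern covers a `content` attribute that precedes `property`.

```python
import re


def og_regexes(prop):
    """Hypothetical helper: build one pattern per attribute order."""
    # The content value may not contain '>' or its own quote character.
    content_re = r'content=(?:"([^>"]+?)"|\'([^>\']+?)\')'
    property_re = r'property=[\'"]og:%s[\'"]' % re.escape(prop)
    # [^>]+? never crosses a '>' so both attributes must be in the same tag.
    template = r'<meta[^>]+?%s[^>]+?%s'
    return [
        template % (property_re, content_re),  # property before content
        template % (content_re, property_re),  # content before property
    ]


def og_search(prop, webpage):
    """Hypothetical helper: return the property value, or None if absent."""
    for pattern in og_regexes(prop):
        m = re.search(pattern, webpage)
        if m:
            # Exactly one of the two quote-style groups is set.
            return next(g for g in m.groups() if g is not None)
    return None
```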
Jaime Marquínez Ferrándiz 85d61685f1 [tvp] Update the title and the description of the test video 11 years ago
Jaime Marquínez Ferrándiz b9643eed7c [youtube:channel] Fix the extraction of autogenerated channels
The ajax pages are empty, so it now looks directly at the channel's /videos page.
11 years ago
Philipp Hagemeister 0e145dd541 Merge branch 'master' of github.com:rg3/youtube-dl 11 years ago
Philipp Hagemeister 9f9be844fc [youtube] Fix protocol-independent URLs (Fixes #1768) 11 years ago
Jaime Marquínez Ferrándiz e3b9ab5e18 [soundcloud] Set the correct extension for the tracks (fixes #1766)
Some tracks are not in mp3 format, they can be wav files.
11 years ago
Jaime Marquínez Ferrándiz c66d2baa9c [livestream] Add an extractor for the original version of livestream (closes #1764)
The two versions use different systems.
11 years ago
Jaime Marquínez Ferrándiz ca715127a2 Don't assume the 'subtitlesformat' is set in the params dict (fixes #1750) 11 years ago
Jaime Marquínez Ferrándiz ea7a7af1d4 [gamekings] Fix the test video checksum 11 years ago
Philipp Hagemeister c3a3028f9f [tvp] Minor improvements (#1730) 11 years ago
Philipp Hagemeister 6c5ad80cdc Merge remote-tracking branch 'saper/tvp' 11 years ago
Philipp Hagemeister 384b98cd8f [gamekings] Minor fixes (#1759) 11 years ago
Jelle van der Waa eb9b5bffef Add extractor for gamekings.tv 11 years ago
migbac 0bd59f3723 Add support for d8.tv 11 years ago
Jaime Marquínez Ferrándiz 8b8cbd8f6d [vine] Fix uploader extraction 11 years ago
Philipp Hagemeister eb0a839866 [common] Simplify og_search_property 11 years ago
Jaime Marquínez Ferrándiz 0ed05a1d2d Use the 'rtmp_live' field for the live parameter of rtmpdump 11 years ago
Jaime Marquínez Ferrándiz 1008bebade Merge remote-tracking branch 'rzhxeo/rtmpdump_live' 11 years ago
Jaime Marquínez Ferrándiz be6dfd1b49 [ted] Return a single info_dict for talks urls
It failed with the --list-subs option
11 years ago
Jaime Marquínez Ferrándiz 231516b6c9 Merge pull request #1705 from iemejia/master
[ted] support for subtitles
11 years ago
Jaime Marquínez Ferrándiz fb53d58dcf Merge pull request #1726 from saper/escaped
Fix AssertionError when og property not found
11 years ago
Jaime Marquínez Ferrándiz f470c6c812 [arte] Improve the format sorting
Also use the bitrate.
Prefer normal version and sourds/mal version over original version with subtitles.
11 years ago
Jaime Marquínez Ferrándiz 566d4e0425 [arte] Make sure the format_id is unique (closes #1739)
Include the bitrate and use the height instead of the quality field.
11 years ago
Jaime Marquínez Ferrándiz 81be02d2f9 [cnn] Accept www.cnn.com urls (fixes #1740) 11 years ago
Jaime Marquínez Ferrándiz c2b6a482d5 [brightcove] The format function requires the index to be specified in Python 2.6 11 years ago
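For context on the commit above: auto-numbered `{}` fields in `str.format` were only added in Python 2.7 and 3.1, so code that must run on 2.6 has to number its fields. A small sketch, with a hypothetical function name and message text:

```python
def build_label(player_id, video_id):
    # Hypothetical example. Explicit indices ({0}, {1}) work on Python 2.6
    # and later; bare '{}' fields raise ValueError on 2.6.
    return 'player {0}, video {1}'.format(player_id, video_id)
```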
Jaime Marquínez Ferrándiz 12c167c881 [soundcloud] Allow downloading tracks marked as not 'streamable'
They use the rtmp protocol, but if they are marked as 'downloadable' it can use the direct download link.
11 years ago
Jaime Marquínez Ferrándiz 20aafee7fa [kankan] Fix the video url
It now requires two additional parameters, one is a timestamp we get from the getCdnresource_flv page and the other is a key we have to build.
11 years ago
Jaime Marquínez Ferrándiz dd5bcdc4c9 [brightcove] Set the 'Referer' header if the url has the 'linkBaseUrl' parameter (fixes #1553) 11 years ago
Jaime Marquínez Ferrándiz b1a80ec1a9 [xnxx] Accept urls that start with 'www' (fixes #1734) 11 years ago
Jaime Marquínez Ferrándiz 51040b72ed [brightcove] Support redirected urls from bcove.me (fixes #1732)
'bctid' needs to be changed to '@videoPlayer', and 'bckey' to 'playerKey'.
11 years ago
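The parameter renaming described in the commit above could look roughly like this. This is a sketch only: it assumes Python 3's `urllib.parse`, and the function name and example URL are hypothetical, not taken from the extractor.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Mapping taken from the commit message.
RENAMES = {'bctid': '@videoPlayer', 'bckey': 'playerKey'}


def rename_brightcove_params(url):
    """Hypothetical helper: rename query keys, keeping values and order."""
    parts = urlsplit(url)
    query = [(RENAMES.get(key, key), value)
             for key, value in parse_qsl(parts.query)]
    # urlencode percent-encodes '@', which parse_qsl decodes again on reading.
    return urlunsplit(parts._replace(query=urlencode(query)))
```

Keys that are not in the mapping pass through unchanged, so the helper is safe to apply to URLs that are already in the target form.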
Jaime Marquínez Ferrándiz 4f045eef8f [youtube:channel] Fix the extraction
The page doesn't include the 'load more' button anymore, so now we directly get the 'c4_browse_ajax' pages.
11 years ago
Jaime Marquínez Ferrándiz 5d7b253ea0 Add an extractor for eitb.tv (fixes #1608)
The BrightcoveExperience object doesn't contain the video id, so the extractor adds it and passes the url to BrightcoveIE.
11 years ago
Jaime Marquínez Ferrándiz b0759f0c19 [brightcove] Extract all the available formats 11 years ago
Jaime Marquínez Ferrándiz 065472936a Add an extractor for space.com (fixes #1718)
It uses Brightcove, but some videos require special processing to get a url with the playerKey field.
11 years ago
Jaime Marquínez Ferrándiz fc4a0c2aec [brightcove] Change the 'videoId' or 'videoID' field to '@videoPlayer' (fixes #1697)
It seems to be needed when using the htmlFederated page
11 years ago
Jaime Marquínez Ferrándiz eeb165e674 [brightcove] Add the extraction of the url from generic 11 years ago
Jaime Marquínez Ferrándiz 9ee2b5f6f2 tests: don't run the test if any of the extractors listed in the 'add_ie' field is marked as not working 11 years ago
Marcin Cieślak 5137ebac0b [tvp] Telewizja Polska: new extractor for tvp.pl, fixes #1719
Thanks-To: mplonski

https://github.com/mplonski/linux/blob/master/tvp-dl.py
11 years ago
Marcin Cieślak a8eeb0597b Fix AssertionError when og property not found
On tvp.pl some webpages contain OpenGraph
metadata and some don't.

If og property is not found, _og_search_description
fails with

WARNING: unable to extract OpenGraph description; please report this issue on http://yt-dl.org/bug
Traceback (most recent call last):
  File "/usr/home/saper/bin/youtube-dl", line 18, in <module>
    youtube_dl.main()
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 766, in main
    _real_main(argv)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 719, in _real_main
    retcode = ydl.download(all_urls)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 715, in download
    videos = self.extract_info(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 348, in extract_info
    ie_result = ie.extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 125, in extract
    return self._real_extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/tvp.py", line 56, in _real_extract
    info['description'] = self._og_search_description(webpage)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 331, in _og_search_description
    return self._og_search_property('description', html, fatal=False, **kargs)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 325, in _og_search_property
    return unescapeHTML(escaped)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/utils.py", line 494, in unescapeHTML
    assert type(s) == type(u'')
AssertionError

The patch allows me to use:

  try:
    info['description'] = self._og_search_description(webpage)
    info['thumbnail'] = self._og_search_thumbnail(webpage)
  except RegexNotFoundError:
    pass
11 years ago
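The traceback above ends in `unescapeHTML` being handed a non-string when the property is missing. One way to avoid that class of failure is to check for a missing match before unescaping. The sketch below illustrates this with the standard library only; `search_og_property` and its simplified regex are hypothetical, and `html.unescape` stands in for youtube-dl's own `unescapeHTML`. It is not the committed patch.

```python
import html
import re


def search_og_property(prop, webpage):
    """Hypothetical helper: return the unescaped value, or None if missing."""
    pattern = (r'<meta[^>]+property="og:%s"[^>]+content="([^">]*)"'
               % re.escape(prop))
    m = re.search(pattern, webpage)
    if m is None:
        # Nothing to unescape; report the absence instead of asserting.
        return None
    return html.unescape(m.group(1))
```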
Ismaël Mejía 4ed3e51080 [ted] fixed error in case of no subtitles present
I created a test, but I leave it commented since TED videos get
new subtitles frequently.
11 years ago
rzhxeo 2dcf7d8f99 [GenericIE] Also detect youtube if the src url of the iframe is enclosed in ' instead of " 11 years ago
Jaime Marquínez Ferrándiz 19b0668251 [canal2c] Accept more urls (fixes #1723)
The url only needs to have the 'idVideo' field in the query, in any position.
We have to set the 'void=oui' in the webpage url, so that we get the file name.
11 years ago
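Accepting the 'idVideo' field at any position in the query, as the commit above describes, is easier with a parsed query string than with a positional regex. A minimal sketch assuming Python 3's `urllib.parse`; the function name and the example.com URLs are hypothetical.

```python
from urllib.parse import parse_qs, urlsplit


def extract_id_video(url):
    """Hypothetical helper: return the 'idVideo' value, or None if absent."""
    query = parse_qs(urlsplit(url).query)
    values = query.get('idVideo')
    return values[0] if values else None


# The field is found regardless of where it appears in the query.
first = extract_id_video('http://example.com/video.asp?idVideo=11427&x=1')
last = extract_id_video('http://example.com/video.asp?x=1&idVideo=11427')
```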
Jaime Marquínez Ferrándiz e7e6b54d8a [teamcoco] Parse the xml file and extract all the formats 11 years ago
Jaime Marquínez Ferrándiz 2a1a8ffe41 Merge pull request #1693 from alexvh/teamcoco_fix
[teamcoco] Fix video url extraction for some videos
11 years ago
Philipp Hagemeister 08fb86c49b [youtube] Add description for YoutubeSearchDateIE (#1710) 11 years ago
Philipp Hagemeister 3633d77c0f Merge remote-tracking branch 'CBGoodBuddy/ytsearchtime' 11 years ago
Philipp Hagemeister 12ebdd1506 [viddler] Support non-digit IDs (Fixes #1714) 11 years ago
Jaime Marquínez Ferrándiz 1baf9a5938 Merge pull request #1698 from rzhxeo/cinemassacre
[CinemassacreIE] Support more embed urls
11 years ago
Jaime Marquínez Ferrándiz a56f9de156 Style fixes for extractors: remove spaces around (,),{ and } 11 years ago
Jaime Marquínez Ferrándiz fa5d47af4b Merge pull request #1679 from rzhxeo/mofosex
Add support for http://www.mofosex.com
11 years ago
Jaime Marquínez Ferrándiz d607038753 Merge pull request #1677 from rzhxeo/xtube
Add support for http://www.xtube.com
11 years ago
Jaime Marquínez Ferrándiz 9ac6a01aaf Merge pull request #1676 from rzhxeo/extremetube
Add support for http://www.extremetube.com
11 years ago
Jaime Marquínez Ferrándiz 9103bbc5cd Add the 'webpage_url' field to info_dict
The url of the video page; it must allow the result to be reproduced.
It's automatically set by YoutubeDL if it's missing.
11 years ago
Craig Markwardt a3dd924871 Add YoutubeSearchDateIE extractor to youtube.py & __init__.py, which searches by publication date. 11 years ago
rzhxeo 137bbb3e37 [XTubeIE] Add description to TEST 11 years ago
rzhxeo 86ad94bb2e [ExtremeTubeIE] Set age_limit to 18 and fix uploader extraction 11 years ago
Jaime Marquínez Ferrándiz 3e56add7c9 Merge pull request #1678 from rzhxeo/keezmovies
[KeezMoviesIE] Detect URLs with numbers in the SEO part correctly
11 years ago
Jaime Marquínez Ferrándiz f52f01b5d2 [brightcove] Don't set the extension
If the video only has the 'FLVFullLengthURL' key, it can still be an mp4 file.
11 years ago
Jaime Marquínez Ferrándiz 98d7efb537 [exfm] skip tests
The site is down too often.
11 years ago
Jaime Marquínez Ferrándiz cf51923545 [youtube] Remove vevo test
The video is no longer available and it seems that vevo videos don't use encrypted signatures anymore.
11 years ago
Ismaël Mejía 38fcd4597a Merge remote-tracking branch 'iemejia/master' 11 years ago
Jaime Marquínez Ferrándiz 165e3bb67a [bambuser] Add an extractor for channels (closes #1702) 11 years ago
Ismaël Mejía 38db46794f Merge branch 'ted_subtitles' 11 years ago
Ismaël Mejía a9a3876d55 [ted] Added support for subtitle download 11 years ago
Ismaël Mejía 1f343eaabb [subtitles] refactor to support websites with subtitle information in the webpage.

I added the parameter webpage, so now it's similar to the way automatic
captions are handled. This is an improvement needed for websites like
TED.
11 years ago
Jaime Marquínez Ferrándiz 72a5b4f702 Add an extractor for bambuser.com (#1702) 11 years ago
rzhxeo 0a43ddf320 [CinemassacreIE] Add live parameter to extracted info as a workaround 11 years ago
Philipp Hagemeister 8eddf3e91d [youtube] Encode subtitle track name in request (Fixes #1700) 11 years ago
Jaime Marquínez Ferrándiz 60d142aa8d Add an extractor for vk.com (closes #1635) 11 years ago
Jaime Marquínez Ferrándiz 66cf3ac342 [metacafe] Fix support for age-restricted videos (fixes #1696)
The 'Content-Type' header must be set for disabling the family filter.
The 'flashversion' cookie  is only needed for AnyClip videos.
Added tests for standard metacafe videos and for age-restricted videos.
Also set the 'age_limit' field.
11 years ago
rzhxeo ab4e151347 [CinemassacreIE] Support more embed urls 11 years ago
Alex Van't Hof ac2547f5ff [teamcoco] Fix video url extraction for some videos
Video url extraction failed for some videos,
e.g. http://teamcoco.com/video/old-time-baseball

The url extracted was also occasionally suboptimal quality,
e.g. http://teamcoco.com/video/louis-ck-interview-george-w-bush
11 years ago
Jaime Marquínez Ferrándiz 5f1ea943ab [livestream] fix the extraction of events
It now uses a json dictionary from the webpage.
11 years ago
Philipp Hagemeister 9f1109a564 [dailymotion] Fix support for age-restricted videos (Fixes #1688) 11 years ago
Philipp Hagemeister 72321ead7b [vevo] Readd support for SMIL (Fixes #1683) 11 years ago
Philipp Hagemeister b5d0d817bc Remove superfluous space 11 years ago
Filippo Valsorda b9a836515f Update the Vimeo test vector md5
Confirmed that this is indeed the first 10241 bytes (we went off by one with
byte range 0-10240) of the full, playing mp4, so they probably
reencoded or something.
11 years ago
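The off-by-one mentioned in the commit above follows from HTTP byte ranges being inclusive on both ends, so the range 0-10240 covers 10241 bytes. A short sketch of the arithmetic and of hashing a prefix of that length; both function names are hypothetical and not the test harness's own.

```python
import hashlib


def range_length(first, last):
    # HTTP byte ranges are inclusive on both ends.
    return last - first + 1


def md5_of_prefix(data, length):
    """Hypothetical helper: hex MD5 digest of the first `length` bytes."""
    return hashlib.md5(data[:length]).hexdigest()
```

A checksum computed over `range_length(0, 10240)` bytes therefore has to be compared against a digest of the first 10241 bytes of the file, not the first 10240.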
Jaime Marquínez Ferrándiz 21c924f406 [arte] Download the 'Originalversion' version if it's the only one available (fixes #1682) 11 years ago
Philipp Hagemeister e54fd4b23b [vevo] Add more format details 11 years ago
Philipp Hagemeister 912cbf5d4e [vevo] Fix timestamp handling
( / 1000 is implicit float division )
11 years ago
Philipp Hagemeister f7ff55aa78 Merge remote-tracking branch 'origin/master' 11 years ago
Philipp Hagemeister 795f28f871 [youtube] Fix login (Fixes #1681) 11 years ago
Jaime Marquínez Ferrándiz 321a01f971 [mtv] Remove the templates from the mediagen url 11 years ago
Jaime Marquínez Ferrándiz 2563bcc85c Add an extractor for MySpace (closes #1666) 11 years ago
rzhxeo dcc2a706ef Add support for http://www.xtube.com 11 years ago
rzhxeo 2bc67c35ac [KeezMoviesIE] Detect URLs with numbers in the SEO part correctly 11 years ago
rzhxeo 77ae65877e Add support for http://www.mofosex.com 11 years ago
rzhxeo 32a35e4418 Add support for http://www.extremetube.com 11 years ago
Philipp Hagemeister 1003d108d5 [vimeo] Support hash in URL (Fixes #1669) 11 years ago
Philipp Hagemeister c1002e96e9 Let extractors omit ext in formats 11 years ago
Philipp Hagemeister 77d0a82fef [addanime] Use new formats system 11 years ago
Philipp Hagemeister ebc14f251c Merge remote-tracking branch 'origin/master' 11 years ago
Philipp Hagemeister d41e6efc85 New debug option --write-pages 11 years ago
Filippo Valsorda 8ffa13e03e [Instagram] get the non-https link, as they are serving an Akamai cert from an instagram.com domain 11 years ago
Filippo Valsorda 750e9833b8 Add the missing age_limit tags; added a devscript to do a superficial check for porn sites without the age_limit tag in the test 11 years ago
Filippo Valsorda 82f0ac657c Merge pull request #1657 by @rzhxeo
[YouPornIE] Extract all encrypted links and remove doubles at the end
11 years ago
Filippo Valsorda eb6a2277a2 Merge pull request #1659 by @rzhxeo
Add support for http://www.tube8.com
11 years ago
Filippo Valsorda f8778fb0fa Merge pull request #1663 by @rzhxeo
Add support for http://www.spankwire.com
11 years ago
Filippo Valsorda e2f9de207c Merge pull request #1664 by @rzhxeo
Add support for http://www.keezmovies.com
11 years ago