Sergey M․
41d06b0424
[extractor/common] Improve _request_webpage
...
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
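A minimal sketch of what commit 41d06b0424 describes: data, headers and query are applied to the request, and headers and query default to dicts. The helper name `build_request` and the merge strategy are assumptions made for illustration, not the code from `_request_webpage`.

```python
from urllib.parse import parse_qs, urlencode, urlparse, urlunparse
from urllib.request import Request


def build_request(url, data=None, headers=None, query=None):
    """Hypothetical helper: build a Request honoring data, headers and query."""
    # Fall back to empty dicts, mirroring the dict defaults the commit mentions.
    headers = dict(headers or {})
    if query:
        parts = urlparse(url)
        # Keep parameters already present in the URL and add the new ones.
        merged = parse_qs(parts.query)
        merged.update(query)
        url = urlunparse(parts._replace(query=urlencode(merged, doseq=True)))
    return Request(url, data=data, headers=headers)


req = build_request(
    'http://example.com/api?a=1',
    headers={'X-Test': 'yes'},
    query={'b': '2'},
)
print(req.full_url)
```

With a non-empty `query`, the parameters are appended to whatever query string the URL already carries rather than replacing it.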
Sergey M․
b22ca76204
[extractor/common] Filter out unsupported encrypted media for f4m formats ( Closes #8573 )
9 years ago
Sergey M․
19dbaeece3
Remove _sort_formats from _extract_*_formats methods
...
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.
Closes #8051
9 years ago
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
...
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
9 years ago
remitamine
49dea4913b
Merge pull request #8513 from remitamine/dash-sort
...
[extractor/common] fix dash formats sorting
9 years ago
Sergey M․
0fdbb3322b
[extractor/common] Add _parse_f4m_formats routine
9 years ago
remitamine
09f572fbc0
[extractor/common] add transform_source to _download_smil and _extract_smil_formats
9 years ago
remitamine
15bf934de5
Merge pull request #8819 from remitamine/simple-webpage-requests
...
[extractor/common] simplify using data, headers and query params with _download_* methods
9 years ago
remitamine
cdfee16818
[extractor/common] add data, headers and query params to _request_webpage
9 years ago
Yen Chi Hsuan
a6c8b75904
[common] Use mimeType to determine file extensions ( #8766 )
9 years ago
Yen Chi Hsuan
64f08d4ff2
Merge pull request #8766 from yan12125/dash-detect-ext
...
Detect file extensions of DASH formats from their codecs
9 years ago
Yen Chi Hsuan
af7d5a63b2
[common] Document protocol http_dash_segments
9 years ago
Yen Chi Hsuan
2def60c5f3
[common] Use codec2ext for DASH formats ( #8764 )
9 years ago
Yen Chi Hsuan
e9c0cdd389
[jython] Introduce compat_os_name
...
os.name is always 'java' on Jython
9 years ago
Sergey M․
7bcd2830dd
[extractor/common] Document uploader_url
9 years ago
Sergey M․
2bc0c46f98
[extractor/common] Document license metafield
9 years ago
Sergey M․
d77ab8e255
Add --mark-watched feature ( Closes #5054 )
9 years ago
Sergey M․
9cdffeeb3f
[extractor/common] Clarify rationale on media playlist detection
9 years ago
Sergey M․
fbb6edd298
[extractor/common] Properly extract audio only formats in master m3u8 playlists
9 years ago
Sergey M․
f5bdb44443
[extractor/common] Add _remove_duplicate_formats
9 years ago
remitamine
cafcf657a4
add more subtitles mime types to mimetype2ext and fix the platform subtitle extraction
9 years ago
Sergey M․
611c1dd96e
[refactor] Single quotes consistency
9 years ago
Sergey M․
d800609c62
[refactor] Do not specify redundant None as second argument in dict.get()
9 years ago
Sergey M․
bb20526b64
[extractor/common] Improve base url construction
9 years ago
remitamine
c349456ef6
[extractor/common] strip http urls in smil manifest
9 years ago
remitamine
81e1c4e2fc
[extractor/common] remove duplicate rtmp formats in smil manifest
9 years ago
remitamine
dd86780596
[extractor/common] fix dash formats sorting
9 years ago
remitamine
154c209e2d
[extractor/common] improve dash format ids
9 years ago
remitamine
51e9094f4a
[extractor/common] extract youtube dash formats filesize ( fixes #8480 )
9 years ago
remitamine
d413095f7e
[extractor/common] remove duplicated formats and subtitles in smil manifests
9 years ago
remitamine
6a3828fddd
[common] use float conversion instead of using division from __future__
9 years ago
remitamine
91cb6b5065
rename _parse_mpd to _parse_mpd_formats and add default value for mpd namespace
9 years ago
remitamine
0826a0b555
[common] sort dash formats
9 years ago
remitamine
255732f0d3
[common] fix segment duration calculation
9 years ago
remitamine
53c269c6fd
[common] fix media_template string formatting
9 years ago
remitamine
675d001633
[common] skip drm protected dash formats
9 years ago
remitamine
d577c79632
[common] ignore ISO 639-2 generic codes
9 years ago
remitamine
f14be22816
[common] remove duplicate reference to namespace
9 years ago
remitamine
9c74423510
[common] fix media template regex
9 years ago
remitamine
1bac34556f
[common] add a generic support for mpd manifests
9 years ago
Yen Chi Hsuan
2d2fa82d17
[common] Add _extract_dash_manifest_formats
9 years ago
Yen Chi Hsuan
c94678957f
[common] Remove unused arguments
9 years ago
Yen Chi Hsuan
16f38a699f
[common] Rename to namespace
...
For consistency with _parse_smil_*
9 years ago
Yen Chi Hsuan
df374b5222
[common] Prefer the manifest over formats_dict in determining codecs
9 years ago
Yen Chi Hsuan
5ea1eb78f5
[common] Fix for youtube
9 years ago
Yen Chi Hsuan
b323e1707d
[common] Modify _parse_dash_manifest for use in Facebook
9 years ago
Yen Chi Hsuan
17b598d30c
[common] _parse_dash_manifest() from youtube.py
9 years ago
Sergey M․
350cf045d8
[extractor/common] Restrict checks when auto calculating tbr
9 years ago
remitamine
a9d5f12fec
Merge pull request #8328 from remitamine/hls-master-detect
...
[extractor/common] detect media playlist in _extract_m3u8_formats
9 years ago
remitamine
7f32e5dc35
[extractor/common] detect media playlist in _extract_m3u8_formats
9 years ago
Sergey M․
b0d21deda9
[extractor/common] Auto calculate tbr when missing
9 years ago
Yen Chi Hsuan
77f785076f
[common] Keep full codec name from m3u8 manifests
...
See #8293 . This is for consistency between YouTube and HLS formats.
9 years ago
Yen Chi Hsuan
0b26ba3fc8
[extractor/common] Allow passing more parameters to _search_json_ld
9 years ago
Sergey M․
4ca2a3cf3c
[extractor/common] Add initial support for JSON-LD metadata extraction into info_dict
9 years ago
Jakub Wilk
dfb1b1468c
Fix typos
...
Closes #8200 .
9 years ago
Sergey M
3f3343cd3e
Merge pull request #8061 from dstftw/introduce-chapter-and-series-fields
...
Introduce chapter and series fields
9 years ago
Sergey M․
27bfd4e526
[extractor/common] Introduce number fields for chapters and series
9 years ago
Philipp Hagemeister
32f9036447
[ccc] Add language information to formats
9 years ago
Sergey M․
7109903e61
[extractor/common] Document chapter and series fields
9 years ago
Sergey M․
7e5edcfd33
Simplify formats accumulation for f4m/m3u8/smil formats
...
Now all _extract_*_formats routines return a list
9 years ago
remitamine
39d60b715a
Merge pull request #7769 from remitamine/sort
...
[common] lower (m3u8,rtmp,rtsp) format preference only if required program is not available
9 years ago
remitamine
d497a201ca
[common] use specific variable for protocol preference in _sort_formats
9 years ago
remitamine
8d29e47f54
[common] simplify the use of _extract_m3u8_formats and _extract_f4m_formats
9 years ago
Sergey M․
9b9c5355e4
Rename error_to_str to error_to_compat_str
9 years ago
Sergey M․
7f8b271465
Properly convert errors to strings
9 years ago
Sergey M․
dd85e4d707
[extractor/common] Properly decode error string on python 2 ( Closes #1354 , closes #3957 , closes #4037 , closes #6449 )
9 years ago
Sergey M․
62d231c004
[extractor/common] Clarify duration can be float
9 years ago
Sergey M․
5c2266df4b
Switch codebase to use sanitized_Request instead of
...
compat_urllib_request.Request
[downloader/dash] Use sanitized_Request
[downloader/http] Use sanitized_Request
[atresplayer] Use sanitized_Request
[bambuser] Use sanitized_Request
[bliptv] Use sanitized_Request
[brightcove] Use sanitized_Request
[cbs] Use sanitized_Request
[ceskatelevize] Use sanitized_Request
[collegerama] Use sanitized_Request
[extractor/common] Use sanitized_Request
[crunchyroll] Use sanitized_Request
[dailymotion] Use sanitized_Request
[dcn] Use sanitized_Request
[dramafever] Use sanitized_Request
[dumpert] Use sanitized_Request
[eitb] Use sanitized_Request
[escapist] Use sanitized_Request
[everyonesmixtape] Use sanitized_Request
[extremetube] Use sanitized_Request
[facebook] Use sanitized_Request
[fc2] Use sanitized_Request
[flickr] Use sanitized_Request
[4tube] Use sanitized_Request
[gdcvault] Use sanitized_Request
[extractor/generic] Use sanitized_Request
[hearthisat] Use sanitized_Request
[hotnewhiphop] Use sanitized_Request
[hypem] Use sanitized_Request
[iprima] Use sanitized_Request
[ivi] Use sanitized_Request
[keezmovies] Use sanitized_Request
[letv] Use sanitized_Request
[lynda] Use sanitized_Request
[metacafe] Use sanitized_Request
[minhateca] Use sanitized_Request
[miomio] Use sanitized_Request
[meovideo] Use sanitized_Request
[mofosex] Use sanitized_Request
[moniker] Use sanitized_Request
[mooshare] Use sanitized_Request
[movieclips] Use sanitized_Request
[mtv] Use sanitized_Request
[myvideo] Use sanitized_Request
[neteasemusic] Use sanitized_Request
[nfb] Use sanitized_Request
[niconico] Use sanitized_Request
[noco] Use sanitized_Request
[nosvideo] Use sanitized_Request
[novamov] Use sanitized_Request
[nowness] Use sanitized_Request
[nuvid] Use sanitized_Request
[played] Use sanitized_Request
[pluralsight] Use sanitized_Request
[pornhub] Use sanitized_Request
[pornotube] Use sanitized_Request
[primesharetv] Use sanitized_Request
[promptfile] Use sanitized_Request
[qqmusic] Use sanitized_Request
[rtve] Use sanitized_Request
[safari] Use sanitized_Request
[sandia] Use sanitized_Request
[shared] Use sanitized_Request
[sharesix] Use sanitized_Request
[sina] Use sanitized_Request
[smotri] Use sanitized_Request
[sohu] Use sanitized_Request
[spankwire] Use sanitized_Request
[sportdeutschland] Use sanitized_Request
[streamcloud] Use sanitized_Request
[streamcz] Use sanitized_Request
[tapely] Use sanitized_Request
[tube8] Use sanitized_Request
[tubitv] Use sanitized_Request
[twitch] Use sanitized_Request
[twitter] Use sanitized_Request
[udemy] Use sanitized_Request
[vbox7] Use sanitized_Request
[veoh] Use sanitized_Request
[vessel] Use sanitized_Request
[vevo] Use sanitized_Request
[viddler] Use sanitized_Request
[videomega] Use sanitized_Request
[viewvster] Use sanitized_Request
[viki] Use sanitized_Request
[vk] Use sanitized_Request
[vodlocker] Use sanitized_Request
[voicerepublic] Use sanitized_Request
[wistia] Use sanitized_Request
[xfileshare] Use sanitized_Request
[xtube] Use sanitized_Request
[xvideos] Use sanitized_Request
[yandexmusic] Use sanitized_Request
[youku] Use sanitized_Request
[youporn] Use sanitized_Request
[youtube] Use sanitized_Request
[patreon] Use sanitized_Request
[extractor/common] Remove unused import
[nfb] PEP 8
9 years ago
Sergey M․
019839faaa
[extractor/common] Use baseURL from f4m manifest for recursive manifest extraction
9 years ago
Sergey M
30eecc6a04
Merge pull request #7296 from jaimeMF/xml_attrib_unicode
...
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (…
9 years ago
Sergey M․
dbd82a1d4f
[extractor/common] Fix m3u8 extraction on failure
9 years ago
Sergey M․
dc519b5421
[extractor/common] Make ie_key and IE_NAME return unicode string
9 years ago
Jaime Marquínez Ferrándiz
36e6f62cd0
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x ( #7178 )
...
Attributes aren't unicode objects, so they couldn't be directly used in info_dict fields (for example '--write-description' doesn't work with bytes).
9 years ago
remitamine
3711304510
[extractor/common] get the redirected m3u8_url in _extract_m3u8_formats
9 years ago
Jaime Marquínez Ferrándiz
865d1fbafc
[extractor/common] Remove unused import
9 years ago
Sergey M․
943a1e24b8
[extractor/common] Use more generic URLError in _is_valid_url
9 years ago
Sergey M․
02835c6bf4
[extractor/common] Document repost_count
9 years ago
Sergey M․
448ef1f31c
[extractor/common] Allow angle brackets in attributes in _og_regexes ( #7215 )
9 years ago
Sergey M․
7a6d76a64d
[extractor/common] Require closing quote in _og_regexes ( Closes #7174 )
...
E.g. do not match `property='og:video:type'` when `og:video` is requested.
9 years ago
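The effect of requiring the closing quote (commit 7a6d76a64d) can be sketched as follows. Both patterns below are simplified stand-ins written for this illustration, not the actual `_og_regexes` expressions.

```python
import re


def loose_og_pattern(prop):
    # Hypothetical pattern without a closing quote: matches any property
    # that merely starts with the requested name.
    return r'''property=["']og:%s''' % re.escape(prop)


def strict_og_pattern(prop):
    # Hypothetical pattern that requires the same quote character to close
    # the value right after the requested name.
    return r'''property=(["'])og:%s\1''' % re.escape(prop)


html = "<meta property='og:video:type' content='video/mp4'>"

# The loose pattern wrongly matches og:video:type when og:video is requested.
print(bool(re.search(loose_og_pattern('video'), html)))
# The strict pattern does not.
print(bool(re.search(strict_og_pattern('video'), html)))
```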
Sergey M․
4180a3d8b7
[extractor/common] Allow quoteless content attribute in og regexes ( Closes #7115 )
9 years ago
Yen Chi Hsuan
57935b2564
[extractor/common] Allow HTML5 unquoted attribute values
...
Fixes #7108
HTML5 allows unquoted attribute values. See the "Unquoted attribute value
syntax" section [1] for more information
[1] http://www.w3.org/TR/html5/syntax.html
9 years ago
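As a rough illustration of commit 57935b2564, an attribute matcher that also accepts HTML5 unquoted values might look like the sketch below. The names `ATTR_VALUE` and `get_attr` are hypothetical, and the unquoted branch only approximates the character set the HTML5 syntax section excludes.

```python
import re

# Double-quoted, single-quoted, or unquoted value. The unquoted form stops at
# whitespace, quotes, '=', '<', '>' and the backtick.
ATTR_VALUE = r'''(?:"(?P<dq>[^"]*)"|'(?P<sq>[^']*)'|(?P<uq>[^\s"'=<>`]+))'''


def get_attr(html, name):
    """Hypothetical helper: return the first value of attribute `name`, or None."""
    m = re.search(r'\b%s\s*=\s*%s' % (re.escape(name), ATTR_VALUE), html)
    if m is None:
        return None
    # Exactly one of the three named groups took part in the match.
    return next(v for v in m.group('dq', 'sq', 'uq') if v is not None)


print(get_attr('<input type=hidden name=token value=abc123>', 'value'))
```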
Sergey M․
4bba371644
[YoutubeDL] Autocalculate ext for subtitles when missing
9 years ago
Sergey M․
e5851b963a
[extractor/common] Make f4m extraction for SMIL non fatal
9 years ago
Sergey M․
4de6131090
[extractor/common] Add fatal to _extract_f4m_formats
9 years ago
Sergey M․
3a1341a7bc
[extractor/common] Make m3u8 extraction for SMIL non fatal
9 years ago
Sergey M․
c78e48177c
[extractor/common] Check validity of direct URLs
9 years ago
Sergey M․
647eab4541
[extractor/common] Extract upload date from SMIL
9 years ago
Sergey M․
1e5bcdec02
[extractor/common] Extract images from SMIL
9 years ago
Sergey M․
e7d8e98a9f
[extractor/common] Allow float bitrates
9 years ago
Sergey M․
8aab976bbd
[extractor/common] Document release_date field
9 years ago
Sergey M․
c430802e32
[extractor/common] Add raise_geo_restricted
9 years ago
Sergey M․
586f1cc532
[extractor/common] Skip html comment tags ( Closes #6822 )
9 years ago
Sergey M․
73eb13dfc7
[extractor/common] Case insensitive inputs extraction
9 years ago
Sergey M․
be0e5dbd83
[extractor/common] Extract submit inputs
9 years ago
Sergey M․
43e7d3c945
[extractor/common] Add raise_login_required
9 years ago
Jaime Marquínez Ferrándiz
8c97f81943
[common] Follow convention of using 'cls' in classmethods
9 years ago
Yen Chi Hsuan
f738dd7b7c
[common] Remove debugging code
9 years ago
Yen Chi Hsuan
912e0b7e46
[common] Add _merge_subtitles()
9 years ago
Yen Chi Hsuan
03bc7237ad
[common] _parse_smil_subtitles: accept `lang` as the subtitle language
9 years ago
Sergey M․
5cdefc4625
[extractor/common] Add more subtitle mime types for guess when ext is missing
9 years ago
Sergey M․
ce00af8767
[extractor/common] Add default subtitles lang
9 years ago
Yen Chi Hsuan
f877c6ae5a
[theplatform] Use InfoExtractor._parse_smil_formats()
9 years ago
Sergey M․
e64b756943
[extractor/common] Interactive TFA code input
9 years ago
Sergey M․
201ea3ee8e
[extractor/common] Improve _hidden_inputs
9 years ago
Sergey M․
8b9848ac56
[extractor/common] Expand meta regex
9 years ago
Sergey M․
942acef594
[extractor/common] Extract _parse_xspf
9 years ago
Sergey M․
98044462b1
[extractor/common] Use playlist id as default title
9 years ago
Sergey M․
e0b9d78fab
[extractor/common] Clarify playlists can have description field
9 years ago
Sergey M․
8d6765cf48
[extractor/generic] Add generic support for xspf playlist extraction
9 years ago
Sergey M.
d5d7bdaeb5
Merge pull request #6428 from dstftw/improve-generic-smil-support
...
Improve generic SMIL support
9 years ago
Sergey M․
5b0c40da24
[extractor/common] Expand meta regex
9 years ago
Sergey M․
17712eeb19
[extractor/common] Extract namespace parse routine
9 years ago
Sergey M․
41c3a5a7be
[extractor/common] Fix python 3
9 years ago
Sergey M․
a107193e4b
[extractor/common] Extract f4m and m3u8 formats, subtitles and info
9 years ago
remitamine
799207e838
[viewster] extract the api auth token
...
Closes #6406 .
9 years ago
Sergey M․
864f24bd2c
[extractor/common] Add _meta_regex and clarify tags field
9 years ago
Purdea Andrei
5316bf7487
Documented tags as a possible dict key
9 years ago
Sergey M․
10952eb2cf
[extractor/common] Consistent URL spelling
9 years ago
Jaime Marquínez Ferrándiz
297a564bee
[youtube] Extract end_time
9 years ago
Jaime Marquínez Ferrándiz
7c80519cbf
[youtube] Extract start_time
...
From the 't=*' in the url.
Currently youtube-dl doesn't use the value, but it was requested for the mpv plugin.
9 years ago
Sergey M․
74fe23ec35
[extractor/common] Style
10 years ago
Yen Chi Hsuan
a38436e889
[extractor/common] Add 'transform_source' parameter to _extract_f4m_formats()
10 years ago
Sergey M․
31c746e5dc
[extractor/common] Keep going if some media_url is missing
10 years ago
Sergey M․
70f0f5a8ca
[extractor/common] Recursively extract child f4m manifests
10 years ago
Sergey M․
cc357c4db8
[extractor/common] Properly handle full URLs
10 years ago
Sergey M․
97f4aecfc1
[extractor/common] Handle malformed f4m manifests
10 years ago
Sergey M․
cf61d96df0
[extractor/common] Add _form_hidden_inputs
10 years ago
Sergey M․
f8da79f828
[extractor/common] Improve _form_hidden_inputs and rename to _hidden_inputs
10 years ago
Sergey M․
27713812a0
[extractor/common] Add method for extracting form hidden input fields as dict
10 years ago
Yen Chi Hsuan
13af92fdc4
[common] Add 'fatal' to _extract_m3u8_formats
10 years ago
Sergey M․
5414623791
[extractor/common] Remove superfluous line
10 years ago
Sergey M․
c342041fba
[extractor/common] Use NO_DEFAULT from utils
10 years ago
Yen Chi Hsuan
621ed9f5f4
[common] Add note and errnote field for _extract_m3u8_formats
10 years ago
Sergey M․
baa43cbaf0
[extractor/common] Relax valid url check verbosity
10 years ago
Yen Chi Hsuan
c1c924abfe
[utils,common] Merge format_srt_time and _subtitles_timecode
...
format_srt_time uses a comma as the delimiter between seconds and
milliseconds while _subtitles_timecode uses a dot. All .srt examples I
found on the Internet use a comma, so I use a comma in the merged
version. See http://matroska.org/technical/specs/subtitles/srt.html and
http://devel.aegisub.org/wiki/SubtitleFormats/SRT
10 years ago
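The comma-delimited timecode that commit c1c924abfe settles on can be sketched like this. The function name `format_srt_timecode` is an assumption for illustration and is not claimed to be the merged helper's actual name or implementation.

```python
def format_srt_timecode(seconds):
    """Hypothetical formatter: seconds (int or float) -> 'HH:MM:SS,mmm'."""
    # Work in whole milliseconds to avoid float remainders in the fields.
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3600 * 1000)
    minutes, rem = divmod(rem, 60 * 1000)
    secs, millis = divmod(rem, 1000)
    # SRT separates seconds and milliseconds with a comma, not a dot.
    return '%02d:%02d:%02d,%03d' % (hours, minutes, secs, millis)


print(format_srt_timecode(3661.5))  # 01:01:01,500
```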
Yen Chi Hsuan
05d5392cda
[common] Ignore subtitles in m3u8
10 years ago
Sergey M․
74f728249f
[extractor/common] Fallback to empty string for (yet) missing `format_id` in `_sort_formats` ( Closes #5624 )
10 years ago
Jaime Marquínez Ferrándiz
2ddcd88129
Remove code that was only used by the Grooveshark extractor
10 years ago
zouhair
cf0649f8b7
Typo: twice "the the" to "the"
10 years ago
Sergey M․
3ded7bac16
[extractor/common] Add ability to specify custom field preference for `_sort_formats`
10 years ago
Jaime Marquínez Ferrándiz
08f2a92c9c
InfoExtractor._search_regex: Suggest updating when the regex is not found (suggested in #5442 )
...
Reuse the same message from ExtractorError
10 years ago
Yen Chi Hsuan
c9a779695d
[extractor/common] Add the encoding parameter
...
The QQMusic info extractor needs a forced encoding to work correctly.
10 years ago
Sergey M․
830d53bfae
[utils] Add `video_title` for `url_result`
10 years ago
Sergey M․
e21a55abcc
[extractor/common] Remove f4m section
...
It's now provided by `f4m_id`
10 years ago
Sergey M․
4a34f69ea6
[extractor/common] Add subtitles timecode formatter
10 years ago
Sergey M․
f207019ce5
[extractor/common] Remove 'm3u8' from quality selection URL
10 years ago
Sergey M․
8dc9d361c2
[extractor/common] Fix format_id when `last_media` is None and always include `m3u8_id` if present
...
The rationale behind `m3u8_id` was to resolve duplicates when processing several m3u8 playlists within the same media that give equal resulting `format_id`'s,
e.g. `youtube-dl http://www.rts.ch/play/tv/passe-moi-les-jumelles/video/la-fee-des-bois-mustang-les-chemins-du-vent?id=3854925 -F`
10 years ago
Philipp Hagemeister
a0bb7c5593
[extractor/common] Improve m3u format IDs ( #5143 )
10 years ago
Sergey M․
2f0f6578c3
[extractor/common] Assume non HTTP(S) URLs valid
10 years ago
Philipp Hagemeister
72a406e7aa
[extractor/common] Pass in video_id ( #5057 )
10 years ago