Sergey M․
4328ddf82b
[extractor/common] Add support for AMP tags in _parse_html5_media_entries
7 years ago
Sergey M․
c69701c6ab
[extractor/common] Improve _json_ld
7 years ago
Sergey M․
96a2daa1ee
[extractor/common] Improve jwplayer subtitles extraction
8 years ago
Yen Chi Hsuan
6a9cb29509
[extractor/common] Fix json dumping with --geo-bypass
...
The line "[debug] Using fake IP %s (%s) as X-Forwarded-For." was printed
to stdout even with -j/-J, which breaks the resultant JSON.
8 years ago
Sergey M․
0a268c6e11
[extractor/common] Improve jwplayer formats extraction ( closes #13379 )
8 years ago
Sergey M․
1afd0b0da7
[extractor/common] Return unicode string from _match_id
8 years ago
Sergey M․
f2e2f0c777
[extractor/common] Fix rtmp and rtsp formats' URLs in _extract_wowza_formats
8 years ago
Sergey M․
6f76679804
[extractor/common] Add support for schemeless URLs in _extract_wowza_formats ( closes #13088 , closes #13092 )
8 years ago
Sergey M․
76d5a36391
[extractor/common] Respect Width and Height attributes in ISM manifests
8 years ago
Remita Amine
ff6f9a6704
[extractor/common] fix typo in _extract_akamai_formats
8 years ago
Tithen-Firion
c89267d31a
Merge branch 'master' into openload-phantomjs-method
8 years ago
remitamine
55949fede6
[common] introduce chapters field
8 years ago
Sergey M․
33a81c2c6f
[extractor/common] Extract view count from JSON-LD
8 years ago
Sergey M․
c89b49f743
[extractor/common] Add manifest_url for explicit group rendition formats
8 years ago
Sergey M․
ff99fe529e
Don't list master m3u8 playlists in format list ( closes #12832 )
8 years ago
Sergey M․
ac9c69ace7
[extractor/common] Improve jwplayer regex
8 years ago
Tithen-Firion
40e41780f1
[phantomjs] add cookie support
8 years ago
Sergey M․
3019cb0c99
[extractor/common] Rephrase comment
8 years ago
Sergey M․
ddd258f922
[test_InfoExtractor] Add m3u8 parsing test for NAME attribute in EXT-X-STREAM-INF tag
8 years ago
Sergey M․
9c99bef704
[extractor/common] Use float for scaled tbr
8 years ago
Sergey M․
cb2520802d
[extractor/common] Improve m3u8 extraction ( closes #12211 )
...
* Extract m3u8 parsing to separate method
* Improve rendition groups extraction
* Build stream name according stream GROUP-ID
* Ignore reference to AUDIO group without URI when stream has no CODECS
+ Add test coverage for parsing m3u8 from #11507 , #11995 , #12211 and twitch vod
8 years ago
Sergey M․
bae1404893
[extractor/common] Add support for video of WebPage context in _json_ld ( closes #12778 )
8 years ago
Remita Amine
bf1b87cd91
[common] Relax JWPlayer regex and remove duplicate urls( #12768 )
8 years ago
Remita Amine
40fcba5edb
improve coding style
8 years ago
Sergey M․
fd47550885
[extractor/common] Add coding cookie
8 years ago
Sergey M․
4457823dda
[extractor/common] Move censorship checks to a separate method and add check for just another ISP
8 years ago
Random User
4f06c1c9fc
Merge branch 'master' of github.com-rndusr:rg3/youtube-dl into fix/str-item-assignment
8 years ago
Random User
c73e330e7a
_find_jwplayer_data() returns dict or None
...
This simplifies code for callers of `_find_jwplayer_data()` which no longer have
to run `_parse_json()` on the return value.
It also makes sure that `_find_jwplayer_data()` returns either a `dict` or
`None` and nothing else.
8 years ago
John Hawkinson
46b18f2349
[BostonGlobe] New. Nonstandard version of Brightcove.
...
Has a "data-brightcove-video-id" instead of a "data-video-id," otherwise
pretty much just Brightcove. Except the Globe isn't all Brightcove
videos, so fallback to Generic, too.
Also, abstract playlist_from_matches() from generic.py to common.py, and use
it here.
History of these changes can be found in
51170427d4b1143572a498dedaee61863a5b2c5b.
8 years ago
Sergey M․
b51dc9db0e
[extractor/common] Extract SMIL formats from jwplayer
8 years ago
Sergey M․
1a2192cb90
[extractor/common] Pass arguments to _parse_jwplayer_formats and PEP8
8 years ago
Sergey M․
0236cd0dfd
[extractor/common] Improve height extraction and extract bitrate
8 years ago
Sergey M․
ed0cf9b383
[extractor/common] Move jwplayer formats extraction in separate method
8 years ago
Yen Chi Hsuan
eeb0a95684
[extractor/common] Add 'preference' to _parse_html5_media_entries
...
Some websites, like NJPWorld, put different qualities on different
player pages.
8 years ago
Sergey M․
eea0716cae
[extractor/common] Print origin country for fake IP
8 years ago
Sergey M․
336a76551b
[extractor/common] Do not quit _initialize_geo_bypass on empty countries
8 years ago
Sergey M․
dc0a869e5e
[extractor/common] Fix typo
8 years ago
Sergey M․
e39b5d4ab8
[extractor/common] Allow calling _initialize_geo_bypass from extractors ( #11970 )
8 years ago
Sergey M․
3ccdde8cb7
[extractor/common] Emphasize geo bypass APIs are experimental
8 years ago
Sergey M․
4248dad92b
Improve geo bypass mechanism
...
* Rename options to preffixly match with --geo-verification-proxy
* Introduce _GEO_COUNTRIES for extractors
* Implement faking IP right away for sites with known geo restriction
8 years ago
Sergey M․
0a840f584c
Rename bypass geo restriction options
8 years ago
Sergey M․
0016b84e16
Add faked X-Forwarded-For to formats' HTTP headers
8 years ago
Sergey M․
773f291dcb
Add experimental geo restriction bypass mechanism
...
Based on faking X-Forwarded-For HTTP header
8 years ago
Yen Chi Hsuan
a4a554a793
[generic] Try parsing JWPlayer embedded videos ( closes #12030 )
8 years ago
Remita Amine
08a00eef79
[extractor/common] skip m3u8 manifests protected with Adobe Flash Access
8 years ago
Sergey M․
242a14a1f6
[extractor/common] Fix audio only with audio group in m3u8 ( closes #11995 )
8 years ago
Sergey M․
2aec7256ae
[extractor/common] Speed-up media tags regex ( closes #11979 )
8 years ago
Mattias Wadman
45024183ae
[infoq] Add audio only format if available ( #11565 )
...
* [infoq] Add audio only format if available
Refactor cookie code into a function.
Renamed formats to http_video, http_audio, rtmp_video
Renamed extract functions to video instead of videos as they return
one or no video.
* [infoq] Rename to _extract_cookies as it more than one
* [infoq] Remove redundant determine_ext
* [infoq] Add comment about hardcoded URL
* [infoq] Use _hidden_inputs instead of messy regex
* [infoq] Probe if audio URL is valid
Make it possible to pass headers to _is_valid_url
* [infoq] Add audio only test
8 years ago
Sergey M․
e228616c6e
[extractor/common] Fix initialization template ( closes #11605 , closes #11825 )
8 years ago
Sergey M․
c58c2d63cb
[extractor/common] Document forgotten fragment base and path interfaces
8 years ago
Sergey M․
d04621daf4
[extractor/common] Fix duration per dash segment ( closes #11868 )
8 years ago
Remita Amine
c4251b9aaa
[common] add possibility to customize akamai manifest host
8 years ago
Yen Chi Hsuan
4606c34e19
[extractor/common] Allow non-lang in subtitles' keys
...
See 264e77c406
8 years ago
Remita Amine
a07588369f
[common] improve detection for video only formats and m3u8 manifest( fixes #11507 )
8 years ago
Remita Amine
7fe1592073
[common] fix dash codec information for mixed videos and fragment url construction( #11490 )
8 years ago
Sergey M․
87a449c1ed
[extractor/common] Recognize DASH formats in html5 media entries
8 years ago
Remita Amine
8821a718cf
[common] recognize hls manifests that contain video only formats( #11394 )
8 years ago
Sergey M․
4ea4c0bb22
[extractor/common] Fix Bandwidth substitution in media template ( closes #11175 )
8 years ago
Sergey M․
f076d7972c
[extractor/common] Improve thumbnail extraction from JSON-LD
8 years ago
Sergey M․
c4c9b8440c
[extractor/common] Tolerate malformed RESOLUTION attribute in m3u8 manifests ( closes #11113 )
8 years ago
Sergey M․
3365ea8929
[extractor/common] Remove unused code
8 years ago
Sergey M․
1616f9b452
[extractor/common] Fix typo
8 years ago
Sergey M․
02dc0a36b7
[utils] Introduce base_url
8 years ago
Remita Amine
b2758123c5
add Basic support for Smooth Streaming protocol( #8118 )
8 years ago
Sergey M․
b82c33dd67
[extractor/common] Improve mpd base URL extraction ( closes #10909 , closes #11079 )
8 years ago
Remita Amine
0384932e3d
[extractor/common] try to extract non smil wowza mpd manifests
8 years ago
Remita Amine
edd6074cea
[extractor/common] detect f4m audio only formats
8 years ago
Sergey M․
8d76bdf12b
[extractor/common] Mention podcast in series fields section
8 years ago
Yen Chi Hsuan
cea364f70c
[extractor/common] Support HTML media elements without child nodes
8 years ago
Yen Chi Hsuan
98763ee354
[extractor/common] Add id and title helpers for generic IEs
8 years ago
Remita Amine
e71a450956
[common] add hdcore sign to akamai f4m formats
8 years ago
Yen Chi Hsuan
5968d7d2fe
[extractor/common] Improved support for HTML5 subtitles
...
Ref: #10625
In a strict sense, <track>s with kind=captions are not subtitles. [1]
openload misuses this attribute, and I guess there will be more
examples, so I add it to common.py.
Also allow extracting information for subtitles-only <video> or <audio>
tags, which is the case of openload.
[1] https://www.w3.org/TR/html5/embedded-content-0.html#attr-track-kind
8 years ago
Sergey M․
30d0b549be
[extractor/common] Add manifest_url for hls and hds formats
8 years ago
Sergey M․
86f4d14f81
Refactor fragments interface and dash segments downloader
...
- Eliminate segment_urls and initialization_url
+ Introduce manifest_url (manifest may contain unfragmented data in this case url will be used for direct media URL and manifest_url for manifest itself correspondingly)
* Rewrite dashsegments downloader to use fragments data
* Improve generic mpd extraction
8 years ago
Sergey M․
b4c1d6e800
[extractor/common] Expose fragments interface for dashsegments formats
8 years ago
Sergey M․
a0d5077c8d
[extractor/common] Introduce fragments interface
8 years ago
Remita Amine
6ad0219556
[common] add helper method for Wowza Streaming Engine format extraction
8 years ago
Sergey M․
dcce092e0a
[extractor/common] Simplify _get_netrc_login_info and carry long lines
8 years ago
Sergey M․
32443dd346
[extractor/common] Update _get_login_info's comment
8 years ago
Sergey M․
2133565cec
[extractor/common] Simplify _get_login_info
8 years ago
Sergey M․
c849836854
[utils] Improve _hidden_inputs
8 years ago
Remita Amine
1b6712ab23
[adobepass] add specific options for adobe pass authentication
...
- add --ap-username and --ap-password option to specify
TV provider username and password in the cmd line
- add --ap-retries option to limit the number of retries
- add --list-ap-msi-ids to list the supported TV Providers
8 years ago
Remita Amine
0def758782
[internetvideoarchive] extract all formats
8 years ago
Christian Pointner
dedb177029
Fix parsing of HTML5 media elements
...
This fixes an error in _parse_html5_media_entries in case
an audio or video tag directly uses a src attribute insted
of <source> elements in it's body.
8 years ago
Sergey M․
e816c9d158
[extractor/common] Simplify _extract_m3u8_formats
8 years ago
Sergey M․
9250181f37
[extractor/common] Restore NAME usage from EXT-X-MEDIA tag for formats codes in _extract_m3u8_formats ( Closes #10522 )
8 years ago
Remita Amine
f39ffc5877
[common] extract formats from #EXT-X-MEDIA tags
8 years ago
Remita Amine
c7c43a93ba
[common] add helper method to extract akamai m3u8 and f4m formats
8 years ago
Yen Chi Hsuan
ad120ae1c5
[extractor/common] Change the default m3u8 protocol in HTML5
...
Helper functions should have consistent default values
8 years ago
Yen Chi Hsuan
520251c093
[extractor/common] Recognize m3u8 manifests in HTML5 multimedia tags
8 years ago
Remita Amine
2118fdd1a9
[common] add separate method for getting netrc ligin info
8 years ago
Sergey M․
321b5e082a
[extractor/common] Respect default in _search_json_ld
8 years ago
Remita Amine
37768f9242
[common] correctly lower the preference of m3u8 master manifest format
8 years ago
Remita Amine
d16b3c6677
[common] extract partOfTVSeries info in json-ld
8 years ago
Remita Amine
c57244cdb1
[common] lower the preference of m3u8 master manifest format
8 years ago
Sergey M․
46933a15d6
[extractor/common] Support root JSON-LD lists ( Closes #10203 )
8 years ago
Remita Amine
8b40854529
[common] lower proto_preference of rtsp formats
...
Most of the time the RtspFD fail to download videos but it report
success of the download with this output:
[mpv] 0 bytes
[download] 100% of 0.00B
8 years ago
Sergey M․
b070564efb
[extractor/common] Support multiple properties in _og_search_property
8 years ago
Sergey M․
40090e8d51
[extractor/common] Improve is_suitable
...
In order to fix breakage introduced by a3aa814b77
8 years ago
Sergey M․
f094834857
[extractor/common] Add support for $ in SegmentTemplate in MPD manifests
8 years ago
Remita Amine
00f4764cb7
[common] extract vbr, abr and fps for Unified Streaming Platform m3u8 manifests
8 years ago
Yen Chi Hsuan
0b68de3cc1
Merge pull request #8876 from remitamine/html5_media
...
[extractor/common] add helper method to extract html5 media entries
8 years ago
Sergey M․
95b31e266b
[extractor/common] Add expected_type in json ld routines
8 years ago
Sergey M․
6b3a3098b5
[extractor/common] Extract more metadata for VideoObject in _json_ld
8 years ago
Sergey M․
2de624fdd5
[extractor/common] Introduce filesize metafield for thumbnails
8 years ago
Déstin Reed
3fee7f636c
[flipagram] Add extractor
8 years ago
Yen Chi Hsuan
38cce791c7
Rename --cn-verfication-proxy to --geo-verification-proxy
...
And deprecate the former one
Since commit f138873900
, this option is
not limited to China websites, so rename it.
8 years ago
Sergey M․
bf8dd79045
[extractor/common] Fix sorting with custom field preference
9 years ago
remitamine
59bbe4911a
[extractor/common] add helper method to extract html5 media entries
9 years ago
Sergey M․
88d9f6c0c4
[utils] Add support for name list in _html_search_meta
9 years ago
remitamine
e154c65128
[downloader/hls] Add support for AES-128 encrypted segments in hlsnative downloader
9 years ago
Yen Chi Hsuan
a6571f1073
[common] Fix <bootstrapInfo> detection in F4M manifests
...
Regression since 0a5685b26f
9 years ago
Sergey M․
77b8b4e696
[extractor/common] Borrow quality metadata from parent set-level manifest for f4m
9 years ago
Yen Chi Hsuan
fac2af3c51
[common] Fix m3u8 extraction in f4m manifests
9 years ago
Yen Chi Hsuan
448bb5f333
[common] Fix non-bootstrapped support in f4m
9 years ago
Yen Chi Hsuan
240b60453e
[common] Support m3u8 in f4m manifests
...
Related: #9531
9 years ago
Yen Chi Hsuan
0a5685b26f
[common] Support non-bootstraped streams in f4m manifests
...
Related: #9531
9 years ago
Yen Chi Hsuan
7b2fcbfd4e
[common] Skip TYPE=CLOSED-CAPTIONS lines in m3u8 manifests
...
According to [1], valid values for TYPE are AUDIO, VIDEO, SUBTITLES
and CLOSED-CAPTIONS. Such a value is found in Anvato master playlists,
though I don't use _extract_m3u8_formats() in the end.
Part of #9522 .
[1] https://tools.ietf.org/html/draft-pantos-http-live-streaming-19#section-4.3.4.1
9 years ago
Yen Chi Hsuan
16da9bbc29
[common] Add _m3u8_meta_format() template
...
For extractors who handle m3u8 manifests by themselves. (eg., AnvatoIE)
Part of #9522
9 years ago
Yen Chi Hsuan
ad96b4c8f5
[common] Extract audio formats in SMIL
...
Found in http://www.cbc.ca/player/play/2657631896
Closes #5156
9 years ago
Sergey M․
ed56f26039
[extractor/common] Improve name extraction for m3u8 formats
9 years ago
Sergey M․
8a92e51c60
[extractor/common] Relax wording for creator metafield
9 years ago
Yen Chi Hsuan
e9c6cdf4a1
[common] Fix format_id construction for HLS
9 years ago
Kagami Hiiragi
b24d6336a7
[vlive] Add support for live videos
9 years ago
Yen Chi Hsuan
d6712378e7
Merge branch 'akamai_pv' of https://github.com/remitamine/youtube-dl into remitamine-akamai_pv
9 years ago
remitamine
fb72ec58ae
[extractor/common] do not process f4m manifest that contain akamai playerVerificationChallenge
9 years ago
Yen Chi Hsuan
2c0d9c6217
[extractor/common] Allow empty post data
9 years ago
Sergey M․
49caf3307f
[extractor/common] Remove irrelevant comment
9 years ago
Sergey M․
bacec0397f
[extractor/common] Relax _hidden_inputs
9 years ago
Sergey M․
fb38aa8b53
[extractor/common] Support arbitrary format strings for template based identifiers in mpd manifests ( Closes #9119 , closes #9120 )
9 years ago
Sergey M․
7a93ab5f3f
[extractor/common] Introduce music album metafields
9 years ago
Sergey M․
b507cc925b
[extractor/common] Carry long line
9 years ago
Sergey M․
db8ee7ec05
[extractor/common] Fix numeric identifiers conversion in DASH URL templates
9 years ago
remitamine
df634be2ed
[common] prefer using mime type over ext for smil subtitle extraction
...
the subtitle ext for http://www.cnet.com/videos/download-amazon-prime-movies-and-tv/
is adb_xml while using the mime type it get tt(application/smptett+xml)
9 years ago
Sergey M․
41d06b0424
[extractor/common] Improve _request_webpage
...
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
Sergey M․
b22ca76204
[extractor/common] Filter out unsupported encrypted media for f4m formats ( Closes #8573 )
9 years ago
Sergey M․
19dbaeece3
Remove _sort_formats from _extract_*_formats methods
...
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.
Closes #8051
9 years ago
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
...
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
9 years ago
remitamine
49dea4913b
Merge pull request #8513 from remitamine/dash-sort
...
[extractor/common] fix dash formats sorting
9 years ago
Sergey M․
0fdbb3322b
[extractor/common] Add _parse_f4m_formats routine
9 years ago
remitamine
09f572fbc0
[extractor/common] add transform_source to _download_smil and _extract_smil_formats
9 years ago
remitamine
15bf934de5
Merge pull request #8819 from remitamine/simple-webpage-requests
...
[extractor/common] simplify using data, headers and query params with _download_* methods
9 years ago
remitamine
cdfee16818
[extractor/common] add data, headers and query params to _request_webpage
9 years ago
Yen Chi Hsuan
a6c8b75904
[common] Use mimeType to determine file extensions ( #8766 )
9 years ago
Yen Chi Hsuan
64f08d4ff2
Merge pull request #8766 from yan12125/dash-detect-ext
...
Detect file extensions of DASH formats from their codecs
9 years ago
Yen Chi Hsuan
af7d5a63b2
[common] Document protocol http_dash_segments
9 years ago
Yen Chi Hsuan
2def60c5f3
[common] Use codec2ext for DASH formats ( #8764 )
9 years ago
Yen Chi Hsuan
e9c0cdd389
[jython] Introduce compat_os_name
...
os.name is always 'java' on Jython
9 years ago
Sergey M․
7bcd2830dd
[extractor/common] Document uploader_url
9 years ago
Sergey M․
2bc0c46f98
[extractor/common] Document license metafield
9 years ago