442 Commits (231bcd0b6b7737e6b0484ce8aaa4a14de442a1ae)

Author SHA1 Message Date
Sergey M․ eea0716cae
[extractor/common] Print origin country for fake IP 8 years ago
Sergey M․ 336a76551b
[extractor/common] Do not quit _initialize_geo_bypass on empty countries 8 years ago
Sergey M․ dc0a869e5e
[extractor/common] Fix typo 8 years ago
Sergey M․ e39b5d4ab8
[extractor/common] Allow calling _initialize_geo_bypass from extractors (#11970) 8 years ago
Sergey M․ 3ccdde8cb7
[extractor/common] Emphasize geo bypass APIs are experimental 8 years ago
Sergey M․ 4248dad92b Improve geo bypass mechanism
* Rename options so their prefix matches --geo-verification-proxy
* Introduce _GEO_COUNTRIES for extractors
* Implement faking IP right away for sites with known geo restriction
8 years ago
Sergey M․ 0a840f584c Rename bypass geo restriction options 8 years ago
Sergey M․ 0016b84e16 Add faked X-Forwarded-For to formats' HTTP headers 8 years ago
Sergey M․ 773f291dcb Add experimental geo restriction bypass mechanism
Based on faking X-Forwarded-For HTTP header
8 years ago
Yen Chi Hsuan a4a554a793
[generic] Try parsing JWPlayer embedded videos (closes #12030) 8 years ago
Remita Amine 08a00eef79 [extractor/common] skip m3u8 manifests protected with Adobe Flash Access 8 years ago
Sergey M․ 242a14a1f6
[extractor/common] Fix audio only with audio group in m3u8 (closes #11995) 8 years ago
Sergey M․ 2aec7256ae
[extractor/common] Speed-up media tags regex (closes #11979) 8 years ago
Mattias Wadman 45024183ae [infoq] Add audio only format if available (#11565)
* [infoq] Add audio only format if available

Refactor cookie code into a function.
Renamed formats to http_video, http_audio, rtmp_video
Renamed extract functions to video instead of videos as they return
one or no video.

* [infoq] Rename to _extract_cookies as it more than one

* [infoq] Remove redundant determine_ext

* [infoq] Add comment about hardcoded URL

* [infoq] Use _hidden_inputs instead of messy regex

* [infoq] Probe if audio URL is valid

Make it possible to pass headers to _is_valid_url

* [infoq] Add audio only test
8 years ago
Sergey M․ e228616c6e
[extractor/common] Fix initialization template (closes #11605, closes #11825) 8 years ago
Sergey M․ c58c2d63cb
[extractor/common] Document forgotten fragment base and path interfaces 8 years ago
Sergey M․ d04621daf4
[extractor/common] Fix duration per dash segment (closes #11868) 8 years ago
Remita Amine c4251b9aaa [common] add possibility to customize akamai manifest host 8 years ago
Yen Chi Hsuan 4606c34e19 [extractor/common] Allow non-lang in subtitles' keys
See 264e77c406
8 years ago
Remita Amine a07588369f [common] improve detection for video only formats and m3u8 manifest (fixes #11507) 8 years ago
Remita Amine 7fe1592073 [common] fix dash codec information for mixed videos and fragment url construction (#11490) 8 years ago
Sergey M․ 87a449c1ed
[extractor/common] Recognize DASH formats in html5 media entries 8 years ago
Remita Amine 8821a718cf [common] recognize hls manifests that contain video only formats (#11394) 8 years ago
Sergey M․ 4ea4c0bb22
[extractor/common] Fix Bandwidth substitution in media template (closes #11175) 8 years ago
Sergey M․ f076d7972c
[extractor/common] Improve thumbnail extraction from JSON-LD 8 years ago
Sergey M․ c4c9b8440c
[extractor/common] Tolerate malformed RESOLUTION attribute in m3u8 manifests (closes #11113) 8 years ago
Sergey M․ 3365ea8929
[extractor/common] Remove unused code 8 years ago
Sergey M․ 1616f9b452
[extractor/common] Fix typo 8 years ago
Sergey M․ 02dc0a36b7
[utils] Introduce base_url 8 years ago
Remita Amine b2758123c5 add Basic support for Smooth Streaming protocol(#8118) 8 years ago
Sergey M․ b82c33dd67
[extractor/common] Improve mpd base URL extraction (closes #10909, closes #11079) 8 years ago
Remita Amine 0384932e3d [extractor/common] try to extract non smil wowza mpd manifests 8 years ago
Remita Amine edd6074cea [extractor/common] detect f4m audio only formats 8 years ago
Sergey M․ 8d76bdf12b
[extractor/common] Mention podcast in series fields section 8 years ago
Yen Chi Hsuan cea364f70c
[extractor/common] Support HTML media elements without child nodes 8 years ago
Yen Chi Hsuan 98763ee354
[extractor/common] Add id and title helpers for generic IEs 8 years ago
Remita Amine e71a450956 [common] add hdcore sign to akamai f4m formats 8 years ago
Yen Chi Hsuan 5968d7d2fe
[extractor/common] Improved support for HTML5 subtitles
Ref: #10625

In a strict sense, <track>s with kind=captions are not subtitles. [1]
openload misuses this attribute, and I guess there will be more
examples, so I add it to common.py.

Also allow extracting information for subtitles-only <video> or <audio>
tags, which is the case of openload.

[1] https://www.w3.org/TR/html5/embedded-content-0.html#attr-track-kind
8 years ago
Sergey M․ 30d0b549be
[extractor/common] Add manifest_url for hls and hds formats 8 years ago
Sergey M․ 86f4d14f81
Refactor fragments interface and dash segments downloader
- Eliminate segment_urls and initialization_url
+ Introduce manifest_url (a manifest may contain unfragmented data; in this case url is used for the direct media URL and manifest_url for the manifest itself)
* Rewrite dashsegments downloader to use fragments data
* Improve generic mpd extraction
8 years ago
Sergey M․ b4c1d6e800
[extractor/common] Expose fragments interface for dashsegments formats 8 years ago
Sergey M․ a0d5077c8d
[extractor/common] Introduce fragments interface 8 years ago
Remita Amine 6ad0219556 [common] add helper method for Wowza Streaming Engine format extraction 8 years ago
Sergey M․ dcce092e0a
[extractor/common] Simplify _get_netrc_login_info and carry long lines 8 years ago
Sergey M․ 32443dd346
[extractor/common] Update _get_login_info's comment 8 years ago
Sergey M․ 2133565cec
[extractor/common] Simplify _get_login_info 8 years ago
Sergey M․ c849836854
[utils] Improve _hidden_inputs 8 years ago
Remita Amine 1b6712ab23 [adobepass] add specific options for adobe pass authentication
- add --ap-username and --ap-password option to specify
TV provider username and password in the cmd line
- add --ap-retries option to limit the number of retries
- add --list-ap-msi-ids to list the supported TV Providers
8 years ago
Remita Amine 0def758782 [internetvideoarchive] extract all formats 8 years ago
Christian Pointner dedb177029
Fix parsing of HTML5 media elements
This fixes an error in _parse_html5_media_entries in case
an audio or video tag directly uses a src attribute instead
of <source> elements in its body.
8 years ago