Commit Graph

15350 Commits (4c47858c0584f5e38904871f8543f7271d703cc2)

Author SHA1 Message Date
xarantolus 4c47858c05 Fix regex for other variable declaration type
This now supports declarations like `window["ytInitialData"] = ...` and `var ytInitialData = ...`
4 years ago
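A minimal sketch of a pattern that accepts both declaration forms named in the message above. The regex and the helper name `extract_initial_data` are illustrative assumptions, not the code from this commit, and the non-greedy match would break on a JSON string that itself contains `};`.

```python
import json
import re

# Hypothetical pattern: accepts `window["ytInitialData"] = {...};`
# as well as `var ytInitialData = {...};`.
_INITIAL_DATA_RE = re.compile(
    r'(?:window\s*\[\s*["\']ytInitialData["\']\s*\]|var\s+ytInitialData)'
    r'\s*=\s*(\{.+?\})\s*;',
    re.DOTALL)


def extract_initial_data(webpage):
    """Return the parsed ytInitialData object, or None if it is absent."""
    match = _INITIAL_DATA_RE.search(webpage)
    if match is None:
        return None
    return json.loads(match.group(1))
```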
xarantolus 94255fa0b1 [youtube] More general title extraction
Seems like this attribute is moved every few weeks, so we just extract both and use the one that is present.
4 years ago
xarantolus 7d743516b5 [youtube] Make `ytcfg.set` config extraction non-fatal
If the markup of the page changes in the future, _FEED_DATA might still work while the other regex does not. Since the config is not necessary for the first page of videos, we make sure the program doesn't exit before extracting them.

TL;DR: Extract the first video page even if there are problems
4 years ago
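The non-fatal behaviour described above could look roughly like the following sketch. The regex, the `fatal` parameter, and the function names `extract_ytcfg` and `extract_feed` are assumptions made for illustration, not the extractor's actual code.

```python
import json
import re
import warnings

# Hypothetical pattern for the argument of a `ytcfg.set({...});` call.
_YTCFG_RE = re.compile(r'ytcfg\.set\s*\(\s*(\{.+?\})\s*\)\s*;', re.DOTALL)


def extract_ytcfg(webpage, fatal=False):
    """Return the ytcfg.set config dict, or None when it cannot be found."""
    match = _YTCFG_RE.search(webpage)
    if match is not None:
        try:
            return json.loads(match.group(1))
        except ValueError:
            pass  # malformed JSON is treated like a missing config
    if fatal:
        raise ValueError('unable to extract ytcfg.set config')
    warnings.warn('unable to extract ytcfg.set config; '
                  'only the first page of videos is available')
    return None


def extract_feed(webpage, first_page_ids):
    """Keep the first page of ids even if the config is missing."""
    config = extract_ytcfg(webpage)
    return {'ids': list(first_page_ids), 'can_continue': config is not None}
```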
xarantolus a5e386d9fe Fix python2 compatibility and title extraction 4 years ago
xarantolus 4f37c60bf5 Run formatter 4 years ago
xarantolus c449f70965 [youtube] Fix feed extraction
In order to extract videos from further pages, we need to get various variables that are in an argument to the `ytcfg.set` call in a script on the feed page.
4 years ago
Sergey M․ de722d3cd7 release 2020.07.28 4 years ago
Sergey M․ 49004a6b59 [youtube] Fix sigfunc name extraction (closes #26134, closes #26135, closes #26136, closes #26137) 4 years ago
MRWITEK 54ffcbb8eb [youtube] Improve description extraction (closes #25937) (#25980) 4 years ago
Sergey M․ 9fa728f4e8 [wistia] Restrict embed regex (closes #25969) 4 years ago
Glenn Slayden bb2c950b8e [youtube] Prevent excess HTTP 301 (#25786) 4 years ago
Sergey M․ 255f31b5cb [youtube:playlists] Extend _VALID_URL (closes #25810) 4 years ago
Remita Amine 2bd94127a2 [bellmedia] add support for cp24.com clip URLs (closes #25764) 4 years ago
xarantolus 7fa0a67cc1 Remove unused variable 4 years ago
xarantolus c37ca4732b [youtube] Remove download limit 4 years ago
xarantolus 5cbe7563be [youtube] Return to old feed extraction code as it *seems* like that change was reverted
The old code now works again, but it downloads without limit, so a limit of 1000 videos is added. It can be overridden with the `--max-downloads` option, in which case only as many ids are extracted as videos are downloaded.
5 years ago
xarantolus 6a3cc89394 [youtube] Make search extraction less dependent on json schema.
If an object looks like a video (it has a `videoId` key), assume that it is.
5 years ago
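The heuristic in the message above (any object with a `videoId` key is treated as a video) can be sketched as a recursive walk over the parsed JSON. The function name `find_video_ids` and the de-duplication step are assumptions for illustration, not the commit's code.

```python
def find_video_ids(node, seen=None):
    """Collect every `videoId` string found anywhere in a JSON-like tree.

    Ids are returned in first-seen order without duplicates.
    """
    if seen is None:
        seen = []
    if isinstance(node, dict):
        video_id = node.get('videoId')
        if isinstance(video_id, str) and video_id not in seen:
            seen.append(video_id)
        # Descend regardless of schema: the ids may sit at any depth.
        for value in node.values():
            find_video_ids(value, seen)
    elif isinstance(node, list):
        for item in node:
            find_video_ids(item, seen)
    return seen
```

Because the walk ignores the surrounding structure, it keeps working when intermediate keys are renamed, at the cost of possibly picking up ids from unrelated objects.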
xarantolus b3fd4b155e run flake8 5 years ago
xarantolus 57f72370c5 [youtube] Fix feed extraction
This moves feed extraction from using html content to json metadata. However, loading additional pages no longer works.

The _extract_video_info function also returns a continuation object that contains some metadata that - together with an API key that is in the page source - might be used to request the next page.
5 years ago
xarantolus 6dad89289c [youtube] Move search URL extraction to appropriate extractor 5 years ago
xarantolus 7a74feda78 [youtube] Fix extraction of search urls (closes ytdl-org/youtube-dl#25696) 5 years ago
Sergey M․ 2391941f28 [brightcove] Improve embed detection (closes #25674) 5 years ago
Sergey M․ 9ff6165a81 release 2020.06.16.1 5 years ago
Sergey M․ ee0b726cd7 [youtube] Force old layout (closes #25682, closes #25683, closes #25680, closes #25686) 5 years ago
Sergey M․ dbeafce5d5 [youtube] Fix categories and improve tags extraction 5 years ago
Sergey M․ ed604ce7bc release 2020.06.16 5 years ago
Sergey M․ a6211d237b [youtube] Fix uploader id and uploader URL extraction 5 years ago
Sergey M․ 7b16239a49 [youtube] Improve view count extraction 5 years ago
Sergey M․ 37357d21a9 [youtube] Fix upload date extraction 5 years ago
Sergey M․ b477fc1314 [youtube] Fix thumbnails extraction and remove uploader id extraction warning (closes #25676) 5 years ago
Sergey M․ d84b21b427 [youtube] Fix playlist and feed extraction (closes #25675) 5 years ago
Philipp Hagemeister 48bd042ce7 [facebook] Support single-video ID links
I stumbled upon this at https://www.facebook.com/bwfbadminton/posts/10157127020046316 . No idea how prevalent it is yet.
5 years ago
Sergey M․ 84213ea8d4 [youtube] Extract chapters from JSON (closes #24819) 5 years ago
Sergey M․ 562de77f41 [kaltura] Add support for multiple embeds on a webpage (closes #25523) 5 years ago
Sergey M․ e1723c4bac release 2020.06.06 5 years ago
Sergey M․ a5b6102ea8 [tele5] Bypass geo restriction 5 years ago
Sergey M․ b77888228d [jwplatform] Add support for bypass geo restriction 5 years ago
Sergey M․ 0b1eaec3bc [tele5] Prefer jwplatform over nexx (closes #25533) 5 years ago
Sergey M․ b37e47a3f9 [twitch:stream] Expect 400 and 410 HTTP errors from API 5 years ago
Sergey M․ ce3735df02 [twitch:stream] Fix extraction (closes #25528) 5 years ago
Sergey M․ a0455d0ffd [twitch] Pass v5 accept header and fix thumbnails extraction (closes #25531) 5 years ago
Sergey M․ c8b232cc48 [brightcove] Sort imports 5 years ago
Sergey M․ b4eb0bc7bd [brightcove] Fix subtitles extraction (closes #25540) 5 years ago
Matej Dujava d5147b65ac [malltv] Add support for sk.mall.tv (#25445) 5 years ago
Sergey M․ 7b0b53ea69 [twitter:broadcast] Add untitled periscope broadcast test 5 years ago
Sergey M․ 7016e24ebe [periscope] Fix untitled broadcasts (#25482) 5 years ago
Sergey M․ bef4688c72 [jwplatform] Improve embeds extraction (closes #25467) 5 years ago
Sergey M․ 228c1d685b release 2020.05.29 5 years ago
Sergey M․ fe515e5c75 [ard:beta] Extend _VALID_URL (closes #25405) 5 years ago
striker.sh 1db5ab6b34 [youtube] Add support for more invidious instances (#25417) 5 years ago