Andrew Dolgov
9423d72f6c
parser: force libxml error messages to valid utf8
6 years ago
Andrew Dolgov
1a484ec3f5
feedparser: remove horrible hacks for feeds with broken encoding
6 years ago
Andrew Dolgov
55ef85adc0
parser: clean() attribute values by default (except content)
6 years ago
Tobias Kappé
22a866edb5
Store language of entries as indicated by the feed.
6 years ago
Andrew Dolgov
ea79a0e033
remove some redundant php closing tags
8 years ago
Andrew Dolgov
e4071d2544
feedparser: fix syntax error
8 years ago
Andrew Dolgov
4a5490dc58
feedparser: fix normalize_encoding() not working properly for some encodings
8 years ago
Andrew Dolgov
cde8272205
parser: only try to convert encoding if mbstring actually supports it
8 years ago
Andrew Dolgov
e55a5ec601
atom: consider link rel=alternate in feed element for site urls
10 years ago
Lukas Hybner
4834ad5639
fix parse atom feed
10 years ago
Andrew Dolgov
31bd6f7643
parser: trim some feed-extracted data link titles and links
11 years ago
Andrew Dolgov
4ce778677e
Revert "parser: experimentally set preservewhitespace to false"
This reverts commit b69d94eeeb.
11 years ago
Andrew Dolgov
b69d94eeeb
parser: experimentally set preservewhitespace to false
11 years ago
wltb
d71ac5d3da
implemented get_comments_url() and Atom Threading Extension
11 years ago
Andrew Dolgov
4ad04ee227
report all libxml errors in updater debug output
force utf8 encoding if devforceupdate is on
parser: try to convert non-unicode feeds with specified encoding to utf8
before trying to remove dangling utf8 characters in case of utf8-related
libxml errors because doing so produces garbage content
11 years ago
wltb
2c6f3c2eb3
Feedparser: Add important check for unknown feed type detection
11 years ago
wltb
54f23d38a6
Feedparser: Store libXML fatal error messages in an array, repair error reporting
11 years ago
Andrew Dolgov
4d49863f65
fix style in feedparser
11 years ago
wltb
d3305ff8de
Feedparser: Only format fatal errors
11 years ago
wltb
5fa36a6401
Feedparser: Change handling of libxml error 9 (cycle all errors)
11 years ago
Andrew Dolgov
f612dbe8a0
improve support for feeds with invalid unicode entities
12 years ago
wltb
f8160106af
Feedparser/encoding change: More general regular expression, set encoding to UTF-8 explicitly
12 years ago
Andrew Dolgov
4f00f55ca2
parser: add charset recoding hack for systems where libxml is built without support for iconv (handles libxml error 32)
12 years ago
Andrew Dolgov
a9000b0344
feedparser: check if initial xpath query for root element returns anything
12 years ago
Andrew Dolgov
f17c3ee29a
parser: fix get_links() for RSS
12 years ago
Andrew Dolgov
1874c8d6df
feedparser: properly handle nodeValue of rss channel link element ( closes #692 )
12 years ago
Andrew Dolgov
d1f3fa9791
try to force-convert feed data to utf8
12 years ago
Andrew Dolgov
a3b9fd12c5
fix support of Atom 0.3
12 years ago
kpn3m0
349c4229fd
Feature #686
If the feed cannot be parsed, no specific error is reported, only the generic "Unknown/unsupported feed type". The patch uses the error from the XML parser, which gives more detailed error reporting.
12 years ago
Andrew Dolgov
8a95d630a9
fix rss content:encoded not used
12 years ago
Andrew Dolgov
d4992d6b48
add support for dc:subject and slash:comments
12 years ago
Andrew Dolgov
852d4ac890
support RDF-XML feeds
12 years ago
Andrew Dolgov
4c00e15b5d
pass xpath object to feeditem, support media-rss objects
12 years ago
Andrew Dolgov
3c8060aca0
fix get_links() for RSS
12 years ago
Andrew Dolgov
b9eee80e08
parser: add get_links()
12 years ago
Andrew Dolgov
b8f316dc28
change caching to save xml data, remove RDF init section
12 years ago
Andrew Dolgov
04d2f9c831
add basic rss support
12 years ago
Andrew Dolgov
cd07592c29
add basic tinyparser/atom
12 years ago