* [CssSelectorBridge] Metadata from social embed (#3602, #3687)
Implement the following metadata sources:
- Facebook Open Graph
- Twitter <meta> tags
- Standard <meta> tags
- JSON linked data (ld+json)
The following metadata is supported:
- Canonical URL (may help remove garbage from URLs)
- Article title
- Truncated summary
- Published/Updated timestamp
- Enclosure/Thumbnail image
- Author Name or Twitter handle
SitemapBridge will also automatically benefit from this commit.
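
For illustration only, here is a minimal sketch of how the metadata sources listed above could be merged. It is written in Python rather than the project's PHP, and it is not the CssSelectorBridge implementation: the names `MetaCollector` and `extract_metadata`, the output keys, and the priority order between sources are all assumptions.

```python
import json
from html.parser import HTMLParser


class MetaCollector(HTMLParser):
    """Hypothetical helper: collects <meta> tags, the canonical link and ld+json blocks."""

    def __init__(self):
        super().__init__()
        self.meta = {}         # e.g. {"og:title": "...", "twitter:creator": "..."}
        self.canonical = None  # href of <link rel="canonical">
        self.ld_json = []      # decoded ld+json payloads
        self._in_ld = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta":
            # Open Graph uses property=..., Twitter and standard tags use name=...
            key = attrs.get("property") or attrs.get("name")
            if key and attrs.get("content") is not None:
                self.meta.setdefault(key.lower(), attrs["content"])
        elif tag == "link" and attrs.get("rel") == "canonical":
            self.canonical = attrs.get("href")
        elif tag == "script" and attrs.get("type") == "application/ld+json":
            self._in_ld = True
            self._buf = []

    def handle_data(self, data):
        if self._in_ld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_ld:
            self._in_ld = False
            try:
                self.ld_json.append(json.loads("".join(self._buf)))
            except json.JSONDecodeError:
                pass  # ignore malformed ld+json blocks


def _ld_author(ld):
    """Return the author name from an ld+json object, if one is present."""
    author = ld.get("author")
    if isinstance(author, list) and author:
        author = author[0]
    if isinstance(author, dict):
        author = author.get("name")
    return author if isinstance(author, str) else None


def extract_metadata(html):
    """Merge the sources; the fallback order below is an assumption."""
    parser = MetaCollector()
    parser.feed(html)
    parser.close()
    m = parser.meta
    # Use the first ld+json payload that is a single object.
    ld = next((d for d in parser.ld_json if isinstance(d, dict)), {})
    return {
        "uri": parser.canonical or m.get("og:url"),
        "title": m.get("og:title") or m.get("twitter:title") or ld.get("headline"),
        "summary": (m.get("og:description") or m.get("twitter:description")
                    or m.get("description")),
        "timestamp": m.get("article:published_time") or ld.get("datePublished"),
        "image": m.get("og:image") or m.get("twitter:image"),
        "author": _ld_author(ld) or m.get("author") or m.get("twitter:creator"),
    }
```

Each field falls back through the sources in turn, so a page that only exposes some of them still yields partial metadata.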
* [php8backports] Add array_is_list()
Needed this function for ld+json implementation in CssSelectorBridge.
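
As a rough illustration of what `array_is_list()` checks in PHP 8.1, the sketch below mimics it in Python, using a dict as a stand-in for a PHP array (an assumption made for illustration; this is not the backport itself). A plausible use in ld+json handling is telling a list of objects apart from a single object, but that is an inference, not something stated in the commit.

```python
def array_is_list(array: dict) -> bool:
    """Return True if the keys are exactly 0, 1, 2, ... in insertion order."""
    expected = 0
    for key in array:
        # Reject non-integer keys (including bool, which equals 0/1 in Python).
        if type(key) is not int or key != expected:
            return False
        expected += 1
    return True
```

An empty array counts as a list, while `{1: "a"}` or `{1: "b", 0: "a"}` do not, because the keys must start at 0 and appear in order.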
* [SitemapBridge] Add option to discard thumbnail
* [CssSelectorBridge] Fix linting issues
* .
* attempt to fix #2943 https://github.com/RSS-Bridge/rss-bridge/issues/2943
* Revert "."
This reverts commit c0b6ccfea6ce873e9c9ce7c3600b3a96d9911468.
* lint
* Revert "attempt to fix#2943"
This reverts commit 9f1a66e48d636a543e2171df212acf9731744bd0.
* moved fix to BridgeAbstract
* fix undefined index
* lint
* Filter out any advertisement tweets
* Make some filters work; fix a bug that may occur with the tweet ID list.
* clear phpcs warning, ignore line length warning
* [core] support xhtml content type in FeedExpander
* [FilterBridge] change defaultValue to exampleValue
* [core] support content with child elements in FeedExpander
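
A minimal sketch of the idea behind the two FeedExpander changes above, written in Python for illustration (the real code is PHP; `entry_content` is a hypothetical name). When an Atom `<content>` element has `type="xhtml"` or contains child elements, the children are serialized instead of reading only the text node.

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"


def entry_content(entry):
    """Return the content of an Atom entry as a string."""
    content = entry.find(ATOM + "content")
    if content is None:
        return ""
    if content.get("type") == "xhtml" or len(content) > 0:
        # Serialize every child element (and any leading text) as markup.
        parts = [content.text or ""]
        for child in content:
            parts.append(ET.tostring(child, encoding="unicode"))
        return "".join(parts).strip()
    # Plain text or escaped HTML: the text node already holds everything.
    return (content.text or "").strip()
```

Note that ElementTree writes namespaced XHTML children with a prefix, so the serialized markup may need normalizing before display.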
* [core] replace everything except bridge name to get a valid whitelist.txt
* [core] do not hard-code the repository name, to improve working with forks
* [core] trim bridge names from whitelist.txt to reduce chance of failure
It is possible to have a cached item with a very old mtime that is technically expired.
So, check that a time is present and that it is within 10 days.
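
One possible reading of this check, sketched in Python for illustration (the actual cache code is PHP; `is_fresh`, the `"time"` key and the seconds-based timestamps are assumptions): an item counts as usable only if it carries a stored time and that time is no more than 10 days old.

```python
import time

TEN_DAYS = 10 * 24 * 60 * 60  # seconds


def is_fresh(item, now=None):
    """Return True if the item has a stored time that is within 10 days of now."""
    now = time.time() if now is None else now
    stored = item.get("time")
    if stored is None:
        return False  # no recorded time: treat as expired
    return now - stored <= TEN_DAYS
```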
* refactor(cache): extract and encapsulate cache expiration logic
* fix: logic bug in getSimpleHTMLDOMCached
* fix: silly me, index should of course be on the key column
* silly me again, PRIMARY keys get index by default lol
* comment out the delete portion in loadData
* remove a few log statements
* tweak twitter cache timeout
* fix(asrocknews): Trying to get property src of non-object
Trying to get property 'src' of non-object at bridges/ASRockNewsBridge.php line 37
* refactor(http): tweak max redirs config
* fix(tiktok)
* fix(gizmodo)
* fix(craig)
* fix(nationalg)
* fix(roadandtrack)
* fix(etsy)