[GolemBridge] Remove multi-page page headers

On multi-page articles like [1], every page after the first has its own
page header, which we currently add to the article content. Since we
stitch the pages back together, those extra page headers are not needed.

[1] https://www.golem.de/news/science-fiction-die-zehn-besten-filme-aus-den-spannenden-70ern-2312-179557.html
Mynacol 2023-12-16 11:21:19 +01:00
parent c9c2944e7c
commit c5f586497f
1 changed file with 0 additions and 3 deletions

@@ -116,9 +116,6 @@ class GolemBridge extends FeedExpander
 // reload html, as remove() is buggy
 $article = str_get_html($article->outertext);
-if ($pageHeader = $article->find('header.paged-cluster-header h1', 0)) {
-    $item .= $pageHeader;
-}
 $header = $article->find('header', 0);
 foreach ($header->find('p, figure') as $element) {