This package is a Laravel wrapper for readability.php. It provides a simple way to extract the main content from a webpage.
You can install the package via Composer:
composer require the-3labs-team/laravel-readability
Since it uses the readability.php package, you will need the xml and mbstring PHP extensions. On Debian or Ubuntu, for example:
$ sudo apt-get install php7.4-xml php7.4-mbstring
Replace 7.4 with the PHP version you are running.
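To confirm the extensions are available to the PHP runtime that serves your application, a quick check along the following lines may help. This is a minimal sketch, not part of the package: the `$required` and `$missing` variable names are illustrative, and the extension names (`dom`, `xml`, `mbstring`) are an assumption about what the packages above provide on a typical Debian/Ubuntu setup.

```php
<?php

// Assumed extension names: the php-xml package usually provides "dom" and "xml",
// and php-mbstring provides "mbstring". Adjust the list if your setup differs.
$required = ['dom', 'xml', 'mbstring'];

// Collect every required extension that is not loaded in this PHP runtime.
$missing = array_values(array_filter(
    $required,
    fn (string $extension): bool => !extension_loaded($extension)
));

if ($missing === []) {
    echo "All required extensions are loaded.\n";
} else {
    echo 'Missing extensions: ' . implode(', ', $missing) . "\n";
}
```

Run it with the same PHP binary or SAPI that your Laravel application uses, since the CLI and the web server can load different sets of extensions.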
use The3LabsTeam\LaravelReadability\Facades\Readability;
$html = '<html>...</html>';
$parsed = Readability::parse($html);
$title = $parsed->getTitle();
You can use the same methods as the original package. Please refer to the readability.php documentation.
$html = '<html>...</html>';
$parsed = Readability::parse($html);
$title = $parsed->getTitle();
$content = $parsed->getContent();
$excerpt = $parsed->getExcerpt();
$author = $parsed->getAuthor();
$direction = $parsed->getDirection();
$image = $parsed->getImage();
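$images = $parsed->getImages();
To include the list of sources in the parsed content, call getSourceList() on the underlying class before parse():
use The3LabsTeam\LaravelReadability\Readability as ReadabilityClass;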
$html = '<html>...</html>';
// Named arguments require PHP 8.0 or later.
$parsed = (new ReadabilityClass($html))
    ->getSourceList(
        domainWhitelist: ['example.com', 'another-example.com/some-path'],
        tagsToExtract: ['a', 'iframe', 'text'] // Optional, default is ['a', 'iframe']
    )
    ->parse();
$content = $parsed->getContent();
`$content` will then contain the list of sources found in the article, for example:
...
<p>Source list: https://example.com/source1, https://example.com/source2</p>
Run the tests with:
composer test
Please see CHANGELOG for more information on what has changed recently.
Please see CONTRIBUTING for details.
Please review our security policy on how to report security vulnerabilities.
The MIT License (MIT). Please see License File for more information.
