p5-HTML-ExtractContent - Perl extension for HTML content extractor with scoring heuristics

Property Value
Distribution FreeBSD 11
Repository FreeBSD Ports Quarterly amd64
Package filename p5-HTML-ExtractContent-0.11.txz
Package name p5-HTML-ExtractContent
Package version 0.11
Package release -
Package architecture amd64
Package type txz
Category perl5 www
Homepage https://metacpan.org/release/HTML-ExtractContent
License GPLv1+, ART10
Maintainer kuriyama@FreeBSD.org
Download size 10.07 KB
Installed size 21.42 KB
HTML::ExtractContent is a module for extracting content from HTML with
scoring heuristics.
It guesses which block of HTML looks like content according to scores
depending on the amount of punctuation marks and the lengths of non-tag
It also guesses whether content end in the block or continue to the next
WWW: https://metacpan.org/release/HTML-ExtractContent


Package Version Architecture Repository
p5-HTML-ExtractContent-0.11.txz 0.11 i386 FreeBSD Ports Quarterly
p5-HTML-ExtractContent-0.11.txz 0.11 i386 FreeBSD Ports Latest
p5-HTML-ExtractContent-0.11.txz 0.11 amd64 FreeBSD Ports Latest
p5-HTML-ExtractContent - - -


Name Value
p5-Class-Accessor-Lvalue = 0.11_1
p5-Exporter-Lite = 0.08
p5-HTML-Parser = 3.72
perl5 = 5.28.1_1


Type URL
Mirror pkg.freebsd.org
Binary Package p5-HTML-ExtractContent-0.11.txz
Source Package www/p5-HTML-ExtractContent

Install Howto

Install p5-HTML-ExtractContent txz package:

# pkg install p5-HTML-ExtractContent

See Also

Package Description
p5-HTML-ExtractMain-0.62_1.txz Perl extension to extract main content of a web page
p5-HTML-Field-1.19_1.txz Perl module to generate HTML form elements
p5-HTML-FillInForm-2.21.txz Perl5 module for auto-filling HTML form fields from previous values
p5-HTML-FillInForm-ForceUTF8-0.03_1.txz FillInForm with UTF-8 encoding
p5-HTML-FillInForm-Lite-1.13_1.txz Perl extension for lightweight FillInForm module in Pure Perl
p5-HTML-Form-6.04.txz Class that represents an HTML form element
p5-HTML-FormFu-2.07.txz HTML Form Creation, Rendering and Validation Framework
p5-HTML-FormFu-Imager-1.00_1.txz Imager.pm helpers for HTML::FormFu file uploads
p5-HTML-FormFu-Model-DBIC-2.03.txz Integrate HTML::FormFu with DBIx::Class
p5-HTML-FormFu-MultiForm-1.03_1.txz Handle multi-page/stage forms with FormFu
p5-HTML-FormHandler-0.40068,1.txz Form handler written in Moose
p5-HTML-FormHandler-Model-DBIC-0.29.txz Model class for FormHandler unsing DBIx::Class
p5-HTML-Format-2.12.txz Module to format HTML to text or PS
p5-HTML-FormatExternal-26.txz HTML to text formatting using external programs
p5-HTML-FormatText-WithLinks-0.14_1.txz Perl5 module to convert HTML to text with links as footnotes