A very fast HTML parser, generating a simplified DOM, with basic element query support.
npm install fast-html-parserFast HTML Parser is a _very fast_ HTML parser. Which will generate a simplified
DOM tree, with basic element query support.
Per the design, it intends to parse massive HTML files in lowest price, thus the `` Faster than htmlparser2! ` Tested with htmlparser-benchmark. ` var root = HTMLParser.parse(' console.log(root.firstChild.structure); console.log(root.querySelector('#list')); Parse given data, and return root of the generated DOM. - data, data to parse `
performance is the top priority. For this reason, some malformatted HTML may not
be able to parse correctly, but most usual errors are covered (eg. HTML4 style
no closing , etc). Install
shell`
npm install --save fast-html-parserPerformance
shell`
fast-html-parser: 2.18409 ms/file ± 1.37431
high5 : 4.55435 ms/file ± 2.51132
htmlparser : 27.6920 ms/file ± 171.588
htmlparser2-dom : 6.22320 ms/file ± 3.48772
htmlparser2 : 3.58360 ms/file ± 2.23658
hubbub : 16.1774 ms/file ± 8.95079
libxmljs : 7.19406 ms/file ± 7.04495
parse5 : 10.7590 ms/file ± 8.09687Usage
js
var HTMLParser = require('fast-html-parser');
');
// ul#list
// li
// #text
// { tagName: 'ul',
// rawAttrs: 'id="list"',
// childNodes:
// [ { tagName: 'li',
// rawAttrs: '',
// childNodes: [Object],
// classNames: [] } ],
// id: 'list',
// classNames: [] }
`API
$3
- options, parse optionsjs
{
lowerCaseTagName: false, // convert tag name to lower case (hurt performance heavily)
script: false, // retrieve content in