npm explorer

Results for "html extraction"

/

Showing 1-20 of 205,687 packages

911-scraper-mcp

v1.0.9

911Proxy Universal Web Scraper MCP Server - supports HTML extraction and screenshots

mcpscraper911proxyweb-scrapermodel-context-protocol
2 months ago0/week
Quality100%
Popularity100%
Maintenance100%

@nosferatu500/textract

v3.1.3

Extracting text from files of various type including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various open office.

textractextracthtmlcsv

textract

v2.5.0

Extracting text from files of various type including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various open office.

textractextracthtmlcsv

deeks

v3.2.0

Retrieve all keys and nested keys from objects and arrays of objects.

getkeysobjectdocument

i18next-cli

v1.42.6

A unified, high-performance i18next CLI.

i18nextswccli
today

pip-requirements-js

v1.0.2

A robust parser for requirements.txt files

pippythonrequirements.txt
4 months ago

html-escaper

v3.0.3

fast and safe way to escape and unescape &<>'" chars

htmlescapeencodeunescape

easygettext

v2.17.0

Simple tools to extract gettext strings

4 years ago0/week
Quality100%
Popularity100%
Maintenance100%

pdf2html

v4.4.0

PDF to HTML or Text conversion using Apache Tika. Also generate PDF thumbnail using Apache PDFBox.

pdftohtmltikapdfboxconvert

he

v1.2.0

A robust HTML entities encoder/decoder with full Unicode support.

stringentitiesentityhtml

html-minifier-terser

v7.2.0

Highly configurable, well-tested, JavaScript-based HTML minifier.

clicompresscompressorcss

apparatus

v0.0.10

various machine learning routines for node

machinelearningmlclassifier

html-webpack-plugin

v5.6.6

Simplifies creation of HTML files to serve your webpack bundles

webpackpluginhtmlhtml-webpack-plugin

html-entities

v2.6.0

Fastest HTML entities encode/decode library.

htmlhtml entitieshtml entities encodehtml entities decode

strong-globalize

v6.0.6

StrongLoop Globalize - API

StrongLoopglobalizecldr
2 years ago

dom-to-semantic-markdown

v1.5.0

DOM to Semantic-Markdown for use in LLMs

markdownhtmlllm
8 months ago

mp4box

v2.3.0

JavaScript version of GPAC's MP4Box tool

mp4HTML 5 mediaMedia Source Extensionstreaming

html-to-text

v9.0.5

Advanced html to plain text converter

htmlnodetextmail

@wordpress/dependency-extraction-webpack-plugin

v6.39.0

Extract WordPress script dependencies from webpack bundles.

wordpressgutenbergwebpackdependency

html-void-elements

v3.0.0

List of HTML void tag names

htmlvoidtagname
Page 1 of 10285
Next
text
+26 more
3 years ago0/week
Quality100%
Popularity100%
Maintenance100%
text
+25 more
6 years ago0/week
Quality100%
Popularity100%
Maintenance100%
deep
+2 more
3 months ago0/week
Quality100%
Popularity100%
Maintenance100%
0/week
Quality100%
Popularity100%
Maintenance100%
0/week
Quality100%
Popularity100%
Maintenance100%
decode
+1 more
4 years ago0/week
Quality100%
Popularity100%
Maintenance100%
pdf
+2 more
7 months ago0/week
Quality100%
Popularity100%
Maintenance100%
encode
+2 more
7 years ago0/week
Quality100%
Popularity100%
Maintenance100%
html
+15 more
2 years ago0/week
Quality100%
Popularity100%
Maintenance100%
clustering
+4 more
7 years ago0/week
Quality100%
Popularity100%
Maintenance100%
3 weeks ago0/week
Quality100%
Popularity100%
Maintenance100%
entities
+2 more
10 months ago0/week
Quality100%
Popularity100%
Maintenance100%
0/week
Quality100%
Popularity100%
Maintenance100%
0/week
Quality100%
Popularity100%
Maintenance100%
2 months ago0/week
Quality100%
Popularity100%
Maintenance100%
plain
+1 more
2 years ago0/week
Quality100%
Popularity100%
Maintenance100%
1 weeks ago0/week
Quality100%
Popularity100%
Maintenance100%
element
+3 more
2 years ago0/week
Quality100%
Popularity100%
Maintenance100%