Showing 1-20 of 40 packages
Polyfill for Intl.Segmenter
A polyfill for Intl.Segmenter
A high-performance wrapper around `Intl.Segmenter` for efficient text segmentation. This class resolves memory handling issues seen with large strings and can enhance performance by 50-500x. Only ~70 loc (with comments) and no dependencies.
Locale-aware word counting powered by the Web API [`Intl.Segmenter`](https://developer.mozilla.org/docs/Web/JavaScript/Reference/Global_Objects/Intl/Segmenter). The script automatically detects the primary writing system for each portion of the input, seg
A lightweight wrapper around Intl.Segmenter for segment-aware string operations
This repo builds .wasm module using icu4c for breaking text into words, so that we can polyfill [Intl Segmenter Proposal](https://github.com/tc39/proposal-intl-segmenter) with full compatibility, even on browsers that do not expose v8BreakIterator api.
Work with grapheme, words, and sentences with small, simple, and fast API using Intl.Segmenter
A polyfill for Intl.Segmenter
A lightweight implementation of the Unicode Text Segmentation (UAX #29)
A javascript text diff implementation.
Like `Intl.Segmenter`, but for paragraphs instead of graphemes/words/sentences
A React component that utilizes Intl.Segmenter to control line breaks in Japanese text.
An extension of Intl.Segmenter that support ANSI codes with zero dependencies
Zephr Browser
Stream UTF-8 bytes and read grapheme clusters safely
Split a string in to sentences. Supports multiple languages.
A powerful and lightweight multilingual tokenizer library that provides natural language processing capabilities for multiple languages including English, Chinese, Japanese, and Korean.
TypeScript/Node.js 版 wordfreq,内置词频数据与中文分词词典,开箱即用的跨语言词频查询与 Zipf/分词能力。
An easy, semantic and reliable way to split a string into characters in JavaScript, even when it contains emojis, flags and diacritics
Draw styled & multiline text on canvas. Supports css compatible line break and vertical vertical writing(vertical-rl).