Effortless and fast string similarity comparison using multiple algorithms, including bigrams, Levenshtein, Jaro-Winkler, and Ratcliff/Obershelp. Perfect for fuzzy search, NLP, and intelligent text matching.
Calculates similarity between two strings
A utility for analyzing text to find bigrams, trigrams, and other n-grams.
Get n-grams from text
Measures how similar two strings are in terms of the number of common bigrams, using the Sørensen–Dice coefficient
The easiest way to get n-gram chunks from strings or token arrays!
Minimal implementations of a couple of classic text analysis tools (TF-IDF and cosine similarity)
Detects similar strings using Simon White's approximate string matching method, which computes the Sørensen–Dice similarity coefficient of sets of adjacent letter pairs.
Find similarly-named files.
A library for scoring encoded text against a provided regex.
Terminal typing trainer focused on accuracy with constraint-based guided practice
Letter pairs sorted by how frequently they appear in English words
Plugin for nlp-compromise
A lightweight and flexible implementation of Markov chains with support for n-grams, text generation, and persistence
A TypeScript-based content-based recommender with multilingual support (Japanese & English). Forked from content-based-recommender.
A configurable text scorer.
> 🚧 This project is a work in progress. For a more complete solution, please use [next-token-prediction](https://github.com/bennyschmidt/next-token-prediction).
This bot detects when suspicious smart contracts are deployed
A TypeScript library for string similarity comparison
A fork of https://github.com/kerryrodden/tiny-tfidf that removes text splitting and creates 3 ngrams instead.
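
Several of the packages above describe the same underlying idea: scoring two strings by the adjacent letter pairs (bigrams) they share, using the Sørensen–Dice coefficient. As a rough illustration only, here is a minimal sketch of that calculation. The function names `bigramCounts` and `diceSimilarity` are hypothetical and do not come from any listed package, and the normalization (lowercasing, dropping whitespace) is an assumption rather than what any particular package does.

```typescript
// Count adjacent letter pairs (bigrams) in a string.
// Assumption: case and whitespace are ignored; listed packages may differ.
function bigramCounts(text: string): Map<string, number> {
  const counts = new Map<string, number>();
  const s = text.toLowerCase().replace(/\s+/g, "");
  for (let i = 0; i < s.length - 1; i++) {
    const pair = s.slice(i, i + 2);
    counts.set(pair, (counts.get(pair) || 0) + 1);
  }
  return counts;
}

// Sørensen–Dice coefficient over bigram multisets:
// 2 * |shared bigrams| / (|bigrams in a| + |bigrams in b|).
// Returns 0 when neither string has any bigrams (e.g. both under 2 letters).
function diceSimilarity(a: string, b: string): number {
  const ca = bigramCounts(a);
  const cb = bigramCounts(b);

  let totalA = 0;
  let totalB = 0;
  ca.forEach((n) => { totalA += n; });
  cb.forEach((n) => { totalB += n; });
  if (totalA + totalB === 0) return 0;

  // Shared bigrams: for each pair, the smaller of its two counts.
  let overlap = 0;
  ca.forEach((n, pair) => {
    overlap += Math.min(n, cb.get(pair) || 0);
  });

  return (2 * overlap) / (totalA + totalB);
}
```

With this sketch, `diceSimilarity("night", "nacht")` evaluates to 0.25, since each word has four bigrams and only `ht` is shared. Counting repeated bigrams as a multiset, rather than a plain set, is one possible design choice; it keeps strings with repeated pairs from being scored as more similar than they are.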