AllProfanity

A blazing-fast, multi-language profanity filter for JavaScript/TypeScript with advanced algorithms (Aho-Corasick, Bloom Filters) delivering 664% faster performance on large texts, intelligent leet-speak detection, and pattern-based context analysis.

[npm version](https://www.npmjs.com/package/allprofanity)
[License: MIT](https://opensource.org/licenses/MIT)

---

What's New in v2.2.0

- Aho-Corasick Algorithm: 664% faster on large texts (1KB+) with O(n) multi-pattern matching
- Bloom Filters: Lightning-fast probabilistic lookups reduce unnecessary checks
- Result Caching: 123x speedup on repeated inputs (perfect for chat apps and forms)
- Pattern-Based Context Detection: Reduces false positives in medical/negation contexts
- Word Boundary Detection: Smart whole-word matching prevents flagging "assassin" or "assistance"
- Flexible Configuration: Choose algorithm and trade-offs based on your use case

Read the full Performance Analysis →

---

Features


- Multiple Algorithm Options: Choose between Trie (default), Aho-Corasick, or Hybrid modes
- 664% Faster on Large Texts: Aho-Corasick delivers O(n) multi-pattern matching
- 123x Speedup with Caching: Result cache perfect for repeated checks (chat, forms, APIs)
- ~27K ops/sec: Default Trie mode handles short texts incredibly fast
- Single-Pass Scanning: O(n) complexity regardless of dictionary size
- Batch Processing Ready: Optimized for high-throughput API endpoints


- Word Boundary Matching: Smart whole-word detection prevents false positives like "assassin" or "assistance"
- Pattern-Based Context Detection: Recognizes medical terms ("anal region") and negation patterns ("not bad")
- Advanced Leet-Speak: Detects obfuscated profanities (f#ck, a55hole, sh1t, etc.)
- Comprehensive Coverage: Catches profanity while minimizing false flags
- Configurable Strictness: Tune detection sensitivity to your needs
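
To make the word-boundary and leet-speak bullets concrete, here is a minimal sketch of the general idea. It is not AllProfanity's implementation: `LEET_MAP`, `normalizeLeet`, and `containsWholeWord` are hypothetical names, and the substitution table is an assumed, deliberately small one.

```typescript
// Hypothetical substitution table; the library's real mapping is not shown in this README.
const LEET_MAP = new Map<string, string>([
  ["0", "o"], ["1", "i"], ["3", "e"], ["4", "a"],
  ["5", "s"], ["7", "t"], ["@", "a"], ["$", "s"],
]);

// Replace common leet characters with the letters they usually stand for.
function normalizeLeet(text: string): string {
  return Array.from(text.toLowerCase())
    .map((ch) => LEET_MAP.get(ch) ?? ch)
    .join("");
}

// True only when `word` appears as a whole word, not inside a longer word.
function containsWholeWord(text: string, word: string): boolean {
  const escaped = word.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
  // Unicode-aware boundaries: no letter or digit directly before or after the match.
  const pattern = new RegExp(`(?<![\\p{L}\\p{N}])${escaped}(?![\\p{L}\\p{N}])`, "iu");
  return pattern.test(text);
}

console.log(normalizeLeet("a55hole"));             // "asshole"
console.log(containsWholeWord("assassin", "ass")); // false
console.log(containsWholeWord("what an ass!", "ass")); // true
```

A blanket mapping like this also rewrites legitimate digits, which is one reason a real filter would normalize more selectively.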


- Multi-Language Support: Built-in dictionaries for English, Hindi, French, German, Spanish, Bengali, Tamil, Telugu, Brazilian Portuguese
- Multiple Scripts: Latin/Roman (Hinglish) and native scripts (Devanagari, Tamil, Telugu, etc.)
- Custom Dictionaries: Add/remove words or entire language packs at runtime
- Whitelisting: Exclude safe words from detection
- Severity Scoring: Assess content offensiveness (MILD, MODERATE, SEVERE, EXTREME)


- TypeScript Support: Fully typed API with comprehensive documentation
- Zero 3rd-Party Dependencies: Only internal code and data
- Configurable: Tune performance vs accuracy for your use case
- No Dictionary Exposure: The filter's internal word list is never exposed through the API (sample word arrays are exported separately)
- Universal: Works in Node.js and browsers

---

Installation

```bash
npm install allprofanity
# or
yarn add allprofanity
```





Generate configuration file (optional):

```bash
npx allprofanity
# Creates allprofanity.config.json and config.schema.json in your project
```





---



Quick Start

```typescript
import profanity from 'allprofanity';

// Simple check
profanity.check('This is a clean sentence.');        // false
profanity.check('What the f#ck is this?');           // true (leet-speak detected)
profanity.check('यह एक चूतिया परीक्षण है।');           // true (Hindi)
profanity.check('Ye ek chutiya test hai.');          // true (Hinglish Roman script)
```





---



Algorithm Configuration



AllProfanity v2.2+ offers multiple algorithms optimized for different use cases. You can configure it through constructor options or a config file.






#### Method 1: Constructor Options (Inline)

```typescript
import { AllProfanity } from 'allprofanity';

const filter = new AllProfanity({
  algorithm: { matching: "hybrid" },
  performance: { enableCaching: true }
});
```





#### Method 2: Config File (Recommended)

```bash
# Generate config files in your project
npx allprofanity
```

This creates:

- `allprofanity.config.json` (main config)
- `config.schema.json` (for IDE autocomplete)

```typescript
import { AllProfanity } from 'allprofanity';
import config from './allprofanity.config.json';

// Load from generated config file
const filter = AllProfanity.fromConfig(config);

// Or directly from object (no file needed)
const filter2 = AllProfanity.fromConfig({
  algorithm: { matching: "hybrid", useContextAnalysis: true },
  performance: { enableCaching: true, cacheSize: 1000 }
});
```





Example Config File (`allprofanity.config.json`):

```json
{
  "algorithm": {
    "matching": "hybrid",
    "useAhoCorasick": true,
    "useBloomFilter": true,
    "useContextAnalysis": true
  },
  "contextAnalysis": {
    "enabled": true,
    "contextWindow": 50,
    "languages": ["en"],
    "scoreThreshold": 0.5
  },
  "profanityDetection": {
    "enableLeetSpeak": true,
    "caseSensitive": false,
    "strictMode": false
  },
  "performance": {
    "enableCaching": true,
    "cacheSize": 1000
  }
}
```





Config File: Run `npx allprofanity` to generate config files in your project. The JSON schema provides IDE autocomplete and validation.



---






#### 1. Default (Best for General Use)

```typescript
import { AllProfanity } from 'allprofanity';

const filter = new AllProfanity();
// Uses optimized Trie - fast and reliable (~27K ops/sec)
```





#### 2. Large Text Processing (Documents, Articles)

```typescript
const filter = new AllProfanity({
  algorithm: { matching: "aho-corasick" }
});
// 664% faster on 1KB+ texts
```
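
For readers unfamiliar with Aho-Corasick, the sketch below shows the textbook technique: a trie of all patterns plus failure links, so the text is scanned once regardless of how many patterns there are. It is an illustration only, not the library's code; `AcNode`, `buildAutomaton`, and `findAll` are hypothetical names, and the sketch does no word-boundary or leet-speak handling.

```typescript
interface AcNode {
  next: Map<string, number>; // outgoing edges: character -> node index
  fail: number;              // node to fall back to on a mismatch
  out: string[];             // patterns ending here (own plus inherited via fail links)
}

function buildAutomaton(patterns: string[]): AcNode[] {
  const nodes: AcNode[] = [{ next: new Map(), fail: 0, out: [] }];
  // 1. Insert every pattern into the trie.
  for (const pattern of patterns) {
    let state = 0;
    for (const ch of pattern) {
      let target = nodes[state].next.get(ch);
      if (target === undefined) {
        target = nodes.length;
        nodes.push({ next: new Map(), fail: 0, out: [] });
        nodes[state].next.set(ch, target);
      }
      state = target;
    }
    nodes[state].out.push(pattern);
  }
  // 2. Breadth-first pass: depth-1 nodes fail to the root, deeper nodes are derived.
  const queue: number[] = [...nodes[0].next.values()];
  while (queue.length > 0) {
    const current = queue.shift()!;
    for (const [ch, child] of nodes[current].next) {
      let fallback = nodes[current].fail;
      while (fallback !== 0 && !nodes[fallback].next.has(ch)) {
        fallback = nodes[fallback].fail;
      }
      nodes[child].fail = nodes[fallback].next.get(ch) ?? 0;
      nodes[child].out.push(...nodes[nodes[child].fail].out);
      queue.push(child);
    }
  }
  return nodes;
}

// Single pass over the text; reports every pattern occurrence with UTF-16 offsets.
function findAll(nodes: AcNode[], text: string): Array<{ word: string; start: number; end: number }> {
  const matches: Array<{ word: string; start: number; end: number }> = [];
  let state = 0;
  let index = 0;
  for (const ch of text) {
    while (state !== 0 && !nodes[state].next.has(ch)) state = nodes[state].fail;
    state = nodes[state].next.get(ch) ?? 0;
    index += ch.length;
    for (const word of nodes[state].out) {
      matches.push({ word, start: index - word.length, end: index });
    }
  }
  return matches;
}

const automaton = buildAutomaton(["he", "she", "his", "hers"]);
console.log(findAll(automaton, "ushers").map((m) => m.word)); // ["she", "he", "hers"]
```

The build cost is paid once per dictionary; after that, scan time grows with the length of the text plus the number of matches, which is why this approach pays off on longer inputs.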





#### 3. Reduced False Positives (Social Media, Content Moderation)

```typescript
const filter = new AllProfanity({
  algorithm: {
    matching: "hybrid",
    useBloomFilter: true,
    useAhoCorasick: true,
    useContextAnalysis: true
  },
  contextAnalysis: {
    enabled: true,
    contextWindow: 50,
    languages: ["en"],
    scoreThreshold: 0.5
  }
});
// Pattern-based context detection reduces medical/negation false positives
```
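
One way to picture how `contextWindow` and `scoreThreshold` could interact is sketched below. Everything in it is an assumption made for illustration: the keyword list, the scoring rule, and the names `contextScore` and `shouldFlag` are hypothetical, and the library's actual patterns and scoring are not documented in this README.

```typescript
interface Match { word: string; start: number; end: number }

// Hypothetical keyword list standing in for the library's context patterns.
const BENIGN_CONTEXT_KEYWORDS = ["patient", "medical", "clinical", "diagnosis", "region"];

// Score in [0, 1]: 1 means no benign hints nearby; each hint lowers the score by 0.5 (assumed rule).
function contextScore(text: string, match: Match, contextWindow: number): number {
  const from = Math.max(0, match.start - contextWindow);
  const to = Math.min(text.length, match.end + contextWindow);
  // Only look at the characters around the match, not the match itself.
  const surrounding = (text.slice(from, match.start) + " " + text.slice(match.end, to)).toLowerCase();
  const hits = BENIGN_CONTEXT_KEYWORDS.filter((kw) => surrounding.includes(kw)).length;
  return Math.max(0, 1 - 0.5 * hits);
}

// A match is kept only when its score is above the threshold,
// so a higher threshold makes the filter less sensitive.
function shouldFlag(text: string, match: Match, contextWindow = 50, scoreThreshold = 0.5): boolean {
  return contextScore(text, match, contextWindow) > scoreThreshold;
}

const sentence = "The patient reported pain in the anal region.";
const start = sentence.indexOf("anal");
console.log(shouldFlag(sentence, { word: "anal", start, end: start + 4 })); // false
```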





#### 4. Repeated Checks (Chat, Forms, APIs)

```typescript
const filter = new AllProfanity({
  performance: {
    enableCaching: true,
    cacheSize: 1000
  }
});
// 123x speedup on cache hits
```
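
The benefit of a bounded result cache can be seen in a small sketch. This assumes a least-recently-used eviction policy, which the README does not confirm; `ResultCache` and `withCache` are hypothetical names, and the wrapped `check` function stands in for any expensive scan.

```typescript
// Minimal LRU cache built on Map's insertion order (the oldest entry comes first).
class ResultCache<V> {
  private readonly entries = new Map<string, V>();
  private readonly maxSize: number;

  constructor(maxSize: number) {
    this.maxSize = maxSize;
  }

  get(key: string): V | undefined {
    const value = this.entries.get(key);
    if (value === undefined) return undefined;
    // Re-insert to mark the key as most recently used.
    this.entries.delete(key);
    this.entries.set(key, value);
    return value;
  }

  set(key: string, value: V): void {
    this.entries.delete(key);
    this.entries.set(key, value);
    if (this.entries.size > this.maxSize) {
      // Evict the least recently used key.
      const oldest = this.entries.keys().next().value as string;
      this.entries.delete(oldest);
    }
  }
}

// Wrap any checker so repeated inputs skip the scan entirely.
function withCache(check: (text: string) => boolean, cacheSize = 1000): (text: string) => boolean {
  const cache = new ResultCache<boolean>(cacheSize);
  return (text) => {
    const cached = cache.get(text);
    if (cached !== undefined) return cached;
    const result = check(text);
    cache.set(text, result);
    return result;
  };
}
```

A cache like this only helps when identical strings recur, as with repeated form validation or common chat messages, and `cacheSize` bounds the memory it uses.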





#### 5. Medical/Professional Content

```typescript
const filter = new AllProfanity({
  algorithm: {
    matching: "hybrid",
    useContextAnalysis: true
  },
  contextAnalysis: {
    enabled: true,
    contextWindow: 100,
    scoreThreshold: 0.7  // Higher threshold = less sensitive
  }
});
// Reduces false positives from medical terms using keyword patterns
```





| Use Case | Algorithm | Speed | Detection | Best For |
|----------|-----------|-------|-----------|----------|
| Short texts (<500 chars) | Trie (default) | ~27K ops/sec | Excellent | Chat, comments |
| Large texts (1KB+) | Aho-Corasick | ~9.6K ops/sec | Excellent | Documents, articles |
| Repeated patterns | Any + Caching | 123x faster | Excellent | Forms, validation |
| Content moderation | Hybrid + Context | Moderate | Good (fewer false positives) | Social media, UGC |
| Professional content | Hybrid + Context (strict) | Moderate | Reduced false flags | Medical, academic |



See detailed benchmarks and comparisons →



---



API Reference & Examples



### `check()`

Returns `true` if the text contains any profanity.

```typescript
profanity.check('This is a clean sentence.');  // false
profanity.check('This is a bullshit sentence.'); // true
profanity.check('What the f#ck is this?'); // true (leet-speak)
profanity.check('यह एक चूतिया परीक्षण है।'); // true (Hindi)
```





---



### `detect()`

Returns a detailed result:

- `hasProfanity: boolean`
- `detectedWords: string[]` (actual matched words)
- `cleanedText: string` (character-masked)
- `severity: ProfanitySeverity` (`MILD`, `MODERATE`, `SEVERE`, `EXTREME`)
- `positions: Array<{ word: string, start: number, end: number }>`

```typescript
const result = profanity.detect('This is fucking bullshit and chutiya.');
console.log(result.hasProfanity); // true
console.log(result.detectedWords); // ['fucking', 'bullshit', 'chutiya']
console.log(result.severity); // 3 (SEVERE)
console.log(result.cleanedText); // "This is ******* ******** and *******."
console.log(result.positions); // e.g. [{word: 'fucking', start: 8, end: 15}, ...]
```
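
The `positions` array is enough to build custom masking on the caller's side. The sketch below assumes `end` is exclusive, as the sample output suggests (a 7-letter word spanning 8 to 15); `Position` and `maskByPositions` are hypothetical helpers, not part of the library.

```typescript
interface Position { word: string; start: number; end: number }

// Replace every reported span with a repeated placeholder character.
function maskByPositions(text: string, positions: Position[], placeholder = "*"): string {
  let masked = text;
  // Work right-to-left so earlier offsets stay valid even if the placeholder length changes.
  for (const { start, end } of [...positions].sort((a, b) => b.start - a.start)) {
    masked = masked.slice(0, start) + placeholder.repeat(end - start) + masked.slice(end);
  }
  return masked;
}

console.log(
  maskByPositions("This is bullshit.", [{ word: "bullshit", start: 8, end: 16 }], "#")
); // "This is ########."
```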





---



### `clean()`

Replace each character of profane words with a placeholder (default: `*`).

```typescript
profanity.clean('This contains bullshit.'); // "This contains ********."
profanity.clean('This contains bullshit.', '#'); // "This contains ########."
profanity.clean('यह एक चूतिया परीक्षण है।'); // e.g. "यह एक ****** परीक्षण है।"
```





---



### `cleanWithPlaceholder()`

Replace each profane word with a single placeholder (default: `***`).

```typescript
profanity.cleanWithPlaceholder('This contains bullshit.'); // "This contains ***."
profanity.cleanWithPlaceholder('This contains bullshit.', '[CENSORED]'); // "This contains [CENSORED]."
profanity.cleanWithPlaceholder('यह एक चूतिया परीक्षण है।', '####'); // e.g. "यह एक #### परीक्षण है।"
```





---



### `add()`

Add a word or an array of words to the profanity filter.

```typescript
profanity.add('badword123');
profanity.check('This is badword123.'); // true

profanity.add(['mierda', 'puta']);
profanity.check('Esto es mierda.'); // true (Spanish)
profanity.check('Qué puta situación.'); // true
```





---



### `remove()`

Remove a word or an array of words from the profanity filter.

```typescript
profanity.remove('bullshit');
profanity.check('This is bullshit.'); // false

profanity.remove(['mierda', 'puta']);
profanity.check('Esto es mierda.'); // false
```





---



### `addToWhitelist()`

Whitelist words so they are never flagged as profane.

```typescript
profanity.addToWhitelist(['fuck', 'idiot', 'shit']);
profanity.check('He is a fucking idiot.'); // false
profanity.check('Fuck this shit.'); // false

// Remove from whitelist to restore detection
profanity.removeFromWhitelist(['fuck', 'idiot', 'shit']);
```





---



### `removeFromWhitelist()`

Remove words from the whitelist so they can be detected again.

```typescript
profanity.removeFromWhitelist(['anal']);
```





---



### `setPlaceholder()`

Set the default placeholder character for `clean()`.

```typescript
profanity.setPlaceholder('#');
profanity.clean('This is bullshit.'); // "This is ########."
profanity.setPlaceholder('*'); // Reset to default
```





---



### `updateConfig()`

Change configuration at runtime. Options include `enableLeetSpeak`, `caseSensitive`, `strictMode`, `detectPartialWords`, `defaultPlaceholder`, `languages`, and `whitelistWords`.

```typescript
profanity.updateConfig({ caseSensitive: true, enableLeetSpeak: false });
profanity.check('FUCK'); // false (if caseSensitive)

profanity.updateConfig({ caseSensitive: false, enableLeetSpeak: true });
profanity.check('f#ck'); // true
```





---



### `loadLanguage()`

Load a built-in language.

```typescript
profanity.loadLanguage('french');
profanity.check('Ce mot est merde.'); // true
```





---



### `loadLanguages()`

Load multiple built-in languages at once.

```typescript
profanity.loadLanguages(['english', 'french', 'german']);
profanity.check('Das ist scheiße.'); // true (German)
```





---



### `loadIndianLanguages()`

Convenience: Load all major Indian language packs.

```typescript
profanity.loadIndianLanguages();
profanity.check('यह एक बेंगाली गाली है।'); // true (Bengali)
profanity.check('This is a Tamil profanity: புண்டை'); // true
```





---



### `loadCustomDictionary()`

Add your own dictionary as an additional language.

```typescript
profanity.loadCustomDictionary('swedish', ['fan', 'jävla', 'skit']);
profanity.loadLanguage('swedish');
profanity.check('Det här är skit.'); // true
```





---



### `getLoadedLanguages()`

Returns the names of all currently loaded language packs.

```typescript
console.log(profanity.getLoadedLanguages()); // ['english', 'hindi', ...]
```





---



### `getAvailableLanguages()`

Returns the names of all available built-in language packs.

```typescript
console.log(profanity.getAvailableLanguages());
// ['english', 'hindi', 'french', 'german', 'spanish', 'bengali', 'tamil', 'telugu', 'brazilian']
```





---



### `clearList()`

Remove all loaded languages and dynamic words (start with a clean filter).

```typescript
profanity.clearList();
profanity.check('fuck'); // false

profanity.loadLanguage('english');
profanity.check('fuck'); // true
```





---



### `getConfig()`

Get the current configuration.

```typescript
console.log(profanity.getConfig());
/*
{
  defaultPlaceholder: '*',
  enableLeetSpeak: true,
  caseSensitive: false,
  strictMode: false,
  detectPartialWords: false,
  languages: [...],
  whitelistWords: [...]
}
*/
```





---



Configuration File Structure



AllProfanity supports JSON-based configuration for easy setup and deployment. The config file structure supports all algorithm and detection options.



```typescript
{
  "algorithm": {
    "matching": "trie" | "aho-corasick" | "hybrid",  // Algorithm selection
    "useAhoCorasick": boolean,                        // Enable Aho-Corasick
    "useBloomFilter": boolean,                        // Enable Bloom Filter
    "useContextAnalysis": boolean                     // Enable context analysis
  },
  "bloomFilter": {
    "enabled": boolean,                               // Enable/disable
    "expectedItems": number,                          // Expected dictionary size (default: 10000)
    "falsePositiveRate": number                       // Acceptable false positive rate (default: 0.01)
  },
  "ahoCorasick": {
    "enabled": boolean,                               // Enable/disable
    "prebuild": boolean                               // Prebuild automaton (default: true)
  },
  "contextAnalysis": {
    "enabled": boolean,                               // Enable/disable pattern-based context detection
    "contextWindow": number,                          // Characters around match to check (default: 50)
    "languages": string[],                            // Languages for keyword patterns (default: ["en"])
    "scoreThreshold": number                          // Detection threshold 0-1 (default: 0.5)
  },
  "profanityDetection": {
    "enableLeetSpeak": boolean,                       // Detect l33t speak (default: true)
    "caseSensitive": boolean,                         // Case sensitive matching (default: false)
    "strictMode": boolean,                            // Require word boundaries (default: false)
    "detectPartialWords": boolean,                    // Detect within words (default: false)
    "defaultPlaceholder": string                      // Default censoring character (default: "*")
  },
  "performance": {
    "enableCaching": boolean,                         // Enable result cache (default: false)
    "cacheSize": number                               // Cache size limit (default: 1000)
  }
}
```
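
To get a feel for what `expectedItems` and `falsePositiveRate` imply, the standard Bloom filter sizing formulas can be evaluated directly: m = -n ln(p) / (ln 2)^2 bits and k = (m / n) ln 2 hash functions. This is a sketch of the textbook formulas only; whether the library sizes its filter exactly this way is an assumption, and `bloomParameters` is a hypothetical helper.

```typescript
// Textbook Bloom filter sizing for n expected items and target false-positive rate p.
function bloomParameters(
  expectedItems: number,
  falsePositiveRate: number
): { bits: number; hashes: number } {
  // m = -n * ln(p) / (ln 2)^2, rounded up to a whole number of bits.
  const bits = Math.ceil((-expectedItems * Math.log(falsePositiveRate)) / Math.LN2 ** 2);
  // k = (m / n) * ln 2, rounded to the nearest whole hash function (at least one).
  const hashes = Math.max(1, Math.round((bits / expectedItems) * Math.LN2));
  return { bits, hashes };
}

console.log(bloomParameters(10000, 0.01)); // { bits: 95851, hashes: 7 }
```

With the documented defaults that is about 12 KB of bit array, and lowering `falsePositiveRate` grows the filter logarithmically rather than linearly.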





#### High Performance (Large Texts)

```json
{
  "algorithm": { "matching": "aho-corasick" },
  "ahoCorasick": { "enabled": true, "prebuild": true },
  "profanityDetection": { "enableLeetSpeak": true }
}
```





#### Reduced False Positives (Content Moderation)

```json
{
  "algorithm": {
    "matching": "hybrid",
    "useContextAnalysis": true,
    "useBloomFilter": true
  },
  "contextAnalysis": {
    "enabled": true,
    "contextWindow": 50,
    "scoreThreshold": 0.5
  },
  "performance": { "enableCaching": true }
}
```





#### Balanced (Production)

```json
{
  "algorithm": {
    "matching": "hybrid",
    "useAhoCorasick": true,
    "useBloomFilter": true
  },
  "profanityDetection": { "enableLeetSpeak": true },
  "performance": { "enableCaching": true, "cacheSize": 1000 }
}
```





Step 1: Generate Config Files

```bash
# Run this in your project directory
npx allprofanity

# Output:
# ✅ AllProfanity configuration files created!
#
# Created files:
#   📄 allprofanity.config.json - Main configuration
#   📄 config.schema.json - JSON schema for IDE autocomplete
```





Step 2: Load Config in Your Code

```typescript
// ES Modules / TypeScript
// Importing JSON requires "resolveJsonModule": true in tsconfig.json
import { AllProfanity } from 'allprofanity';
import config from './allprofanity.config.json';

const filter = AllProfanity.fromConfig(config);
```

```javascript
// CommonJS (Node.js)
const { AllProfanity } = require('allprofanity');
const config = require('./allprofanity.config.json');

const filter = AllProfanity.fromConfig(config);
```





Step 3: Customize Config

Edit `allprofanity.config.json` to enable or disable features. Your IDE will provide autocomplete thanks to the JSON schema.



---



Severity Levels



Severity reflects the number and variety of detected profanities:



| Level     | Enum Value | Description                             |
|-----------|------------|-----------------------------------------|
| MILD      | 1          | 1 unique or total word                  |
| MODERATE  | 2          | 2 unique or total words                 |
| SEVERE    | 3          | 3 unique or total words                 |
| EXTREME   | 4          | 4+ unique or 5+ total profane words     |
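
The table can be read as a simple threshold function over the unique and total match counts. The sketch below is one interpretation of the table, not the library's code: the table leaves mixed cases open (for example, 2 unique words appearing 4 times in total), and this reading resolves them by taking the higher level. `Severity` here is a local stand-in for the exported `ProfanitySeverity`, and `severityFromCounts` is a hypothetical helper.

```typescript
// Local stand-in for the library's ProfanitySeverity values.
const Severity = { MILD: 1, MODERATE: 2, SEVERE: 3, EXTREME: 4 } as const;
type SeverityValue = (typeof Severity)[keyof typeof Severity];

// Map match counts to a severity level; returns undefined when nothing was detected.
function severityFromCounts(uniqueWords: number, totalWords: number): SeverityValue | undefined {
  if (totalWords <= 0) return undefined;
  if (uniqueWords >= 4 || totalWords >= 5) return Severity.EXTREME;
  if (uniqueWords >= 3 || totalWords >= 3) return Severity.SEVERE;
  if (uniqueWords >= 2 || totalWords >= 2) return Severity.MODERATE;
  return Severity.MILD;
}

console.log(severityFromCounts(3, 3)); // 3 (SEVERE), matching the detect() example
```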



---



Language Support



- Built-in: English, Hindi, French, German, Spanish, Bengali, Tamil, Telugu, Brazilian Portuguese

- Scripts: Latin/Roman, Devanagari, Tamil, Telugu, Bengali, etc.

- Mixed Content: Handles mixed-language and code-switched sentences.

```typescript
profanity.check('This is bullshit and चूतिया.'); // true (mixed English/Hindi)
profanity.check('Ce mot est merde and पागल.');   // true (French/Hindi)
profanity.check('Isso é uma merda.');             // true (Brazilian Portuguese)
```





---



Use Exported Wordlists



For sample words in a language (for UIs, admin tools, etc.):

```typescript
import { englishBadWords, hindiBadWords } from 'allprofanity';

console.log(englishBadWords.slice(0, 5)); // ["fuck", "shit", ...]
```





---



Security



- No wordlist exposure: There is no `.list()` function, for security and encapsulation. Use the exported word arrays for samples.
- Trie-based: Scales easily to 50,000+ words.
- Handles leet-speak: Catches obfuscated variants like `f#ck` and `a55hole`.



---



Full Example

```typescript
import profanity, { ProfanitySeverity } from 'allprofanity';

// Multi-language detection
profanity.loadLanguages(['english', 'french', 'tamil']);
console.log(profanity.check('Ce mot est merde.')); // true

// Leet-speak detection
console.log(profanity.check('You a f#cking a55hole!')); // true

// Whitelisting
profanity.addToWhitelist(['anal', 'ass']);
console.log(profanity.check('He is an associate professor.')); // false

// Severity
const result = profanity.detect('This is fucking bullshit and chutiya.');
console.log(ProfanitySeverity[result.severity]); // "SEVERE"

// Custom dictionary
profanity.loadCustomDictionary('pirate', ['barnacle-head', 'landlubber']);
profanity.loadLanguage('pirate');
console.log(profanity.check('You barnacle-head!')); // true

// Placeholder configuration
profanity.setPlaceholder('#');
console.log(profanity.clean('This is bullshit.')); // "This is ########."
profanity.setPlaceholder('*'); // Reset
```





---



FAQ



Q: How do I see all loaded profanities?  
A: For security, the internal word list is not exposed. Use `englishBadWords` etc. for samples.

Q: How do I reset the filter?  
A: Use `clearList()` and reload languages/dictionaries.

Q: Is this safe for browser and Node.js?  
A: Yes. AllProfanity works in both.



---



Middleware Examples



Looking for Express.js/Node.js middleware to use AllProfanity in your API or chat app?  
Check the `examples/` folder for ready-to-copy middleware and integration samples.
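
As a rough idea of what such middleware can look like, here is a minimal sketch. It is not the code from the `examples/` folder: the request and response types are small structural stand-ins so the snippet does not depend on Express, `profanityGuard` is a hypothetical name, and the checker is injected (in an application it could be `(text) => profanity.check(text)`).

```typescript
// Minimal structural types so the sketch does not require Express to be installed.
interface GuardRequest { body?: Record<string, unknown> }
interface GuardResponse {
  status(code: number): GuardResponse;
  json(payload: unknown): void;
}
type Next = () => void;

// Build a middleware that rejects requests whose listed body fields contain profanity.
function profanityGuard(check: (text: string) => boolean, fields: string[]) {
  return (req: GuardRequest, res: GuardResponse, next: Next): void => {
    for (const field of fields) {
      const value = req.body?.[field];
      if (typeof value === "string" && check(value)) {
        // Stop at the first offending field and answer with 400 Bad Request.
        res.status(400).json({ error: `Field "${field}" contains disallowed language.` });
        return;
      }
    }
    next();
  };
}
```

Injecting the checker keeps the middleware easy to test and lets each route pick its own filter configuration.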

---

Roadmap

- 🚧 Multi-language context analysis (Hindi, Spanish, etc.)
- 🚧 Phonetic matching (sounds-like detection)
- 🚧 More language packs (Arabic, Russian, Japanese, etc.)
- 🚧 Machine learning integration for adaptive scoring
- 🚧 Plugin system for custom detection algorithms

---

License

MIT — See LICENSE

---

Contributing

We welcome contributions! Please see our CONTRIBUTORS.md for:

- How to add your name to our contributors list
- Guidelines for adding new languages
- Test requirements (must include passing test screenshots in PRs)
- Code of conduct and PR guidelines

AllProfanity

What's New in v2.2.0

Features

$3

Installation

bash

npm install allprofanity

or

yarn add allprofanity





Generate configuration file (optional):

bash

npx allprofanity

Creates allprofanity.config.json and config.schema.json in your project





---



Quick Start

typescript

import profanity from 'allprofanity';



// Simple check

profanity.check('This is a clean sentence.');        // false

profanity.check('What the f#ck is this?');           // true (leet-speak detected)

profanity.check('यह एक चूतिया परीक्षण है।');           // true (Hindi)

profanity.check('Ye ek chutiya test hai.');          // true (Hinglish Roman script)





---



Algorithm Configuration



AllProfanity v2.2+ offers multiple algorithms optimized for different use cases. You can configure via constructor options or config file.



$3



#### Method 1: Constructor Options (Inline)

typescript

import { AllProfanity } from 'allprofanity';



const filter = new AllProfanity({

  algorithm: { matching: "hybrid" },

  performance: { enableCaching: true }

});





#### Method 2: Config File (Recommended)

bash

Generate config files in your project

npx allprofanity



This creates:

- allprofanity.config.json (main config)

- config.schema.json (for IDE autocomplete)

typescript

import { AllProfanity } from 'allprofanity';

import config from './allprofanity.config.json';



// Load from generated config file

const filter = AllProfanity.fromConfig(config);



// Or directly from object (no file needed)

const filter2 = AllProfanity.fromConfig({

  algorithm: { matching: "hybrid", useContextAnalysis: true },

  performance: { enableCaching: true, cacheSize: 1000 }

});





Example Config File (

allprofanity.config.json

):

json

{

  "algorithm": {

    "matching": "hybrid",

    "useAhoCorasick": true,

    "useBloomFilter": true,

    "useContextAnalysis": true

  },

  "contextAnalysis": {

    "enabled": true,

    "contextWindow": 50,

    "languages": ["en"],

    "scoreThreshold": 0.5

  },

  "profanityDetection": {

    "enableLeetSpeak": true,

    "caseSensitive": false,

    "strictMode": false

  },

  "performance": {

    "enableCaching": true,

    "cacheSize": 1000

  }

}





Config File: Run

npx allprofanity

 to generate config files in your project. The JSON schema provides IDE autocomplete and validation.



---



$3



#### 1. Default (Best for General Use)

typescript

import { AllProfanity } from 'allprofanity';

const filter = new AllProfanity();

// Uses optimized Trie - fast and reliable (~27K ops/sec)





#### 2. Large Text Processing (Documents, Articles)

typescript

const filter = new AllProfanity({

  algorithm: { matching: "aho-corasick" }

});

// 664% faster on 1KB+ texts





#### 3. Reduced False Positives (Social Media, Content Moderation)

typescript

const filter = new AllProfanity({

  algorithm: {

    matching: "hybrid",

    useBloomFilter: true,

    useAhoCorasick: true,

    useContextAnalysis: true

  },

  contextAnalysis: {

    enabled: true,

    contextWindow: 50,

    languages: ["en"],

    scoreThreshold: 0.5

  }

});

// Pattern-based context detection reduces medical/negation false positives





#### 4. Repeated Checks (Chat, Forms, APIs)

typescript

const filter = new AllProfanity({

  performance: {

    enableCaching: true,

    cacheSize: 1000

  }

});

// 123x speedup on cache hits





#### 5. Medical/Professional Content

typescript

const filter = new AllProfanity({

  algorithm: {

    matching: "hybrid",

    useContextAnalysis: true

  },

  contextAnalysis: {

    enabled: true,

    contextWindow: 100,

    scoreThreshold: 0.7  // Higher threshold = less sensitive

  }

});

// Reduces false positives from medical terms using keyword patterns





$3



| Use Case | Algorithm | Speed | Detection | Best For |

|----------|-----------|-------|----------|----------|

| Short texts (<500 chars) | Trie (default) | ~27K ops/sec | Excellent | Chat, comments |

| Large texts (1KB+) | Aho-Corasick | ~9.6K ops/sec | Excellent | Documents, articles |

| Repeated patterns | Any + Caching | 123x faster | Excellent | Forms, validation |

| Content moderation | Hybrid + Context | Moderate | Good (fewer false positives) | Social media, UGC |

| Professional content | Hybrid + Context (strict) | Moderate | Reduced false flags | Medical, academic |



See detailed benchmarks and comparisons →



---



API Reference & Examples



$3



Returns

true

 if the text contains any profanity.

typescript

profanity.check('This is a clean sentence.');  // false

profanity.check('This is a bullshit sentence.'); // true

profanity.check('What the f#ck is this?'); // true (leet-speak)

profanity.check('यह एक चूतिया परीक्षण है।'); // true (Hindi)





---



$3



Returns a detailed result:



-

hasProfanity: boolean

detectedWords: string[]

 (actual matched words)

-

cleanedText: string

 (character-masked)

-

severity: ProfanitySeverity (MILD, MODERATE, SEVERE, EXTREME

)

-

positions: Array<{ word: string, start: number, end: number }>

typescript

const result = profanity.detect('This is fucking bullshit and chutiya.');

console.log(result.hasProfanity); // true

console.log(result.detectedWords); // ['fucking', 'bullshit', 'chutiya']

console.log(result.severity); // 3 (SEVERE)

console.log(result.cleanedText); // "This is   and *."

console.log(result.positions); // e.g. [{word: 'fucking', start: 8, end: 15}, ...]





---



$3



Replace each character of profane words with a placeholder (default:

).

typescript

profanity.clean('This contains bullshit.'); // "This contains **."

profanity.clean('This contains bullshit.', '#'); // "This contains ########."

profanity.clean('यह एक चूतिया परीक्षण है।'); // e.g. "यह एक * परीक्षण है।"





---



$3



Replace each profane word with a single placeholder (default:

).  

(If the placeholder is omitted, uses

.)

typescript

profanity.cleanWithPlaceholder('This contains bullshit.'); // "This contains *."

profanity.cleanWithPlaceholder('This contains bullshit.', '[CENSORED]'); // "This contains [CENSORED]."

profanity.cleanWithPlaceholder('यह एक चूतिया परीक्षण है।', '####'); // e.g. "यह एक #### परीक्षण है।"





---



$3



Add a word or an array of words to the profanity filter.

typescript

profanity.add('badword123');

profanity.check('This is badword123.'); // true



profanity.add(['mierda', 'puta']);

profanity.check('Esto es mierda.'); // true (Spanish)

profanity.check('Qué puta situación.'); // true





---



$3



Remove a word or an array of words from the profanity filter.

typescript

profanity.remove('bullshit');

profanity.check('This is bullshit.'); // false



profanity.remove(['mierda', 'puta']);

profanity.check('Esto es mierda.'); // false





---



$3



Whitelist words so they are never flagged as profane.

typescript

profanity.addToWhitelist(['fuck', 'idiot','shit']);

profanity.check('He is an fucking idiot.'); // false

profanity.check('Fuck this shit.'); // false

// Remove from whitelist to restore detection

profanity.removeFromWhitelist(['fuck', 'idiot','shit']);





---



$3



Remove words from the whitelist so they can be detected again.

typescript

profanity.removeFromWhitelist(['anal']);





---



$3



Set the default placeholder character for

clean()

typescript

profanity.setPlaceholder('#');

profanity.clean('This is bullshit.'); // "This is ########."

profanity.setPlaceholder('*'); // Reset to default





---



$3



Change configuration at runtime.  

Options include:

enableLeetSpeak, caseSensitive, strictMode, detectPartialWords, defaultPlaceholder, languages, whitelistWords

typescript

profanity.updateConfig({ caseSensitive: true, enableLeetSpeak: false });

profanity.check('FUCK'); // false (if caseSensitive)

profanity.updateConfig({ caseSensitive: false, enableLeetSpeak: true });

profanity.check('f#ck'); // true





---



$3



Load a built-in language.

typescript

profanity.loadLanguage('french');

profanity.check('Ce mot est merde.'); // true





---



$3



Load multiple built-in languages at once.

typescript

profanity.loadLanguages(['english', 'french', 'german']);

profanity.check('Das ist scheiße.'); // true (German)





---



$3



Convenience: Load all major Indian language packs.

typescript

profanity.loadIndianLanguages();

profanity.check('यह एक बेंगाली गाली है।'); // true (Bengali)

profanity.check('This is a Tamil profanity: புண்டை'); // true





---



$3



Add your own dictionary as an additional language.

typescript

profanity.loadCustomDictionary('swedish', ['fan', 'jävla', 'skit']);

profanity.loadLanguage('swedish');

profanity.check('Det här är skit.'); // true





---



$3



Returns the names of all currently loaded language packs.

typescript

console.log(profanity.getLoadedLanguages()); // ['english', 'hindi', ...]





---



$3



Returns the names of all available built-in language packs.

typescript

console.log(profanity.getAvailableLanguages());

// ['english', 'hindi', 'french', 'german', 'spanish', 'bengali', 'tamil', 'telugu', 'brazilian']





---



$3



Remove all loaded languages and dynamic words (start with a clean filter).

typescript

profanity.clearList();

profanity.check('fuck'); // false

profanity.loadLanguage('english');

profanity.check('fuck'); // true





---



$3



Get the current configuration.

typescript

console.log(profanity.getConfig());

/*

{

  defaultPlaceholder: '*',

  enableLeetSpeak: true,

  caseSensitive: false,

  strictMode: false,

  detectPartialWords: false,

  languages: [...],

  whitelistWords: [...]

}

*/





---



Configuration File Structure



AllProfanity supports JSON-based configuration for easy setup and deployment. The config file structure supports all algorithm and detection options.



$3

typescript

{

  "algorithm": {

    "matching": "trie" | "aho-corasick" | "hybrid",  // Algorithm selection

    "useAhoCorasick": boolean,                        // Enable Aho-Corasick

    "useBloomFilter": boolean,                        // Enable Bloom Filter

    "useContextAnalysis": boolean                     // Enable context analysis

  },

  "bloomFilter": {

    "enabled": boolean,                               // Enable/disable

    "expectedItems": number,                          // Expected dictionary size (default: 10000)

    "falsePositiveRate": number                       // Acceptable false positive rate (default: 0.01)

  },

  "ahoCorasick": {

    "enabled": boolean,                               // Enable/disable

    "prebuild": boolean                               // Prebuild automaton (default: true)

  },

  "contextAnalysis": {

    "enabled": boolean,                               // Enable/disable pattern-based context detection

    "contextWindow": number,                          // Characters around match to check (default: 50)

    "languages": string[],                            // Languages for keyword patterns (default: ["en"])

    "scoreThreshold": number                          // Detection threshold 0-1 (default: 0.5)

  },

  "profanityDetection": {

    "enableLeetSpeak": boolean,                       // Detect l33t speak (default: true)

    "caseSensitive": boolean,                         // Case sensitive matching (default: false)

    "strictMode": boolean,                            // Require word boundaries (default: false)

    "detectPartialWords": boolean,                    // Detect within words (default: false)

    "defaultPlaceholder": string                      // Default censoring character (default: "*")

  },

  "performance": {

    "enableCaching": boolean,                         // Enable result cache (default: false)

    "cacheSize": number                               // Cache size limit (default: 1000)

  }

}





$3



#### High Performance (Large Texts)

json

{

  "algorithm": { "matching": "aho-corasick" },

  "ahoCorasick": { "enabled": true, "prebuild": true },

  "profanityDetection": { "enableLeetSpeak": true }

}





#### Reduced False Positives (Content Moderation)

json

{

  "algorithm": {

    "matching": "hybrid",

    "useContextAnalysis": true,

    "useBloomFilter": true

  },

  "contextAnalysis": {

    "enabled": true,

    "contextWindow": 50,

    "scoreThreshold": 0.5

  },

  "performance": { "enableCaching": true }

}





#### Balanced (Production)

json

{

  "algorithm": {

    "matching": "hybrid",

    "useAhoCorasick": true,

    "useBloomFilter": true

  },

  "profanityDetection": { "enableLeetSpeak": true },

  "performance": { "enableCaching": true, "cacheSize": 1000 }

}
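Two of the presets above turn on `enableCaching`, and one sets a `cacheSize` limit. The sketch below shows one way a size-bounded result cache can behave, using least-recently-used eviction. It is an illustration under that assumption: `ResultCache` and `cachedCheck` are hypothetical names, and the library's internal cache may use a different eviction policy.

```typescript
// Hypothetical size-bounded cache; not AllProfanity's internal implementation.
class ResultCache<V> {
  private readonly entries = new Map<string, V>();
  private readonly maxSize: number;

  constructor(maxSize: number) {
    this.maxSize = maxSize;
  }

  get(key: string): V | undefined {
    const value = this.entries.get(key);
    if (value === undefined) return undefined;
    // Re-insert so this key becomes the most recently used.
    this.entries.delete(key);
    this.entries.set(key, value);
    return value;
  }

  set(key: string, value: V): void {
    if (this.entries.has(key)) {
      this.entries.delete(key);
    } else if (this.entries.size >= this.maxSize) {
      // Map iterates in insertion order, so the first key is the oldest.
      const oldest = this.entries.keys().next().value;
      if (oldest !== undefined) this.entries.delete(oldest);
    }
    this.entries.set(key, value);
  }

  get size(): number {
    return this.entries.size;
  }
}

// Wraps any checker function so repeated inputs skip the real check.
function cachedCheck(
  cache: ResultCache<boolean>,
  check: (text: string) => boolean,
  text: string
): boolean {
  const hit = cache.get(text);
  if (hit !== undefined) return hit;
  const result = check(text);
  cache.set(text, result);
  return result;
}
```

A cache like this pays off when the same strings recur, as in chat messages or form re-validation. For mostly unique inputs it only adds memory overhead.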





### Using a Config File



Step 1: Generate Config Files

bash

# Run this in your project directory

npx allprofanity



Output:

✅ AllProfanity configuration files created!


Created files:

  📄 allprofanity.config.json - Main configuration

  📄 config.schema.json - JSON schema for IDE autocomplete





Step 2: Load Config in Your Code

typescript

// ES Modules / TypeScript

import { AllProfanity } from 'allprofanity';

import config from './allprofanity.config.json';



const filter = AllProfanity.fromConfig(config);

javascript

// CommonJS (Node.js)

const { AllProfanity } = require('allprofanity');

const config = require('./allprofanity.config.json');



const filter = AllProfanity.fromConfig(config);





Step 3: Customize Config



Edit `allprofanity.config.json` to enable or disable features. Your IDE will provide autocomplete thanks to the JSON schema!
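Hand-edited JSON is easy to get wrong, so it can help to sanity-check a few numeric fields before passing the object to `AllProfanity.fromConfig`. The sketch below covers only fields listed in the configuration structure earlier in this section. `PartialConfig` and `validateConfig` are hypothetical and not part of the library, and the integer and range rules (other than the documented 0-1 range for `scoreThreshold`) are assumptions.

```typescript
// Hypothetical subset of the config shape, transcribed from the documented fields.
interface PartialConfig {
  bloomFilter?: { expectedItems?: number; falsePositiveRate?: number };
  contextAnalysis?: { contextWindow?: number; scoreThreshold?: number };
  performance?: { enableCaching?: boolean; cacheSize?: number };
}

// Returns a list of human-readable problems; an empty list means no issues found.
function validateConfig(config: PartialConfig): string[] {
  const problems: string[] = [];
  const isPositiveInt = (n: number): boolean => Math.floor(n) === n && n > 0;

  const rate = config.bloomFilter?.falsePositiveRate;
  if (rate !== undefined && !(rate > 0 && rate < 1)) {
    problems.push("bloomFilter.falsePositiveRate should be between 0 and 1 (exclusive)");
  }
  const items = config.bloomFilter?.expectedItems;
  if (items !== undefined && !isPositiveInt(items)) {
    problems.push("bloomFilter.expectedItems should be a positive integer");
  }
  const threshold = config.contextAnalysis?.scoreThreshold;
  if (threshold !== undefined && !(threshold >= 0 && threshold <= 1)) {
    problems.push("contextAnalysis.scoreThreshold should be between 0 and 1");
  }
  const windowSize = config.contextAnalysis?.contextWindow;
  if (windowSize !== undefined && !isPositiveInt(windowSize)) {
    problems.push("contextAnalysis.contextWindow should be a positive integer");
  }
  const cacheSize = config.performance?.cacheSize;
  if (cacheSize !== undefined && !isPositiveInt(cacheSize)) {
    problems.push("performance.cacheSize should be a positive integer");
  }
  return problems;
}
```

Running a check like this at startup and logging the returned messages surfaces a typo such as `"scoreThreshold": 5` before it affects moderation results.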



---



Severity Levels



Severity reflects the number and variety of detected profanities:



| Level    | Enum Value | Description                         |
|----------|------------|-------------------------------------|
| MILD     | 1          | 1 unique or total profane word      |
| MODERATE | 2          | 2 unique or total profane words     |
| SEVERE   | 3          | 3 unique or total profane words     |
| EXTREME  | 4          | 4+ unique or 5+ total profane words |
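One way to read the table as code is shown below. This is an interpretation, not the library's implementation: the table does not say what happens with exactly 4 total matches from fewer than 4 unique words, so the sketch assumes that case stays at SEVERE. `Severity` and `severityFor` are hypothetical names; only the level names and numeric values come from the table.

```typescript
// Level names and values from the table above; the object itself is illustrative.
const Severity = { MILD: 1, MODERATE: 2, SEVERE: 3, EXTREME: 4 } as const;
type SeverityValue = (typeof Severity)[keyof typeof Severity];

// Maps match counts to a level, checking the highest level first.
// Returns null when nothing was detected.
function severityFor(uniqueWords: number, totalWords: number): SeverityValue | null {
  if (totalWords <= 0) return null;
  if (uniqueWords >= 4 || totalWords >= 5) return Severity.EXTREME;
  if (uniqueWords >= 3 || totalWords >= 3) return Severity.SEVERE;
  if (uniqueWords >= 2 || totalWords >= 2) return Severity.MODERATE;
  return Severity.MILD;
}
```

Checking from EXTREME downward means a single word repeated five times still reaches the top level, which matches the "5+ total" rule in the last row.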



---



Language Support



- Built-in: English, Hindi, French, German, Spanish, Bengali, Tamil, Telugu, Brazilian Portuguese

- Scripts: Latin/Roman, Devanagari, Tamil, Telugu, Bengali, etc.

- Mixed Content: Handles mixed-language and code-switched sentences.

typescript

profanity.check('This is bullshit and चूतिया.'); // true (mixed English/Hindi)

profanity.check('Ce mot est merde and पागल.');   // true (French/Hindi)

profanity.check('Isso é uma merda.');             // true (Brazilian Portuguese)
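Mixed-script input like the examples above is why a filter cannot simply pick one dictionary per message. As a rough illustration, the sketch below reports which of the listed scripts appear in a string by comparing character codes against Unicode block ranges. `detectScripts` is hypothetical and is not how AllProfanity itself selects languages.

```typescript
// Unicode block ranges (inclusive). The extended Latin range is approximate:
// it also contains a few non-letter symbols, which is acceptable for a sketch.
const SCRIPT_RANGES: Array<[string, number, number]> = [
  ["Latin", 0x0041, 0x005a], // A-Z
  ["Latin", 0x0061, 0x007a], // a-z
  ["Latin", 0x00c0, 0x024f], // accented and extended Latin letters
  ["Devanagari", 0x0900, 0x097f],
  ["Bengali", 0x0980, 0x09ff],
  ["Tamil", 0x0b80, 0x0bff],
  ["Telugu", 0x0c00, 0x0c7f],
];

// Returns the script names found in the text, in order of first appearance.
function detectScripts(text: string): string[] {
  const found: string[] = [];
  for (let i = 0; i < text.length; i++) {
    const code = text.charCodeAt(i);
    for (const [name, lo, hi] of SCRIPT_RANGES) {
      if (code >= lo && code <= hi && found.indexOf(name) === -1) {
        found.push(name);
      }
    }
  }
  return found;
}

console.log(detectScripts("Ce mot est merde and पागल."));
```

A result with more than one script is a hint that several language dictionaries should be loaded for the input.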





---



Use Exported Wordlists



To get sample words for a language (for UIs, admin tools, etc.):

typescript

import { englishBadWords, hindiBadWords } from 'allprofanity';

console.log(englishBadWords.slice(0, 5)); // ["fuck", "shit", ...]





---



Security



- No wordlist exposure: There is no `.list()` function, for security and encapsulation. Use exported word arrays for samples.

- Trie-based: Scales easily to 50,000+ words.

- Handles leet-speak: Catches obfuscated variants like `f#ck` and `a55hole`.
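To make the trie and leet-speak points concrete, here is a minimal sketch of a trie lookup that normalizes common character substitutions before matching whole words. It is not AllProfanity's implementation: the substitution table, `TrieNode`, `insertWord`, and `containsWord` are all hypothetical, and the sample word is borrowed from the pirate dictionary in the Full Example.

```typescript
// Hypothetical substitution table; the library's real mapping is not shown here.
const LEET: { [ch: string]: string } = {
  "0": "o", "1": "i", "3": "e", "4": "a", "5": "s", "7": "t", "@": "a", "$": "s",
};

interface TrieNode {
  children: { [ch: string]: TrieNode | undefined };
  terminal: boolean; // true when a dictionary word ends at this node
}

function newNode(): TrieNode {
  return { children: {}, terminal: false };
}

function insertWord(root: TrieNode, word: string): void {
  let node = root;
  for (const ch of word.toLowerCase().split("")) {
    let next = node.children[ch];
    if (!next) {
      next = newNode();
      node.children[ch] = next;
    }
    node = next;
  }
  node.terminal = true;
}

// Lowercases the text and undoes the substitutions in LEET.
function normalize(text: string): string {
  return text.toLowerCase().split("").map((ch) => LEET[ch] || ch).join("");
}

function isWordChar(ch: string): boolean {
  return /[a-z0-9]/.test(ch);
}

// True when any dictionary word occurs as a whole word in the text.
function containsWord(root: TrieNode, text: string): boolean {
  const s = normalize(text);
  for (let i = 0; i < s.length; i++) {
    if (i > 0 && isWordChar(s.charAt(i - 1))) continue; // start at a word boundary
    let node: TrieNode | undefined = root;
    for (let j = i; j < s.length; j++) {
      node = node.children[s.charAt(j)];
      if (!node) break;
      const atEnd = j + 1 === s.length || !isWordChar(s.charAt(j + 1));
      if (node.terminal && atEnd) return true;
    }
  }
  return false;
}

const root = newNode();
insertWord(root, "landlubber");
console.log(containsWord(root, "You l4ndlubb3r!")); // true
```

Lookup cost depends on the length of the text and the matched prefixes, not on the number of dictionary words, which is why a trie keeps scaling as the word list grows.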



---



Full Example

typescript

import profanity, { ProfanitySeverity } from 'allprofanity';



// Multi-language detection

profanity.loadLanguages(['english', 'french', 'tamil']);

console.log(profanity.check('Ce mot est merde.')); // true



// Leet-speak detection

console.log(profanity.check('You a f#cking a55hole!')); // true



// Whitelisting

profanity.addToWhitelist(['anal', 'ass']);

console.log(profanity.check('He is an associate professor.')); // false



// Severity

const result = profanity.detect('This is fucking bullshit and chutiya.');

console.log(ProfanitySeverity[result.severity]); // "SEVERE"



// Custom dictionary

profanity.loadCustomDictionary('pirate', ['barnacle-head', 'landlubber']);

profanity.loadLanguage('pirate');

console.log(profanity.check('You barnacle-head!')); // true



// Placeholder configuration

profanity.setPlaceholder('#');

console.log(profanity.clean('This is bullshit.')); // "This is ########."

profanity.setPlaceholder('*'); // Reset





---



FAQ



Q: How do I see all loaded profanities?  

A: For security, the internal word list is not exposed. Use `englishBadWords` and the other exported arrays for samples.



Q: How do I reset the filter?  

A: Use `clearList()` and reload languages/dictionaries.



Q: Is this safe for browser and Node.js?  

A: Yes! AllProfanity is universal.



---



Middleware Examples



Looking for Express.js/Node.js middleware to use AllProfanity in your API or chat app?  

Check the `examples/` folder for ready-to-copy middleware and integration samples.
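For orientation, an Express-style middleware can be as small as the sketch below. It is not taken from the `examples/` folder: `rejectProfanity` is a hypothetical helper, the request and response types are minimal stand-ins so the snippet compiles without Express installed, and the checker is injected so any function with the shape of `profanity.check` can be used.

```typescript
// Minimal structural types standing in for Express's Request and Response.
interface Req {
  body?: { [key: string]: unknown };
}
interface Res {
  status(code: number): Res;
  json(payload: unknown): void;
}
type Next = () => void;

// Builds a middleware that rejects requests whose `field` fails the check.
function rejectProfanity(check: (text: string) => boolean, field: string) {
  return (req: Req, res: Res, next: Next): void => {
    const value = req.body ? req.body[field] : undefined;
    if (typeof value === "string" && check(value)) {
      res.status(400).json({ error: "Message contains inappropriate language." });
      return;
    }
    next();
  };
}
```

In an Express app this would be wired up roughly as `app.post('/messages', rejectProfanity((t) => profanity.check(t), 'message'), handler)`, assuming a JSON body parser runs first.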

---

Roadmap

License

MIT — See LICENSE

---