@codeslick/ai-detection

AI-generated code detection library with comprehensive signals for identifying AI-generated code patterns.

Version 1.1.0 - Complete Detection System

This release provides a complete AI code detection system with 45 detectors + scoring logic.

See IMPLEMENTATION_STATUS.md for implementation details.

Features

$3

- 13 Code Smell Heuristics: Structural patterns that indicate AI-generated code
- 8 original heuristics (over-engineering, wrappers, comments, etc.)
- 5 "perfect code" patterns (AI code is "too perfect")
- 32 LLM Fingerprints: Behavioral patterns specific to GPT-4, GitHub Copilot, Claude Code, and Cursor
- GPT-4: 8 patterns (verbose docstrings, defensive null checks, etc.)
- GitHub Copilot: 7 patterns (boilerplate comments, generic names, etc.)
- Claude Code: 5 patterns (explanatory comments, custom errors, etc.)
- Cursor: 8 patterns (AI markers, diff comments, artifacts, etc.)
- Scoring System: Confidence levels (HIGH/MEDIUM/LOW) and severity mapping
- Combines hallucinations (60%), heuristics (25%), LLM fingerprints (15%)

$3

- 119 Hallucination Patterns: Language-specific API misuse patterns are maintained in CodeSlick analyzers (not this package)
- This package provides reusable detection primitives for both CodeSlick and Endure

Installation

``bash npm install @codeslick/ai-detection`

`Usage`

`$3`

`typescript import { calculateHeuristicScores, detectOverEngineeredErrorHandling, detectZeroEdgeCases, isTestFile, type HeuristicScores } from '@codeslick/ai-detection';

// Read your code file const code =
function processData(data) {
result = data.append(value); // Python-style method (hallucination)
return result;
}
;

const lines = code.split('\n');

// Calculate heuristic scores (0.0-1.0) const scores: HeuristicScores = calculateHeuristicScores(lines);

console.log('AI Detection Scores:', scores); // Output: { overEngineeredErrors: 0.2, unnecessaryWrappers: 0.0, ... }`

`$3`

`typescript import { detectVerboseDocstrings, detectBoilerplateComments, detectDetailedExplanatoryComments, detectAICommandMarkers, type LLMFingerprintScores } from '@codeslick/ai-detection';

const code =
/**
* This function processes the data by iterating through each element
* and applying the transformation. It returns the result.
* @param data - The input data
* @returns The processed result
*/
function processData(data) {
return data; // One-line function with 5+ line docstring = GPT-4 fingerprint
}

// TODO: implement error handling
// your code here
;

const lines = code.split('\n');

// Detect LLM fingerprints const gpt4Verbose = detectVerboseDocstrings(lines); const copilotBoilerplate = detectBoilerplateComments(lines);

console.log('GPT-4 Verbose Docstrings:', gpt4Verbose); // 1.0 (detected) console.log('Copilot Boilerplate:', copilotBoilerplate); // 0.66 (2/3 threshold)`

`$3`

`typescript import { calculateHeuristicScores, calculateAICodeConfidence, detectVerboseDocstrings, detectBoilerplateComments, // ... import other LLM fingerprints as needed type DetectionResult, type HeuristicScores, type LLMFingerprintScores } from '@codeslick/ai-detection';

const code = / your AI-generated code here /; const lines = code.split('\n');

// 1. Calculate heuristic scores const heuristicScores: HeuristicScores = calculateHeuristicScores(lines);

// 2. Calculate LLM fingerprint scores (optional) const llmScores: Partial = { // GPT-4 fingerprints verboseDocstrings: detectVerboseDocstrings(lines), // GitHub Copilot fingerprints boilerplateComments: detectBoilerplateComments(lines), // Claude Code fingerprints detailedExplanatoryComments: detectDetailedExplanatoryComments(lines), // Cursor fingerprints aiCommandMarkers: detectAICommandMarkers(lines), // ... add more as needed };

// 3. Calculate overall confidence // hallucinationCount would come from language-specific analyzers (not this package) const hallucinationCount = 0; // Example: detected in your analyzer

const result: DetectionResult | null = calculateAICodeConfidence( hallucinationCount, heuristicScores, llmScores );

if (result) { console.log('AI Code Detected!'); console.log('Confidence:', result.confidence); // HIGH, MEDIUM, or LOW console.log('Severity:', result.severity); // CRITICAL, HIGH, or MEDIUM console.log('Hallucinations:', result.hallucinationPatterns); console.log('Combined Score:', result.heuristicScore); }`

`Available Detectors`

`$3`

- detectOverEngineeredErrorHandling(lines)- 4+ nested if/else in catch blocks -detectUnnecessaryWrappers(lines)- Single-line function wrappers -detectVerboseComments(lines)- Comments that duplicate code (>70% overlap) -detectMixedNamingConventions(lines)- camelCase + snake_case in same window -detectRedundantNullChecks(lines)- Multiple null checks on same variable -detectUnnecessaryAsync(lines)- async functions without await -detectGenericVariableOveruse(lines)- Generic names (data, result, temp) overused -detectInconsistentStringConcatenation(lines) - Mixed concatenation methods

`$3`

- detectZeroEdgeCases(lines)- Functions with no error handling (AI assumes happy path) -detectUniformIndentation(lines)- Perfectly aligned code (humans are messier) -detectTextbookVariableNames(lines)- Generic tutorial-style names -detectNoCommentsWithPerfectStructure(lines)- Clean code with zero comments -detectExcessiveParameterValidation(lines) - Validation in all functions (even private)

`$3`

- isTestFile(filename?)- Checks if file is a test file -removeCommentsAndStrings(line, language) - Preprocesses code to avoid false positives

`API Reference`

`$3`

Runs all 13 heuristic detectors and returns normalized scores (0.0-1.0).

Parameters: -lines: Array of code lines

Returns: -HeuristicScores object with all 13 heuristic scores

`$3`

Checks if a filename indicates a test file.

Supported patterns: -.test., .spec., __tests__/-Test.java, _test.py, _test.go-test_*.py

`Weights and Scoring`

Heuristics are weighted to sum to 1.0:

`typescript const HEURISTIC_WEIGHTS = { overEngineeredErrors: 0.10, unnecessaryWrappers: 0.08, verboseComments: 0.07, mixedNaming: 0.09, redundantNullChecks: 0.10, unnecessaryAsync: 0.08, genericVariables: 0.07, inconsistentStrings: 0.09, // Perfect code heuristics zeroEdgeCases: 0.08, uniformIndentation: 0.07, textbookVariableNames: 0.07, noCommentsWithPerfectStructure: 0.05, excessiveParameterValidation: 0.05, };`

`TypeScript Support`

Full TypeScript support with exported types:

`typescript import type { DetectionResult, HeuristicScores, LLMFingerprintScores } from '@codeslick/ai-detection';`

`License`

MIT © Vitor Lourenco

`Links`

- GitHub: codeslick/packages/ai-detection - Issues: GitHub Issues - CodeSlick: https://codeslick.dev

`Related Projects`

- CodeSlick: Security-first code analysis platform (uses this library) - Endure: Architecture archaeology tool (will use this library)

`Development`

`bash

`Install dependencies`


npm install
Build

npm run build
Run tests

npm test
Type check

npm run type-check

@codeslick/ai-detection

AI-generated code detection library with comprehensive signals for identifying AI-generated code patterns.

Version 1.1.0 - Complete Detection System

This release provides a complete AI code detection system with 45 detectors + scoring logic.

See IMPLEMENTATION_STATUS.md for implementation details.

Features

$3

Installation

``bash npm install @codeslick/ai-detection`

`Usage`

`$3`

`typescript import { calculateHeuristicScores, detectOverEngineeredErrorHandling, detectZeroEdgeCases, isTestFile, type HeuristicScores } from '@codeslick/ai-detection';

// Read your code file const code =
function processData(data) {
result = data.append(value); // Python-style method (hallucination)
return result;
}
;

const lines = code.split('\n');

// Calculate heuristic scores (0.0-1.0) const scores: HeuristicScores = calculateHeuristicScores(lines);

console.log('AI Detection Scores:', scores); // Output: { overEngineeredErrors: 0.2, unnecessaryWrappers: 0.0, ... }`

`$3`

`typescript import { detectVerboseDocstrings, detectBoilerplateComments, detectDetailedExplanatoryComments, detectAICommandMarkers, type LLMFingerprintScores } from '@codeslick/ai-detection';

// TODO: implement error handling
// your code here
;

const lines = code.split('\n');

// Detect LLM fingerprints const gpt4Verbose = detectVerboseDocstrings(lines); const copilotBoilerplate = detectBoilerplateComments(lines);

console.log('GPT-4 Verbose Docstrings:', gpt4Verbose); // 1.0 (detected) console.log('Copilot Boilerplate:', copilotBoilerplate); // 0.66 (2/3 threshold)`

`$3`

const code = / your AI-generated code here /; const lines = code.split('\n');

// 1. Calculate heuristic scores const heuristicScores: HeuristicScores = calculateHeuristicScores(lines);

// 3. Calculate overall confidence // hallucinationCount would come from language-specific analyzers (not this package) const hallucinationCount = 0; // Example: detected in your analyzer

const result: DetectionResult | null = calculateAICodeConfidence( hallucinationCount, heuristicScores, llmScores );

`Available Detectors`

`$3`

- isTestFile(filename?)- Checks if file is a test file -removeCommentsAndStrings(line, language) - Preprocesses code to avoid false positives

`API Reference`

`$3`

Runs all 13 heuristic detectors and returns normalized scores (0.0-1.0).

Parameters: -lines: Array of code lines

Returns: -HeuristicScores object with all 13 heuristic scores

`$3`

Checks if a filename indicates a test file.

Supported patterns: -.test., .spec., __tests__/-Test.java, _test.py, _test.go-test_*.py

`Weights and Scoring`

Heuristics are weighted to sum to 1.0:

`TypeScript Support`

Full TypeScript support with exported types:

`typescript import type { DetectionResult, HeuristicScores, LLMFingerprintScores } from '@codeslick/ai-detection';`

`License`

`Links`

- GitHub: codeslick/packages/ai-detection - Issues: GitHub Issues - CodeSlick: https://codeslick.dev

`Related Projects`

- CodeSlick: Security-first code analysis platform (uses this library) - Endure: Architecture archaeology tool (will use this library)

`Development`

`bash

`Install dependencies`


npm install
Build

npm run build
Run tests

npm test
Type check

npm run type-check