@bear-block/vision-camera-ocr


A high-performance React Native Vision Camera plugin for real-time OCR (Optical Character Recognition)

Features • Installation • Usage • API Reference • Examples • Contributing

---

🚀 Overview

@bear-block/vision-camera-ocr is a powerful React Native library that provides real-time text recognition capabilities directly within your camera app. Built on top of react-native-vision-camera, it leverages native OCR engines for optimal performance:

- Android: Powered by Google ML Kit Text Recognition
- iOS: Powered by Apple's Vision Framework

Perfect for applications requiring real-time text extraction, document scanning, business card readers, or any OCR functionality.

✨ Features

- 🔥 Real-time Processing - Instant text recognition from camera frames
- 📱 Cross-platform - Native implementation for both Android & iOS
- 🚀 High Performance - Optimized native APIs with minimal overhead
- 🌐 Offline First - No internet connection required, all processing on-device
- 🎯 Easy Integration - Simple API that works seamlessly with Vision Camera
- 📊 Configurable - Support for different recognition models and options
- 🛡️ Production Ready - Built with TypeScript and comprehensive error handling

📦 Installation

### Prerequisites

- React Native 0.79+
- react-native-vision-camera >= 3.0
- react-native-worklets-core ^1.5.0

### Install

```bash
# Using yarn (recommended)
yarn add @bear-block/vision-camera-ocr

# Using npm
npm install @bear-block/vision-camera-ocr

# Using pnpm
pnpm add @bear-block/vision-camera-ocr
```

### iOS Setup

```bash
cd ios && pod install
```

### Android Setup

No additional setup is required; the package is auto-linked.

🎯 Quick Start

### Basic Usage

```typescript
import { StyleSheet } from 'react-native';
import { Camera, useCameraDevice, useFrameProcessor } from 'react-native-vision-camera';
import { performOcr } from '@bear-block/vision-camera-ocr';

function MyCameraComponent() {
  // Pick the back camera; this is null until a device is available
  const device = useCameraDevice('back');

  const frameProcessor = useFrameProcessor((frame) => {
    'worklet';
    const result = performOcr(frame);
    if (result?.text) {
      console.log('Detected text:', result.text);
    }
  }, []);

  if (device == null) return null;

  return (
    <Camera
      style={StyleSheet.absoluteFill}
      device={device}
      isActive={true}
      frameProcessor={frameProcessor}
      frameProcessorFps={5}
    />
  );
}
```

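### Advanced Usage

```typescript
import { useMemo, useState } from 'react';
import { StyleSheet, Text, View } from 'react-native';
import { Camera, useCameraDevice, useFrameProcessor } from 'react-native-vision-camera';
import { Worklets } from 'react-native-worklets-core';
import { performOcr } from '@bear-block/vision-camera-ocr';

function AdvancedCameraComponent() {
  const device = useCameraDevice('back');
  const [detectedText, setDetectedText] = useState('');
  const [isProcessing, setIsProcessing] = useState(false);

  // React state setters must run on the JS thread, so wrap them
  // with createRunOnJS before calling them from the worklet.
  const handleTextDetected = useMemo(
    () =>
      Worklets.createRunOnJS((text: string) => {
        // Handle the detected text in your app (or forward it to your state management)
        console.log('New text detected:', text);
        setDetectedText(text);
      }),
    []
  );
  const reportProcessing = useMemo(() => Worklets.createRunOnJS(setIsProcessing), []);

  const frameProcessor = useFrameProcessor(
    (frame) => {
      'worklet';
      try {
        reportProcessing(true);
        const result = performOcr(frame);
        if (result?.text) {
          handleTextDetected(result.text);
        }
      } catch (error) {
        console.error('OCR processing error:', error);
      } finally {
        reportProcessing(false);
      }
    },
    [handleTextDetected, reportProcessing]
  );

  if (device == null) return null;

  return (
    <View style={styles.container}>
      <Camera
        style={StyleSheet.absoluteFill}
        device={device}
        isActive={true}
        frameProcessor={frameProcessor}
        frameProcessorFps={3} // Lower FPS for better performance
      />

      {detectedText !== '' && (
        <View style={styles.textOverlay}>
          <Text style={styles.text}>{detectedText}</Text>
        </View>
      )}

      {isProcessing && (
        <View style={styles.processingIndicator}>
          <Text style={styles.text}>Processing...</Text>
        </View>
      )}
    </View>
  );
}

const styles = StyleSheet.create({
  container: {
    flex: 1,
  },
  textOverlay: {
    position: 'absolute',
    bottom: 100,
    left: 20,
    right: 20,
    backgroundColor: 'rgba(0,0,0,0.7)',
    padding: 15,
    borderRadius: 10,
  },
  text: {
    color: 'white',
    fontSize: 16,
    textAlign: 'center',
  },
  processingIndicator: {
    position: 'absolute',
    top: 50,
    alignSelf: 'center',
    backgroundColor: 'rgba(0,0,0,0.7)',
    padding: 10,
    borderRadius: 20,
  },
});
```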

📚 API Reference

### performOcr(frame, options?)

Performs OCR on a camera frame and returns the recognized text, optionally with structure (blocks, lines, words). Returns `null` when no text is detected.

#### Parameters

- `frame` (`Frame`): The camera frame to process, from `react-native-vision-camera`
- `options` (optional):
  - `includeBoxes?: boolean`: include normalized bounding boxes for blocks/lines/words
  - `includeConfidence?: boolean`: include confidence scores when available (iOS lines)
  - `recognitionLevel?: 'fast' | 'accurate'` (iOS): control Vision request speed/accuracy
  - `recognitionLanguages?: string[]` (iOS): language hints, e.g. `["en-US", "vi-VN"]`
  - `usesLanguageCorrection?: boolean` (iOS): enable language correction

#### Returns

- `OcrResult | null`:
  - `text: string`: concatenated recognized text
  - `blocks?: OcrBlock[]`: present when `includeBoxes` is true
  - `OcrBlock`: `{ text: string, box?: OcrBox, lines?: OcrLine[] }`
  - `OcrLine`: `{ text: string, box?: OcrBox, words?: OcrWord[], confidence?: number }`
  - `OcrWord`: `{ text: string, box?: OcrBox, confidence?: number }`
  - `OcrBox`: `{ x: number, y: number, width: number, height: number }` (normalized 0..1 on iOS; absolute px on Android for now)
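
The result shape listed above can be written down as types. The following is a minimal sketch: the interfaces are local declarations transcribed from the field list, not the library's exported types (check the package's own typings before relying on them), and `collectLines` is a hypothetical helper added for illustration.

```typescript
// Local declarations mirroring the documented result shape
// (assumed for illustration, not imported from the library).
interface OcrBox { x: number; y: number; width: number; height: number }
interface OcrWord { text: string; box?: OcrBox; confidence?: number }
interface OcrLine { text: string; box?: OcrBox; words?: OcrWord[]; confidence?: number }
interface OcrBlock { text: string; box?: OcrBox; lines?: OcrLine[] }
interface OcrResult { text: string; blocks?: OcrBlock[] }

// Hypothetical helper: flatten all lines from all blocks, in order.
// Returns an empty array for a null result or a result without blocks.
function collectLines(result: OcrResult | null): OcrLine[] {
  const lines: OcrLine[] = [];
  for (const block of result?.blocks ?? []) {
    for (const line of block.lines ?? []) {
      lines.push(line);
    }
  }
  return lines;
}

// Hand-written sample data, not real OCR output.
const sampleResult: OcrResult = {
  text: 'Hello\nWorld',
  blocks: [
    { text: 'Hello', lines: [{ text: 'Hello' }] },
    { text: 'World', lines: [{ text: 'World', confidence: 0.9 }] },
  ],
};

const lineTexts = collectLines(sampleResult).map((line) => line.text);
```

Because `blocks` and `lines` are both optional, the helper falls back to empty arrays rather than assuming structure is present.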

#### Example

```typescript
const result = performOcr(frame, {
  includeBoxes: true,
  includeConfidence: true,
  recognitionLevel: 'accurate', // iOS
  recognitionLanguages: ['en-US', 'vi-VN'], // iOS
  usesLanguageCorrection: true, // iOS
});

if (result) {
  console.log('Detected text:', result.text);
  const firstLine = result.blocks?.[0]?.lines?.[0];
  if (firstLine?.box) {
    // Use normalized box on iOS (0..1). Android currently returns pixel units.
    console.log('First line box:', firstLine.box);
  }
} else {
  console.log('No text detected');
}
```
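
When `includeConfidence` is enabled, one possible follow-up is to drop low-confidence lines before showing them. This is a sketch only: `LineLike` is a local stand-in for the line shape described under Returns, `filterByConfidence` is a hypothetical helper, and the 0.8 threshold is an arbitrary illustrative value.

```typescript
// Minimal local type for this sketch (assumed shape, see the Returns section).
interface LineLike {
  text: string;
  confidence?: number;
}

// Hypothetical helper: keep lines whose confidence is at least `minConfidence`.
// Lines without a confidence value are kept, because confidence is optional
// and may not be reported on every platform.
function filterByConfidence(lines: LineLike[], minConfidence: number): LineLike[] {
  return lines.filter(
    (line) => line.confidence === undefined || line.confidence >= minConfidence
  );
}

// Hand-written sample data, not real OCR output.
const sampleLines: LineLike[] = [
  { text: 'INVOICE', confidence: 0.98 },
  { text: 'lnv0ice', confidence: 0.41 },
  { text: 'Total' }, // no confidence reported
];

const keptTexts = filterByConfidence(sampleLines, 0.8).map((line) => line.text);
```

Keeping lines that lack a score is a design choice; an app that needs strict filtering could treat a missing score as zero instead.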

🔧 Configuration

Pass options to `performOcr` at call time (see the API Reference above). Initialization options are currently limited and not required.

> Note: Android uses ML Kit Latin text recognizer by default. iOS options map to Apple's Vision.
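
Since several options apply to iOS only, an app may prefer to build the options object per platform. The sketch below assumes the option names listed in the API Reference; `OcrOptions` is a local declaration, `buildOcrOptions` is a hypothetical helper, and the platform string would come from something like React Native's `Platform.OS` in a real app. Whether iOS-only options are ignored on Android is not stated here, so this sketch simply omits them there.

```typescript
// Local declaration of the documented options (assumed, not imported from the library).
interface OcrOptions {
  includeBoxes?: boolean;
  includeConfidence?: boolean;
  recognitionLevel?: 'fast' | 'accurate';
  recognitionLanguages?: string[];
  usesLanguageCorrection?: boolean;
}

// Hypothetical helper: shared options everywhere, iOS-only options on iOS.
function buildOcrOptions(platform: 'ios' | 'android', languages: string[]): OcrOptions {
  const shared: OcrOptions = { includeBoxes: true };
  if (platform !== 'ios') {
    return shared;
  }
  return {
    ...shared,
    includeConfidence: true,
    recognitionLevel: 'accurate',
    recognitionLanguages: languages,
    usesLanguageCorrection: true,
  };
}

const androidOptions = buildOcrOptions('android', ['en-US', 'vi-VN']);
const iosOptions = buildOcrOptions('ios', ['en-US', 'vi-VN']);
```

Building the object once outside the frame processor, rather than on every frame, avoids repeated allocations in the worklet.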

📱 Platform-Specific Details

### Android

- Uses Google ML Kit Text Recognition
- Optimized for Latin script languages
- Automatic language detection
- Fast processing with minimal memory usage
- Bounding boxes returned in pixel units (subject to change to normalized in future)

### iOS

- Uses Apple's Vision Framework
- Native integration with iOS camera system
- Support for multiple text recognition languages
- Optimized for iOS performance characteristics
- Supports `recognitionLevel`, `recognitionLanguages`, `usesLanguageCorrection`
- Bounding boxes returned normalized (0..1); the origin of the returned box structure is top-left
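
Because box units differ between platforms, overlay code may want a single coordinate space. The following sketch converts Android pixel boxes to the 0..1 range. It assumes the pixel coordinates are relative to the processed frame's width and height in the same orientation, which is not confirmed above, and `toNormalizedBox` is a hypothetical helper.

```typescript
// Local box type matching the documented fields (assumed for illustration).
interface Box {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Hypothetical helper: return a box in normalized 0..1 units on both platforms.
// iOS boxes are documented as already normalized, so they pass through unchanged.
function toNormalizedBox(
  box: Box,
  platform: 'ios' | 'android',
  frameWidth: number,
  frameHeight: number
): Box {
  if (platform === 'ios') {
    return box;
  }
  if (frameWidth <= 0 || frameHeight <= 0) {
    throw new RangeError('Frame dimensions must be positive');
  }
  return {
    x: box.x / frameWidth,
    y: box.y / frameHeight,
    width: box.width / frameWidth,
    height: box.height / frameHeight,
  };
}

// Example: a 512x256 px box at (256, 128) inside a 1024x512 frame.
const normalized = toNormalizedBox(
  { x: 256, y: 128, width: 512, height: 256 },
  'android',
  1024,
  512
);
```

If Android boxes become normalized in a future release, as the note above suggests may happen, this conversion would need to be removed.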

🎨 Use Cases

This library is perfect for:

- Document Scanners - Convert paper documents to digital text
- Business Card Readers - Extract contact information from business cards
- Receipt Scanners - Automate expense tracking and receipt processing
- Text Translation Apps - Real-time text recognition for translation
- Accessibility Tools - Help visually impaired users read text
- Form Processing - Automate data entry from paper forms
- License Plate Recognition - Vehicle identification systems
- Product Label Scanners - Extract information from product packaging

🚀 Performance Tips

- Frame Rate: Use a `frameProcessorFps` of 3 to 5 for optimal performance
- Error Handling: Always wrap OCR calls in try-catch blocks
- State Management: Debounce text updates to avoid excessive re-renders
- Memory: Process frames efficiently and avoid storing large amounts of data
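
One way to limit re-renders from text updates is a small gate that suppresses duplicates and enforces a minimum interval. Note that this is throttling with duplicate suppression, a simpler stand-in for a true debounce. `createTextGate` is a hypothetical helper, and the timestamp is passed in explicitly (for example from `Date.now()`) so the logic stays easy to test.

```typescript
// Hypothetical helper: returns a function that decides whether a new text
// value should be forwarded to React state.
function createTextGate(minIntervalMs: number): (text: string, nowMs: number) => boolean {
  let lastText: string | null = null;
  let lastEmitMs = -Infinity;

  return (text: string, nowMs: number): boolean => {
    // Skip identical text: re-rendering would show nothing new.
    if (text === lastText) {
      return false;
    }
    // Skip updates that arrive too soon after the previous accepted one.
    if (nowMs - lastEmitMs < minIntervalMs) {
      return false;
    }
    lastText = text;
    lastEmitMs = nowMs;
    return true;
  };
}

// Example with a 500 ms minimum interval and hand-picked timestamps.
const shouldUpdate = createTextGate(500);
const decisions = [
  shouldUpdate('Hello', 0), // first value is accepted
  shouldUpdate('Hello', 1000), // duplicate is rejected
  shouldUpdate('World', 1200), // new text, enough time passed
  shouldUpdate('Again', 1300), // too soon after the last accepted update
  shouldUpdate('Again', 1800), // accepted once the interval has elapsed
];
```

In a component, the gate would be called on the JS thread just before the state setter, for example `if (shouldUpdate(text, Date.now())) setDetectedText(text)`.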

🔧 Troubleshooting

### Android

#### Plugin Not Found / "Failed to load Frame Processor Plugin"

Problem: The frame processor plugin isn't being registered properly.

Solutions:

1. Clean and rebuild:

   ```bash
   cd android
   ./gradlew clean
   cd ..
   # Then rebuild your app
   ```

2. Verify auto-linking:

   - Check that `react-native.config.js` exists in your project root
   - Ensure the package is listed in `package.json` dependencies
   - Run `npx react-native config` to verify the package is detected

3. Manual linking (if auto-linking fails):

   - Add to `android/settings.gradle`:

     ```gradle
     include ':vision-camera-ocr'
     project(':vision-camera-ocr').projectDir = new File(rootProject.projectDir, '../node_modules/@bear-block/vision-camera-ocr/android')
     ```

   - Add to `android/app/build.gradle` dependencies:

     ```gradle
     implementation project(':vision-camera-ocr')
     ```

4. Check React Native version:

   - Ensure you're using React Native 0.79+ (check `package.json`)
   - Verify `react-native-vision-camera` >= 3.0 is installed
   - Verify `react-native-worklets-core` ^1.5.0 is installed

5. Verify ML Kit dependency:

   - The library uses Google ML Kit Text Recognition
   - Ensure your `android/build.gradle` has the Google Maven repository:

     ```gradle
     repositories {
       google()
       mavenCentral()
     }
     ```

#### Camera Permission Issues

Problem: Camera permission is denied or not requested.

Solutions:

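1. Add to `AndroidManifest.xml`:

   ```xml
   <uses-permission android:name="android.permission.CAMERA" />
   ```

2. Request permission at runtime (React Native 0.79+):

   ```typescript
   import { PermissionsAndroid } from 'react-native';

   // Call this from an async function, e.g. before mounting the camera screen
   const granted = await PermissionsAndroid.request(
     PermissionsAndroid.PERMISSIONS.CAMERA
   );
   ```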

#### Build Errors

Problem: Gradle build fails with dependency or compilation errors.

Solutions:

1. Update Gradle:

   - Ensure Android Gradle Plugin 8.7.2+ is used
   - Check `android/build.gradle` for correct versions

2. Clean build:

   ```bash
   cd android
   ./gradlew clean
   rm -rf .gradle
   cd ..
   ```

3. Check minSdkVersion:

   - Library requires minSdkVersion 24
   - Verify in `android/build.gradle`:

     ```gradle
     minSdkVersion 24
     ```

### iOS

#### Pod Install Fails

Solutions:

1. Update CocoaPods: `sudo gem install cocoapods`
2. Clean pods: `cd ios && pod deintegrate && pod install`
3. Clear cache: `rm -rf ~/Library/Caches/CocoaPods`

#### Camera Permission

Solutions:

1. Add to `Info.plist`:

   ```xml
   <key>NSCameraUsageDescription</key>
   <string>This app needs access to your camera to perform OCR</string>
   ```

### General

#### No Text Detected

Solutions:

1. Ensure good lighting conditions
2. Hold the camera steady and focus on the text
3. Try adjusting `frameProcessorFps` (lower values may help)
4. Check that the text is clear and not too small
5. Verify the frame processor is being called (add console logs)

#### Performance Issues

Solutions:

1. Reduce `frameProcessorFps` to 2-3
2. Add throttling/debouncing to text updates
3. Process frames conditionally (e.g., only when camera is focused)
4. Avoid heavy operations in the frame processor worklet
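
As one illustration of reducing downstream work, results can be reported only once the same text has been seen on several consecutive frames, which also filters out flickering partial reads. This is a sketch of one possible approach, not something the library provides: `createStabilityFilter` is a hypothetical helper and the repeat count of 3 is arbitrary.

```typescript
// Hypothetical helper: returns the text once it has been seen on
// `requiredRepeats` consecutive frames, and null otherwise.
// The same stable text is not reported twice in a row.
function createStabilityFilter(requiredRepeats: number): (text: string) => string | null {
  let candidate: string | null = null;
  let count = 0;
  let lastReported: string | null = null;

  return (text: string): string | null => {
    if (text === candidate) {
      count += 1;
    } else {
      // A different reading restarts the count.
      candidate = text;
      count = 1;
    }
    if (count >= requiredRepeats && candidate !== lastReported) {
      lastReported = candidate;
      return candidate;
    }
    return null;
  };
}

// Example: require three consecutive identical readings.
const stable = createStabilityFilter(3);
const outputs = [
  stable('TOTAL 12.50'), // first sighting
  stable('TOTAL 12.50'), // second sighting
  stable('TOTAL 12.50'), // third sighting, reported
  stable('TOTAL 12.50'), // already reported
  stable('T0TAL 12.5'), // noisy frame restarts the count
];
```

The trade-off is latency: at 3 frames per second, requiring three matching frames delays each report by about a second.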

📱 Example App

A complete working example app is available in the example directory. See the example README for setup instructions.

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

### Development Setup

```bash
# Clone the repository
git clone https://github.com/bear-block/vision-camera-ocr.git
cd vision-camera-ocr

# Install dependencies
yarn install

# Run tests
yarn test

# Type checking
yarn typecheck

# Linting
yarn lint
```

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

- Built on top of react-native-vision-camera
- Android OCR powered by Google ML Kit
- iOS OCR powered by Apple Vision Framework

💖 Support This Project

If you find this library useful and would like to support ongoing development, please consider:

- ⭐ Starring this repository
- 🐛 Reporting bugs and feature requests
- 💻 Contributing code improvements
- 💰 Sponsoring us on GitHub

👉 Become a Sponsor

---

Made with ❤️ by Bear Block

GitHub • Issues • Discussions
