Image to Text OCR Application

PrototypeLearning OCR prototype / 2024

Problem

Users need to digitize image-based text, review it, and store the result with the original image.

Solution

Build a React frontend for browser OCR and text review, backed by an Express/Multer API and MongoDB.

Key Features

In-browser OCR using Tesseract.js with no server-side model required

File upload interface for images containing text

Extracted text review and editing step before saving

Express.js REST API with Multer for image uploads

MongoDB persistence for both images and extracted text via Mongoose

Responsive frontend built with React and Tailwind CSS

Architecture Map

Mermaid

flowchart LR
  User["React browser UI"] --> Upload["Image upload"]
  Upload --> OCR["Tesseract OCR"]
  OCR --> Review["Extracted text review"]
  Review --> API["Express API"]
  Upload --> API
  API --> Files["Multer file storage"]
  API --> Models["Mongoose models"]
  Models --> DB[("MongoDB")]

OCR App Screenshots

2 views

Test image containing Edge Detection text, used as the input to the OCR pipeline. — Input sample from the repository: a styled Edge Detection document used to validate the OCR extraction.

GitHub view of Test_text.txt showing OCR-extracted Edge Detection text saved by the backend. — Result artifact stored as Test_text.txt by the Express backend, showing the extracted Edge Detection text after processing.