OpenCV EAST model and Tesseract for detection and recognition of text in natural scenes

Unstructured Text

Unstructured text: handwritten, multiple fonts, and sparse. Image source: https://pixabay.com

Datasets for unstructured OCR tasks

SVHN dataset

Scene Text dataset

Devanagari Character dataset

Text Detection:

Sliding window technique
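
To make the idea concrete, here is a minimal sketch of how sliding-window candidates could be generated. The function name `sliding_windows` and its parameters are illustrative assumptions, not taken from any library. In a real detector each window would be cropped and passed to a text/non-text classifier, typically at several window sizes.

```python
from typing import Iterator, Tuple


def sliding_windows(
    image_w: int, image_h: int, win_w: int, win_h: int, stride: int
) -> Iterator[Tuple[int, int, int, int]]:
    """Yield (x, y, w, h) boxes that cover the image in row-major order.

    Hypothetical helper for illustration: only windows that fit entirely
    inside the image are produced.
    """
    if stride <= 0:
        raise ValueError("stride must be positive")
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            yield (x, y, win_w, win_h)


# Example: an 8x4 image scanned with a 4x4 window and a stride of 2.
windows = list(sliding_windows(8, 4, 4, 4, 2))
print(windows)  # [(0, 0, 4, 4), (2, 0, 4, 4), (4, 0, 4, 4)]
```

The number of windows grows quickly as the stride shrinks or more scales are added, which is the main reason this approach is slow compared with the detectors discussed next.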

Single-shot and region-based detectors

YOLO architecture: source

EAST (Efficient and Accurate Scene Text Detector)
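
As a rough sketch of how EAST can be run through OpenCV's `dnn` module: the code below assumes the commonly distributed frozen graph (file name `frozen_east_text_detection.pb`, supplied by you) and its two output layers. The helper names `decode_east` and `detect_text` are made up for this example, and the decoding is the simplified axis-aligned variant that ignores box rotation when building the final rectangle.

```python
import math

# Output layers of the commonly distributed frozen EAST graph (assumption:
# you are using that model file, e.g. frozen_east_text_detection.pb).
EAST_LAYERS = ["feature_fusion/Conv_7/Sigmoid", "feature_fusion/concat_3"]


def decode_east(scores, geometry, min_confidence=0.5):
    """Turn EAST score/geometry maps into [x, y, w, h] boxes.

    scores has shape (1, 1, rows, cols); geometry has shape (1, 5, rows, cols)
    holding distances to the top, right, bottom, left edges and the angle.
    The output maps are 4x smaller than the network input, hence the * 4.0.
    """
    score_map, geo = scores[0][0], geometry[0]
    boxes, confidences = [], []
    for y in range(len(score_map)):
        for x in range(len(score_map[0])):
            score = float(score_map[y][x])
            if score < min_confidence:
                continue
            top, right, bottom, left, angle = (
                float(geo[c][y][x]) for c in range(5)
            )
            cos_a, sin_a = math.cos(angle), math.sin(angle)
            box_w, box_h = right + left, top + bottom
            end_x = x * 4.0 + cos_a * right + sin_a * bottom
            end_y = y * 4.0 - sin_a * right + cos_a * bottom
            boxes.append(
                [int(end_x - box_w), int(end_y - box_h), int(box_w), int(box_h)]
            )
            confidences.append(score)
    return boxes, confidences


def detect_text(image_path, model_path, width=320, height=320,
                min_confidence=0.5, nms_threshold=0.4):
    """Run EAST on one image; returns boxes in original image coordinates."""
    # Imported here so decode_east stays usable without OpenCV installed.
    import cv2
    import numpy as np

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    orig_h, orig_w = image.shape[:2]

    net = cv2.dnn.readNet(model_path)
    # EAST expects input dimensions that are multiples of 32.
    blob = cv2.dnn.blobFromImage(image, 1.0, (width, height),
                                 (123.68, 116.78, 103.94),
                                 swapRB=True, crop=False)
    net.setInput(blob)
    scores, geometry = net.forward(EAST_LAYERS)

    boxes, confidences = decode_east(scores, geometry, min_confidence)
    keep = cv2.dnn.NMSBoxes(boxes, confidences, min_confidence, nms_threshold)

    rx, ry = orig_w / width, orig_h / height
    results = []
    for i in np.array(keep).reshape(-1):
        x, y, w, h = boxes[int(i)]
        results.append((int(x * rx), int(y * ry), int(w * rx), int(h * ry)))
    return results
```

A call such as `detect_text("scene.jpg", "frozen_east_text_detection.pb")` (both paths are placeholders) would return a list of `(x, y, w, h)` boxes after non-maximum suppression. The thresholds shown are starting points to tune, not recommended values.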

Text Recognition:

CRNN
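
A CRNN emits one probability distribution over characters per horizontal time step and is trained with CTC, so its raw output has to be collapsed into a string. The sketch below shows greedy (best-path) CTC decoding on plain Python lists. The function name `ctc_greedy_decode`, the toy alphabet, and the choice of index 0 for the blank symbol are assumptions made for this example.

```python
def ctc_greedy_decode(probs, alphabet, blank=0):
    """Greedy CTC decoding.

    probs:    one list of class probabilities per time step.
    alphabet: characters for class indices 1..N (index `blank` is reserved).
    Steps: take the argmax per time step, merge consecutive repeats,
    then drop blanks.
    """
    best_path = [max(range(len(step)), key=step.__getitem__) for step in probs]
    chars = []
    previous = None
    for index in best_path:
        if index != previous and index != blank:
            chars.append(alphabet[index - 1])
        previous = index
    return "".join(chars)


# Toy example: the best path is a a _ a b b _ (where _ is the blank).
path = [1, 1, 0, 1, 2, 2, 0]
probs = [[1.0 if i == p else 0.0 for i in range(3)] for p in path]
print(ctc_greedy_decode(probs, "ab"))  # aab
```

The blank between the two runs of `a` is what lets the decoder keep a genuinely doubled letter instead of merging it away. Beam search over the same output usually gives slightly better results at higher cost.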

Tesseract
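
One way to call Tesseract from Python is the `pytesseract` wrapper, which needs the Tesseract binary installed separately. The sketch below crops a detected box with a little padding and recognizes it as a single line of text. The helper names `pad_box` and `recognize_text` and the padding value are illustrative assumptions; `--oem 1` selects the LSTM engine and `--psm 7` treats the crop as one text line.

```python
def pad_box(box, pad, image_w, image_h):
    """Grow an (x, y, w, h) box by `pad` pixels, clamped to the image.

    Returns corner coordinates (x0, y0, x1, y1). A small margin tends to
    help recognition when a detector crops characters too tightly.
    """
    x, y, w, h = box
    x0 = max(0, x - pad)
    y0 = max(0, y - pad)
    x1 = min(image_w, x + w + pad)
    y1 = min(image_h, y + h + pad)
    return (x0, y0, x1, y1)


def recognize_text(image, box, pad=4, lang="eng", oem=1, psm=7):
    """Recognize the text inside one detected box.

    image: an RGB NumPy array (convert OpenCV's BGR order first).
    """
    # Imported here so pad_box stays usable without pytesseract installed.
    import pytesseract

    image_h, image_w = image.shape[:2]
    x0, y0, x1, y1 = pad_box(box, pad, image_w, image_h)
    crop = image[y0:y1, x0:x1]
    config = f"--oem {oem} --psm {psm}"
    return pytesseract.image_to_string(crop, lang=lang, config=config).strip()
```

For whole-page or multi-line crops a different page segmentation mode is usually a better fit, so treat `psm=7` as a default for word- or line-level boxes only.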

Results:
