OpenCV EAST model and Tesseract for detection and recognition of text in natural scenes


Unstructured Text

Unstructured Texts: Handwritten, Multiple fonts and sparse; Image source:

Datasets for unstructured OCR tasks

SVHN dataset

Scene Text dataset

Devanagri Character dataset

Text Detection :


Sliding window technique

Single Shot and Region based detectors

YOLO architecture: source

EAST (Efficient accurate scene text detector)


Text Recognition :




Results :

References :

Committed lifelong learner. I am passionate about machine learning, data engineering and currently working as a datascientist.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store