Skip to content

Add basic OCR function using pytesseract #56

Description

@AshtamiBiju

I’d like to add a simple OCR function that can read text from an image. This will be an early step towards VisionMate’s text recognition feature.

I didn’t see an OCR function in the repo, but if something like this already exists (or is planned elsewhere), please let me know so I don’t duplicate work.

What it will do:-

Take an image as input
Use pytesseract to extract text
Return the text as a string

Steps I’ll take:-

Create a new file vision/ocr.py
Write a function extract_text(image_path: str) -> str
Add a sample image (sample_text.png) for testing
Write a small test script to show how it works

Metadata

Metadata

Assignees

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions