Python OCR and Barcode Recognition

High performance, royalty-free Python OCR and barcode recognition on Windows, Linux, Mac OS and Unix

Resources and FAQ's for Asprise OCR for Python

    Recognizes text only or barcode only

    To recognize text only:

    s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1,
                            OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)

    To recognize barcode only:

    s = ocrEngine.recognize("test.png", -1, -1, -1, -1, -1,
                            RECOGNIZE_TYPE_BARCODE, OCR_OUTPUT_FORMAT_PLAINTEXT)

    Perform OCR on part of the image

    In some cases, you might not want to OCR the whole image. In that case, you can OCR on part of the image to save time:

    s = ocrEngine.recognize("test.png", -1, 0, 0, 400, 200,
                            OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)

    The above code OCR the top left part of the image with width 400 pixels and height 200 pixels.

    Perform OCR on multiple input files in one shot

    s = ocrEngine.recognize("test1.png;test2.png", -1, -1, -1, -1, -1,
                            OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)

    Perform OCR on a certain page from the specified TIFF file

    A TIFF file may contain multiple pages. If you need to recognize only a certain page, you can specify the page number as following:

    s = ocrEngine.recognize("test.tif", 2, -1, -1, -1, -1,
                            OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)

    Note 2 means the second page (the page number of the first page is 1).

    Perform OCR on a PDF input file

    You use the following method to perform OCR on a PDF input file:

    s = ocrEngine.recognize("test.pdf", -1, -1, -1, -1, -1, OCR_RECOGNIZE_TYPE_TEXT, OCR_OUTPUT_FORMAT_PLAINTEXT)

    Read the developer's guide for more details

    Back to top