Tesseract OCR Text Position

Tags: c#, asp.net, ocr, tesseract

I am working on OCR using Tesseract. The application works and I get output. I'm trying to extract data from an invoice, and the spacing between words in the output file has to match the spacing in the input. I currently get each word along with its coordinates. I need to export the words to a text file, positioned according to those coordinates.

Code sample:

            using (var engine = new TesseractEngine(Server.MapPath(@"~/tessdata"), "eng", EngineMode.Default))
            {
                engine.DefaultPageSegMode = PageSegMode.AutoOsd;

                // Pix is loaded via a Bitmap since Pix doesn't support loading from a stream.
                using (var image = new System.Drawing.Bitmap(imageFile.PostedFile.InputStream))
                {
                    // Note: the resized bitmap is not used below; the original image is what gets converted.
                    Bitmap bmp = Resize(image, 1920, 1080);

                    using (var pix = PixConverter.ToPix(image))
                    {
                        using (var page = engine.Process(pix))
                        {
                            string path = Server.MapPath("~/Output/data.txt");

                            using (var iter = page.GetIterator())
                            {
                                iter.Begin();
                                do
                                {
                                    Rect wordBounds;
                                    if (iter.TryGetBoundingBox(PageIteratorLevel.Word, out wordBounds))
                                    {
                                        // wordBounds holds the word's position, curText holds its text.
                                        var curText = iter.GetText(PageIteratorLevel.Word);

                                        //WriteToTextFile(curText, wordBounds, path);
                                        // Words are appended with no separator, so the original spacing is lost.
                                        resultText.InnerText += curText;
                                    }
                                } while (iter.Next(PageIteratorLevel.Word));
                            }

                            meanConfidenceLabel.InnerText = String.Format("{0:P}", page.GetMeanConfidence());
                        }
                    }
                }
            }

Here is an example of input and output showing the wrong spacing.

Input Output

asked Jul 11 '18 09:07

ab2015

1 Answer

You can loop through the items found on the page using page.GetIterator(). For each item you can get a bounding box: a Tesseract.Rect (a rectangle struct) that holds the X1, Y1, X2 and Y2 coordinates.

// Pick the granularity you need; Word matches the question, other levels such as TextLine also exist.
Tesseract.PageIteratorLevel myLevel = Tesseract.PageIteratorLevel.Word;
using (var page = Engine.Process(img))
using (var iter = page.GetIterator())
{
    iter.Begin();
    do
    {
        if (iter.TryGetBoundingBox(myLevel, out var rect))
        {
            var curText = iter.GetText(myLevel);
            // Your code here: 'rect' contains the location of the text, 'curText' contains the text itself
        }
    } while (iter.Next(myLevel));
}

There is no clear-cut way to use the positions in the input to space the text in the output. You're going to have to write some custom logic for that.
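One possible starting point for that custom logic is to group the recognised words into text lines by their vertical position before worrying about horizontal spacing. The sketch below is only an illustration under stated assumptions: `OcrWord` and `LineGrouper` are hypothetical names (not Tesseract types), the coordinates are copied from the bounding boxes, and the "within half a word height" tolerance is an arbitrary choice that may need tuning for skewed or tightly packed scans.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical holder for one recognised word; not part of the Tesseract API.
public sealed class OcrWord
{
    public string Text { get; }
    public int X1 { get; }
    public int Y1 { get; }
    public int X2 { get; }
    public int Y2 { get; }

    public OcrWord(string text, int x1, int y1, int x2, int y2)
    {
        Text = text; X1 = x1; Y1 = y1; X2 = x2; Y2 = y2;
    }

    // Vertical centre of the bounding box.
    public double CentreY => (Y1 + Y2) / 2.0;
}

public static class LineGrouper
{
    // Groups words into lines, top to bottom. A word joins the current line when
    // its vertical centre is within half the height of that line's first word.
    public static List<List<OcrWord>> GroupIntoLines(IEnumerable<OcrWord> words)
    {
        var lines = new List<List<OcrWord>>();
        foreach (var word in words.OrderBy(w => w.CentreY))
        {
            var current = lines.LastOrDefault();
            if (current != null)
            {
                var first = current[0];
                double tolerance = (first.Y2 - first.Y1) / 2.0; // assumed tolerance
                if (Math.Abs(word.CentreY - first.CentreY) <= tolerance)
                {
                    current.Add(word);
                    continue;
                }
            }
            lines.Add(new List<OcrWord> { word });
        }

        // Order the words left to right within each line.
        foreach (var line in lines)
        {
            line.Sort((a, b) => a.X1.CompareTo(b.X1));
        }
        return lines;
    }
}
```

Inside the iterator loop you would add `new OcrWord(curText, rect.X1, rect.Y1, rect.X2, rect.Y2)` to a list, then pass that list to `GroupIntoLines` once the loop has finished.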

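You might be able to estimate the number of spaces you need to the left of your text with something like this, where inputWidth is the width of the input image in pixels and outputWidthSpaces is the width of an output line in characters:

// Cast to double first: rect.X1 is an int, so if inputWidth is also an int the division would truncate to 0.
var padLeftSpaces = (int)Math.Round(((double)rect.X1 / inputWidth) * outputWidthSpaces);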
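To show how that estimate might be applied to a whole line, here is a minimal sketch that maps each word's left edge to a character column and pads with spaces up to that column. `LineRenderer` and `RenderLine` are hypothetical names, the words are assumed to belong to one text line and to be sorted left to right, and the rule of keeping at least one space between neighbouring words is an assumption added so that words never run together when the scaled columns collide.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

public static class LineRenderer
{
    // Renders one line of text. Each word is given as (Text, X1), where X1 is the
    // left edge of its bounding box in pixels. inputWidth is the image width in
    // pixels; outputWidthSpaces is the target line width in characters.
    public static string RenderLine(
        IEnumerable<(string Text, int X1)> words, int inputWidth, int outputWidthSpaces)
    {
        var sb = new StringBuilder();
        foreach (var (text, x1) in words)
        {
            // Scale the pixel position to a character column (double division on purpose).
            int column = (int)Math.Round((double)x1 / inputWidth * outputWidthSpaces);

            // Never overlap the previous word: leave at least one space after it.
            int minColumn = sb.Length == 0 ? 0 : sb.Length + 1;
            int target = Math.Max(column, minColumn);

            sb.Append(' ', target - sb.Length);
            sb.Append(text);
        }
        return sb.ToString();
    }
}
```

Because the output only looks aligned in a monospaced font, and because words wider than their scaled slot push later words to the right, this is an approximation of the layout, not an exact reproduction of the invoice.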

answered Sep 22 '22 11:09

GWigWam
