Convert image to text in .net


This article is about Image to Text converter using MODI (Microsoft Office Document Imaging) adds programming features to the document scanning and viewing tools. We can easily convert scanned document to text.

What is MODI?
MODI (Microsoft Office Document Imaging) is library which contained in Microsoft office package use to extract text and layout information from scanned pages by using OCR (optical character recognition).

Adding MODI to Project?
Add reference to your project from COM -> Microsoft Office Document Imaging 11.0 Type Library.

Note: If you have installed Microsoft Office 2003 then MODI is installed by default for office 2007 you have to add from installation package. You have run package and add MODI .In office 2010 unfortunately MODI is removed by Microsoft. But you can get by installing Microsoft Office SharePoint Designer 2007.
To install refer given link
Click Here

After adding reference to project is more than your half your work done

1)Create a form and add text box and button to upload image.
2)You can design as your here I designed according to my requirement.
3)Take picture box to display image and text box to display text.
4)Change text box property of Multiline=true;
5)On Button upload on button_click event write the following code.

Code is Here :


private void btnUpload_Click(object sender, EventArgs e)
{
OpenFileDialog op1 = new OpenFileDialog();

op1.ShowDialog();
txtUpload.Text = op1.FileName;
MODI.Document DOC = new MODI.Document();
StringBuilder str = new StringBuilder();
string filename = txtUpload.Text;
MODI.Word word = null;
MODI.Image image = null;
MODI.Layout layout = null;


pictureBox1.ImageLocation = txtUpload.Text;
DOC.Create(filename);
DOC.OCR(MODI.MiLANGUAGES.miLANG_ENGLISH, true, true);

for (int i = 0; i < DOC.Images.Count; i++)
{

image = (MODI.Image)DOC.Images[i];
layout = image.Layout;
for (int j = 0; j < layout.Words.Count; j++)
{

word = (MODI.Word)layout.Words[j];
str.Append(" "+word.Text);


}
}
textBox1.Text = str.ToString();
DOC.Close(false);

}


and you can run your project and check result my output is like this.
Or You can check attachment I have attached project.
Snap shot of Image to Text


Attachments

  • Project Image to Text using C# (45094-12033-Project-Image-Text-using-C.zip)
  • Comments

    No responses found. Be the first to comment...


  • Do not include your name, "with regards" etc in the comment. Write detailed comment, relevant to the topic.
  • No HTML formatting and links to other web sites are allowed.
  • This is a strictly moderated site. Absolutely no spam allowed.
  • Name:
    Email: