
Extract text from jpeg image file


Posted Date: 05 Jan 2010      Posted By: Pankaj     Member Level: Bronze    Member Rank: 0     Points: 1   Responses: 1



How can I extract text from a JPEG image file? I have already extracted text from a JPEG image using MODI (Microsoft Office Document Imaging), but the resulting text file is not 100% accurate. How can we achieve 100% accuracy?
Here is my code in C#:

// Requires: using System; using System.IO; using System.Diagnostics;
// plus a COM reference to the Microsoft Office Document Imaging (MODI) type library.

private void btnConvert_Click(object sender, EventArgs e)
{
    CheckFileType(txtFolder.Text);
}

public void CheckFileType(string directoryPath)
{
    foreach (string filePath in Directory.GetFiles(directoryPath))
    {
        // Process only JPG files (case-insensitive, so .jpg, .JPG and .Jpg all match)
        string fileExtension = Path.GetExtension(filePath);
        if (!string.Equals(fileExtension, ".jpg", StringComparison.OrdinalIgnoreCase))
            continue;

        // Text file with the same path and name as the image, but a .txt extension
        string textPath = Path.ChangeExtension(filePath, ".txt");

        try
        {
            // OCR operations
            MODI.Document md = new MODI.Document();
            md.Create(filePath);
            md.OCR(MODI.MiLANGUAGES.miLANG_ENGLISH, true, true);
            MODI.Image image = (MODI.Image)md.Images[0];

            // Save the recognized text; an existing .txt file is overwritten
            File.WriteAllText(textPath, image.Layout.Text);

            // Close the document without saving changes to the image
            md.Close(false);
        }
        catch (Exception ex)
        {
            // The image has no recognizable text or could not be processed;
            // log the reason instead of silently ignoring it
            Debug.WriteLine(filePath + ": " + ex.Message);
        }
    }
}




Responses

#464145    Author: Jigar      Member Level: Gold      Member Rank: 171     Date: 15/Jan/2010   Rating: 2 out of 5     Points: 2

Hi Pankaj,

No software, API, or SDK can guarantee 100% accuracy for image-to-text conversion, because in the end everything depends on the quality of the image.
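As a rough illustration of the point about image quality, here is a minimal sketch of preparing an image before running OCR on it: it writes a grayscale, enlarged copy of the source image as a TIFF file. The class name OcrPreprocessor, the method SaveGrayscaleUpscaled, and the scale parameter are hypothetical names made up for this example; they are not part of MODI. Whether this actually improves recognition depends on the source image, so treat it as something to experiment with, not a guaranteed fix.

```csharp
using System;
using System.Drawing;
using System.Drawing.Drawing2D;
using System.Drawing.Imaging;

public static class OcrPreprocessor
{
    // Hypothetical helper (not part of MODI): saves a grayscale copy of the
    // image at inputPath, enlarged by the given integer factor, to outputPath.
    public static void SaveGrayscaleUpscaled(string inputPath, string outputPath, int scale)
    {
        if (scale < 1) throw new ArgumentOutOfRangeException("scale");

        using (Bitmap source = new Bitmap(inputPath))
        using (Bitmap result = new Bitmap(source.Width * scale, source.Height * scale))
        {
            // Color matrix that maps R, G, B to a single luminance value
            ColorMatrix gray = new ColorMatrix(new float[][]
            {
                new float[] { 0.299f, 0.299f, 0.299f, 0, 0 },
                new float[] { 0.587f, 0.587f, 0.587f, 0, 0 },
                new float[] { 0.114f, 0.114f, 0.114f, 0, 0 },
                new float[] { 0, 0, 0, 1, 0 },
                new float[] { 0, 0, 0, 0, 1 }
            });

            using (Graphics g = Graphics.FromImage(result))
            using (ImageAttributes attributes = new ImageAttributes())
            {
                attributes.SetColorMatrix(gray);
                g.InterpolationMode = InterpolationMode.HighQualityBicubic;

                // Draw the whole source image into the larger target,
                // applying the grayscale matrix while scaling
                g.DrawImage(source,
                    new Rectangle(0, 0, result.Width, result.Height),
                    0, 0, source.Width, source.Height,
                    GraphicsUnit.Pixel, attributes);
            }

            result.Save(outputPath, ImageFormat.Tiff);
        }
    }
}
```

The saved file could then be passed to the OCR call in place of the original JPG, for example by calling OcrPreprocessor.SaveGrayscaleUpscaled(filePath, tiffPath, 2) before creating the MODI document, where tiffPath is a path of your choosing.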

For background, this process is called OCR (Optical Character Recognition); it converts the text in an image file into text data.

The best tool I know for this conversion, which I have used in the past, is ABBYY FineReader. You can find more information by searching for it online.

To integrate it into your application, though, you need the SDK (Software Development Kit), which provides a DLL file through which you can extract the text.

We have worked on High Court pages that were converted to searchable form and stored in a database with their full text. We successfully completed the trial project for that, and we have worked with many OCR software packages.

Hope this helps.

-:)

Jigar Shah
Software Developer


 