muguk Posted March 4, 2010 Share Posted March 4, 2010 I'm currently wading through individual pages of a scanned manual (an old Atari ST one for Hisoft 'C') which is 380+ pages long. Adobe Writer tried it's best to OCR it on the first pass but certain words kept coming up as unrecognised due to the bad scan/yellowed hue in the background. When I remove the yellow background, the OCR operation works a lot better. I'm using Paint.NET's fill option on the background (white) with 50% tolerance and it's working a treat but is there any way to automate this process? There are a few other manuals which could do with some prep work done on them too before I attempt to OCR those as well and they have the same yellowing background. Also, for the few pages that are a bad scan (I didn't do the original), how can I 'liven' up the text? My first attempt was using the colour replace function to replace the various shades of grey in the scan with a much darker one. Is there a plugin / option I can use to only sharpen up the text? Quote Link to comment Share on other sites More sharing options...
yellowman Posted March 4, 2010 Share Posted March 4, 2010 Try Thrshold plugin, find it in this package : http://paintdotnet.forumer.com/viewtopic.php?f=16&t=26244 Quote My GalleryMy YouTube Channel "PDN Tutorials" Link to comment Share on other sites More sharing options...
Rick Brewster Posted March 4, 2010 Share Posted March 4, 2010 The current version of Paint.NET can't be used for batch processing. Sorry. Quote The Paint.NET Blog: https://blog.getpaint.net/ Donations are always appreciated! https://www.getpaint.net/donate.html Link to comment Share on other sites More sharing options...
Ego Eram Reputo Posted March 4, 2010 Share Posted March 4, 2010 However Scriptlab can be used to automate a number of steps if you're applying the same methodology to each image. It's not batch processing, but it might help. I'd also have a look at the Alphaspace plugin for background removal: viewtopic.php?f=16&t=27959&start=0 Quote ebook: Mastering Paint.NET | resources: Plugin Index | Stereogram Tut | proud supporter of Codelab plugins: EER's Plugin Pack | Planetoid | StickMan | WhichSymbol+ | Dr Scott's Markup Renderer | CSV Filetype | dwarf horde plugins: Plugin Browser | ShapeMaker Link to comment Share on other sites More sharing options...
muguk Posted March 4, 2010 Author Share Posted March 4, 2010 In theory, the top 10x10 pixels of each scanned image will be exactly the same - however <Family-friendly forum.>-eyed the original scan was. Will take a look at this tomorrow in work as that's where I've left all the project files. Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.