Jump to content

Bulk background filling on 300+ pages

Recommended Posts

I'm currently wading through individual pages of a scanned manual (an old Atari ST one for Hisoft 'C') which is 380+ pages long. Adobe Writer tried it's best to OCR it on the first pass but certain words kept coming up as unrecognised due to the bad scan/yellowed hue in the background. When I remove the yellow background, the OCR operation works a lot better.

I'm using Paint.NET's fill option on the background (white) with 50% tolerance and it's working a treat but is there any way to automate this process? There are a few other manuals which could do with some prep work done on them too before I attempt to OCR those as well and they have the same yellowing background.

Also, for the few pages that are a bad scan (I didn't do the original), how can I 'liven' up the text? My first attempt was using the colour replace function to replace the various shades of grey in the scan with a much darker one. Is there a plugin / option I can use to only sharpen up the text?

Link to comment
Share on other sites

However Scriptlab can be used to automate a number of steps if you're applying the same methodology to each image. It's not batch processing, but it might help.

I'd also have a look at the Alphaspace plugin for background removal: viewtopic.php?f=16&t=27959&start=0

Link to comment
Share on other sites

In theory, the top 10x10 pixels of each scanned image will be exactly the same - however <Family-friendly forum.>-eyed the original scan was.

Will take a look at this tomorrow in work as that's where I've left all the project files.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

  • Create New...