Step 10: Post Processing

There are two ways we can post-process these images before we make a PDF out of them:

The Lazy Way
Open up your favorite editor and rotate all of those images so they're the right way up :(

Or you can use some software Matti wrote to batch rotate them for you:
  • If you followed our instructions and took pictures of the right-hand side of the book all the way to the end, then flipped the book and took all the left-side pictures:
RotateAll.exe (Source code) will rotate the first half of the images clockwise, the second half counter-clockwise.
  • If you didn't use a tripod and instead took pictures of each page, alternating right then left, starting with the right-hand side of the book:
RotateEveryOther.exe (Source code) will rotate every other image clockwise, the remaining counter-clockwise.

To use these programs, just drag and drop a folder containing your images onto the .exe file of your choice. The program will automatically rotate your images and save them as 00001.jpg, etc. in the same folder as your images.

Make sure the (alphabetically) first image (for RotateEveryOther) or first set of images (for RotateAll) shows the right-hand page; otherwise your images will be rotated the wrong way.
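The order the two programs work in can be sketched in a few lines of Python. This is not Matti's source code, just a rough illustration: the function name rotation_plan and the mode strings are made up for this example, the quarter-turn angle is an assumption, and the sketch only works out which way each file would be turned instead of touching any images.

```python
def rotation_plan(filenames, mode):
    """Return (filename, degrees, new_name) for each image, in alphabetical order.

    Hypothetical helper, not the real RotateAll/RotateEveryOther code.
    degrees: +90 means clockwise, -90 means counter-clockwise (assumed quarter turns).
    mode: "all" mimics RotateAll, "every_other" mimics RotateEveryOther.
    """
    ordered = sorted(filenames)      # the programs work in alphabetical order
    half = len(ordered) // 2         # with an odd count, the first half is the smaller one
    plan = []
    for index, name in enumerate(ordered):
        if mode == "all":
            clockwise = index < half          # first half one way, second half the other
        elif mode == "every_other":
            clockwise = index % 2 == 0        # 1st, 3rd, 5th... image clockwise
        else:
            raise ValueError(f"unknown mode: {mode!r}")
        degrees = 90 if clockwise else -90
        plan.append((name, degrees, f"{index + 1:05d}.jpg"))
    return plan


if __name__ == "__main__":
    shots = ["IMG_0004.jpg", "IMG_0001.jpg", "IMG_0003.jpg", "IMG_0002.jpg"]
    for entry in rotation_plan(shots, "all"):
        print(entry)
```

The sketch also shows why the alphabetically first image matters: everything is decided by position in the sorted list, so one stray file at the front shifts every rotation after it.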

If you follow this procedure, your resulting images will be something like this: 

The Better Way
Over on the DIY Book Scanner forums, we prefer to use Scan Tailor.

Scan Tailor was originally written by Joseph Artsimovich for processing scanned-in books from flatbed scanners; it does a wonderful job of automatically finding the content of the pages and generally makes them look a lot better than the original camera shots.

Following our directions, your images will be out of order (all right-hand pages first, then all left-hand pages). It'd be a pain to rename all of these by hand so they end up in the right order, so Matti wrote a little utility to copy/rename all the images:

RenameAll.exe (Source code) copies and renames the first half of the images to 000001-a.jpg, etc., then the second half to 000001-b.jpg, etc.

To use this program, just drag and drop the folder containing your images onto the RenameAll.exe file, and the images will be copied and renamed into the same folder.
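Why the -a/-b naming fixes the page order is easier to see in code. The following is a minimal Python sketch of the same idea, not the actual RenameAll source; the function names interleaved_names and copy_renamed are invented here, and it assumes the second half of the images is already in the same page order as the first half.

```python
import shutil
from pathlib import Path


def interleaved_names(filenames):
    """Map each original name to a new name that sorts into reading order.

    Hypothetical helper: first half becomes 000001-a.jpg, 000002-a.jpg, ...
    and second half becomes 000001-b.jpg, 000002-b.jpg, ...
    """
    ordered = sorted(filenames)
    half = len(ordered) // 2
    mapping = {}
    for index, name in enumerate(ordered):
        if index < half:
            mapping[name] = f"{index + 1:06d}-a.jpg"
        else:
            mapping[name] = f"{index - half + 1:06d}-b.jpg"
    return mapping


def copy_renamed(folder):
    """Copy every .jpg in `folder` to its new name, leaving the originals alone."""
    folder = Path(folder)
    originals = [
        p.name for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() == ".jpg"
    ]
    for old, new in interleaved_names(originals).items():
        shutil.copy2(folder / old, folder / new)
```

Because 000001-a.jpg sorts just before 000001-b.jpg, an alphabetical listing of the copies alternates between the two halves, which is the order Scan Tailor will load them in. Running the sketch twice on the same folder would pick up its own copies, so treat it as an illustration only.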

Using Scan Tailor
When you load up the Scan Tailor program, you'll want to create a new project, and then select the directory containing all of your images as the input directory, and some other (empty) directory for your output directory.

When the "Fix DPI" window pops up, select All Pages, change the DPI to 300 x 300, hit Apply, then OK.
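DPI simply ties a pixel count to a physical size, and camera files often carry only a placeholder value for it. As a rough illustration of the arithmetic (the function name and the pixel dimensions below are invented for the example, not taken from any particular camera):

```python
def page_size_inches(width_px, height_px, dpi=300):
    """Physical size implied by a pixel size at a given DPI (pixels / DPI = inches)."""
    return width_px / dpi, height_px / dpi


if __name__ == "__main__":
    # Example numbers only: a cropped page that is 1800 x 2700 pixels.
    width_in, height_in = page_size_inches(1800, 2700)
    print(f"{width_in:.1f} x {height_in:.1f} inches")
```

At 300 DPI that example page works out to 6 by 9 inches, about the size of a typical paperback page.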

Now we're in the main window. On the right you'll see the task list:
  Fix Orientation
  Split Pages (optional)
  Deskew (optional)
  Select Content
  Page Layout (optional)
  Output

At the bare minimum, you need to fix the orientation of the images, select the content boxes, and then output the processed images. You can skip Split Pages and Deskew.

After rotating the on-screen image to the correct orientation, use the "Apply to..." button and select how you'd like to fix the other images in the project. Use "Apply to..."->"This page and the following ones" if your images are all right-hand pages, then all left-hand pages. Use "Apply to..."->"Every other page" if your images are sequential pages.

In the "Select Content" tab, first hit the little arrow to automatically detect each page, then quickly scroll through each image to make sure the box is the right size in each image.

Finally, select the "Output" tab, deselect the "despeckle" option, and hit "Apply to..."->"Every page". Hit the little arrow, and Scan Tailor will save all the nice, crisp output images to the output directory you specified.

Now you have all your pages ready to be turned into a PDF, or you could put the pictures into a zip file.
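If you go the zip route, a few lines of Python can bundle the output directory for you. This is only a sketch under some assumptions: the function name zip_pages and the list of file extensions are made up for this example, so adjust them to whatever files actually land in your output directory.

```python
import zipfile
from pathlib import Path


def zip_pages(output_dir, archive_path, extensions=(".tif", ".tiff", ".jpg", ".png")):
    """Pack the page images from `output_dir` into one zip, in alphabetical order.

    Hypothetical helper; `extensions` is an assumed list of image types.
    Returns the file names that were added.
    """
    output_dir = Path(output_dir)
    pages = sorted(
        p for p in output_dir.iterdir()
        if p.is_file() and p.suffix.lower() in extensions
    )
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for page in pages:
            # arcname keeps only the file name, so the zip has no folder paths in it
            archive.write(page, arcname=page.name)
    return [p.name for p in pages]
```

Adding the pages in sorted order keeps them in reading order for viewers that show zip contents in stored order.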

Your output will look something like this: 

<p>Thanks for the great Instructable. I ended up using Adobe Lightroom for the image processing. It worked like a charm.</p>
<p>I've been using one camera on a tripod with no close up lighting. I do it during the day and I have a daylight bulb. Shadows don't seem to be a problem though the lamp would probably help. But I would use a daylight bulb rather than those old yellow ones. Not sure if that would make much of a difference but I'm guessing it would.</p><p>I use Scan Tailor for post processing.<br><br>What I got was a fairly decent PDF of a relatively rare book that is not cheap to get second hand online.<br><br>I think for me the biggest thing is reducing page curvature. I could just try placing a piece of perspex flat down on the book...<br><br>Using one camera on each side would take too long and Scan Tailor works only on pictures with two pages in them.</p>
<p>Nice idea. I was doing it already, but I never thought about the light above the book to minimize or eliminate the shadow caused by the camera and the tripod. Thank you.</p>
<p>How many pages can I scan and post-process with this method in a day?</p>
<p>This sounds like a crazy and mad idea! I would probably be better off moving the book to a professional copier and paying the copy fees!</p>
<p>This looked very nice, but the OCR quality turned out to be terrible. For ebooking purposes this is probably useless. If you just want a PDF image, on the other hand, this should work.</p>
Lesson learned.. Turn off the time stamp on your camera before taking photos of 224 pages!
<p>&quot;Doh!&quot; - Homer Simpson</p>
Is anyone using this to scan glossy magazines? One part of the page is blown out with highlights, almost white, and the bottom is dark. Has anyone else had this problem?
<p>I constructed the box base scan bed as instructed and it works as described. I have an issue I'd like advice about if possible. I've been scanning my grandmother's sewing publications for archiving. As the example shows, I have a reflection on the bottom edge of the picture (it's been rotated) that I haven't been able to eliminate. I've repositioned the light source and also tried putting spacers under the center of the box's hinge point to change the angle from about 90 degrees to something larger. These changes haven't eliminated the reflection. Moving the book higher on the box to avoid the hinge point does work, but that takes away the efficiency in this design when scanning multiple pages.</p>
<p>Just made one of these and am currently using it to scan some awesome antique science books. Thanks for the ible!</p>
Nice! If you want to expand it, or explore other book scanning stuff, join us at www.diybookscanner.org/forum .
Thanks for this! Some notes- you can go with a bigger piece of glass (mine was 12x15), but your wrist will get tired of holding it. Turn the pages away from the camera so that you can have an efficient circular motion throughout the scanning process. Don't move the book between pictures! It will make cropping the pictures with ScanTailor way easier later. <br>Also, ScanTailor is very neat, though I have trouble with it always running out of memory on certain pictures. The dewarping facility was useful because I didn't get a proper straight-ahead shot of the pages.
I'm having a problem getting the rename and rotate all program to work correctly. When it runs in the DOS shell it indicates that it went thru all of the files and renamed them all, but when I actually open the file it only finished half the job. Any ideas???
that looks AWESOME, Where did the idea come from?
I'm also looking for a similar solution. <br>But it seems that some company came out with XCANEX. <br>I checked its video and it looks cheaper. <br>Has anyone tried it?
I'd been looking for a solution to this; too many books and no kindle versions. So far this seems to be an outstanding solution, with detailed help every step of the way. <br> <br>Thank You!
tooo awesome!
Is a very good website, Yes, for I have a lot of help, thank you!!!!!! <br>I would recommend to my friends. <br> <br>The recommendation of a friend professional Graphic Converter! really good and I like it very much, I save a lot of time. <a href="http://www.graphic-converter.net/" rel="nofollow"> To try his</a>
I tried the Scan Tailor and it won't process jpg files. Anyone having the same problem? (I convert to tiff , it worked but would rather have the jpg files and save a step). By the way, thank you so much for the instruction for the book scanner, I am in the process of learning how to set it up and actually have a book in pdf.<br> Awesome..
I wonder how this would work...<br><br>Skip the tripod<br>Get 2 lamps instead of one (With the bendy arm just like in your example)<br>Remove the light from one of the lamps and attach the camera to that.
Another cheap alternative for buying a glass sheet are clip frames. You can usually find those in cheap, or poundland type shops in the UK anyways.
So what do you do with a paperback book you want to copy that does not open up nicely as hardback books do? Meaning -- if I put a new paperback book in the box-wedge and open to the page I want to copy, the book simply closes. Any thoughts? Thanks.
Wondering if anyone had any thoughts yet on how to handle a paperback book with the problems described in my first post (above) about book not lying flat. Thanks!
maybe two pieces of glass?
This is an amazing project. I was in a bind (no pun here) the other day. My backpack literally broke under the weight of my engineering texts. I built this in 10 mins, and now all I bring to class is a laptop and a notebook. You guys are a life saver. 5 stars here!
Awesome. If you ever get a chance, drop by DIYBookScanner.org and share a picture of your setup!
I started this today, (great instructable) and have some questions. How do you consolidate all the separate pdf pages into a &quot;book&quot; to load to the ereader? Is there another program to make a large pdf or epub file? I haven't bought an ereader yet, but plan to soon. I have a 1200 page vegetation ID reference book, plus a few others that I'd like to convert so I can lighten my backpack.
Robbtoberfest, you should really join us over at DIYBookScanner.org. In particular, check out the &quot;software&quot; forum and the &quot;new standard scanner&quot; build thread to see some modern improvements. This instructable, though great, is outdated and we have lot of improvements over on the forums.
Will do, thanks much.
Mac Users can do the same &quot;print to PDF&quot; thing, but it is already built into the Mac OS X. View all those images in any application, then print. Note the &quot;PDF&quot; button at the lower left corner of the print dialog box, and use it to print to a PDF file instead of whatever printer is selected.
10.6 and possibly 10.5 allows you to create multi-page pdfs simply by copying all the files in finder and pasting them in preview.
Thanks for the tips!
Some cameras have the ability to &quot;lock&quot; the focus and exposure settings, allowing a much quicker recovery and re-shoot. With a static setup such as this, it should prove quite useful.
Very nice, this would work great with old books that are falling apart. It also would come in handy for keeping a documented copy of important items such as genealogy, baby books, scrapbook pages, wills... I could go on but I think you get the idea. I have taken photo's and made them into phish, not fun with old books, some over 100 years old. This gives the book some support without cracking the old spine and the camera is parallel to the page.
Um... correct me if I am wrong.... but I believe a camera should be on the things needed list... just saying ;) Maybe you know some magic trick that I have yet to learn :P
Good point, we thought it was implied. I'll change it someday.
I may or may not do this project, but one thing is for sure;<br>THANK YOU SO MUCH FOR THE PROGRAMS!!!!!!
You're welcome! Matti did most of the work for the small programs, and Scan Tailor was done by Joseph Artsimovich. I am just the camera/scanner guy. :)
Mac users can use the shareware <a href="http://www.lemkesoft.com/">GraphicConverter.</a> Paying the shareware fee unlocks batch processing. It opens almost any graphic format and saves in almost any graphic format. It's also great for cropping &amp; adjusting. The batch processing is a wonderful thing.<br>
This is a _superb_ 'ible. There aren't many ebooks in my language, and being able to convert my own books (that I've bought and paid for) to read on my e-reader is _awesome_. Thank you so much!
You're very welcome. Feel free to join us at the DIY Book Scanner forum if you run into any trouble along the way.
Totally cool...I've got all the raw materials for this. I agree with you totally about banging up books, and sometimes, magazines, depending on how they are bound, are just as much a problem [think National Geographics and the like] to scan.<br><br>
Awesome. Looking forward to seeing your results... and all your questions are probably answered over at diybookscanner.org. :)
My younger one has a project like this in mind. He wants to build a portable scanner in a box. To do that the camera has to be situated in the box. Is this possible with the steps above?<br />
Hello, I am just taking my first steps into scanning books and this has been very helpful, especially the Q&amp;A section at the beginning.<br /> <br /> My goal is not only to have a pdf but a text document, which is more useful for ereaders. So far, with my short experience, I have two questions for you:<br /> <br /> * Do you get good pdf quality which can be converted into text with a high success rate?<br /> <br /> * In my opinion lighting is one of the keys. How could we improve it to get better scans? Do fluorescent or halogen lamps do any better?<br /> <br /> Thanks again.<br />
&nbsp;Freeware Irfanview has plenty of features to process several pages: cropping, rotating, color enhancing (auto or user-defined),resolution changing, etc.
I would give a try to a CFL (compact fluorescent lamp), instead of the incandescent one.
I've been having an issue running the RenameAll.exe file. It will consistently lock up after ~90 images and Windows will close the program. Is anyone else having this issue or can see the issue in the code? Thanks.<br />
Turns out it was easy to hunt down; my first shot at fixing it was trying to free() the <em>next</em> image from memory, problem was the next image hadn't even been loaded yet. Also I wanted to free() the <em>current</em> image. It's all fixed now.<br /> <br /> The links in the instructable now point to the new binaries and source code.<br /> <br /> <br />

About This Instructable




Bio: Hacker, Artist, Researcher, and founder of the diybookscanner.org community.
More by daniel_reetz:Bargain-Price Book Scanner From A Cardboard Box. DIY Camera Array 2: Computational Refocusing With Just One Camera Removing Anodizing From Aluminum Quickly and Easily. 