4.2 C. 10. Do a Spell Check to catch typical scannos

Skip to end of metadata
Go to start of metadata

The page Scanning And Proofreading Manual does not exist.

4.2  C.  10.  Do a Spell Check to catch typical scannos

Back to:  4. Proofread a Book


              1)  Spell Check the file
              2)  Example of a typical scanno
              3)  Other patterns of common OCR mistakes  (scannos)


1)  Spell Check the file

      Correct all "scannos" as the spell checker finds them.


2)  Example of a typical scanno

      Note:  Many scannos (or OCR errors) will not be found by the Spell Checker.  Spell Check will skip
      over them and not give you a chance to make a correction because they are legitimate words in English!

      For example, the OCR could read an image of the word "clear" and mistakenly interpret it as the word
      "dear".  But since "dear" is a legitimate English word, Spell Check will pass right over it, even though
      this is not the correct word in the book.

      Here's more about this process...


      First the scanner takes a picture of each page.  Then OCR software is run on these pictures.  The OCR
      tries to figure out what letters and numbers (and other characters!) are represented by the groups of
      pixels it finds in the image of each page.

      One type of scanno can occur when two letters that are right next to each other are interpreted as one,
      completely different letter  For example, when a lower case letter  c  is followed immediately by a lower
      case letter  l...

      In the "mind" of the OCR, the letter  c  followed by the letter  l  forms a shape that is very similar to the
      shape of the letter  d,  illustrated on the next two lines:

          cl
          d

      The sentence in the original was:
          "It was a clear, sunny day."

      But the OCR mistakenly put a letter  d  instead of the two letters  c  and  l.  Now the sentence erroneously
      reads:
          "It was a dear, sunny day."

      This error won't be caught by the Spell Checker because the  word  dear  is a "legal" word in
      English, even though it's not the correct word in the original book.


3)  Other patterns of common OCR mistakes  (scannos)

      Experienced proofreaders are on the lookout for such errors invisible to the spell checker.
      For example:


      die       for     the

      1          for     I

      fit         for     lit             or its reverse

      nor      for     not           or its reverse

      rm       for     m & n       or its reverse

      bad     for     had


      etc.


To the next Topic:  11. Important additional information



TOP OF PAGE
The page Scanning And Proofreading Manual does not exist.

Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.