Among the e-books I'm currently reading are a collection of Chinese fairy tales and a biography of Theodore Roosevelt. I'm pretty sure both were digitized using OCR, because both are plagued by recurrent errors. The Roosevelt book makes frequent reference to GOP pol James G. Blaine as "Elaine". (They get it right a couple of times, but more often than not it's wrong.) The last couple of fairy tales have referred to "the Euler of Heaven", and while my mathematician's heart feels the phrase is accurate, I'm pretty sure it's supposed to be "Ruler".
(Neither is as bad as the collection of Navajo stories I read a couple of years ago; the author, to represent sounds that are present in Navajo but not in English, used a set of nonstandard symbols, which the OCR software couldn't handle. There were numerous cases where what was obviously the same name was transcribed differently in successive paragraphs.)