Timeline of optical character recognition

From Wikipedia, the free encyclopedia

This is a timeline of optical character recognition.

Overview

Time period Summary
1870–1931 Earliest ideas of optical character recognition (OCR) are conceived. Fournier d'Albe's Optophone and Tauschek's Reading Machine are developed as devices to help the blind read.[1]
1931–1954 First OCR tools are invented and applied in industry, able to interpret Morse code and read text out loud. The Intelligent Machines Research Corporation is the first company created to sell such tools.
1954–1974 The Optacon, the first portable OCR device, is developed. Similar devices are used to digitise Reader's Digest coupons and postal addresses. Special typefaces are designed to facilitate scanning.[1][2][3]
1974–2000 Scanners are widely used to read price tags and passports.[4] Companies such as Caere Corporation, ABBYY and Kurzweil Computer Products Inc. are founded. Kurzweil Computer Products develops the first omni-font OCR software, capable of recognizing text printed in virtually any font.[5]
2000–2016 OCR software is made available online for free, through products like Adobe Acrobat, WebOCR, and Google Drive.[6][7]

Timeline

Year Event type Technology Details
1870 Invention American inventor Charles R. Carey invents the retina scanner, an image transmission system using a mosaic of photocells, considered the first OCR invention in the world.[1]
1885 Invention Image scanner Paul Nipkow invents the Nipkow disk, an image scanning device that later proves to be a major breakthrough for both modern television and reading machines.[8]
1900 Invention Russian scientist Tyurin envisions the first OCR machine to serve as an aid to the visually handicapped, but never manages to develop it.[1]
1912 Product Text-to-speech Edmund Fournier d'Albe develops the Optophone, a handheld scanner that, when moved across a printed page, produces tones corresponding to specific letters or characters, so that a blind person can interpret them.[9][10]
1916 Patent American engineer John B. Flowers patents the "One-Eyed Machine Stenographer", a machine capable of reading and typing a script. It works by superimposing all the letters to find a point that marks each of them.[11]
1921 Invention Text-to-tactile sensations Italian professor Ciro Codelupi envisions the "Reading machine for the blind", capable of transforming luminous sensations into tactile sensations.[12]
1929 Invention Austrian engineer Gustav Tauschek creates the first OCR device, called the "Reading Machine", which uses a photo-sensor to detect when words match a content template in its memory.[13]
1931 Patent Text-to-telegraph Israeli physicist and inventor Emanuel Goldberg is granted a patent for his "Statistical machine" (US Patent 1838389), which was later acquired by IBM. It was described as capable of reading characters and converting them into standard telegraph code.[1]
1938 Invention MIT professor Vannevar Bush develops the Microfilm Rapid Selector, a machine similar to, but simpler than, Goldberg's statistical machine, and 40 times faster.[14]
1949 Application Engineers working at the Radio Corporation of America start a project to help the blind and the U.S. Department of Veterans Affairs, using the first text-to-speech techniques.[15]
1951 Invention Text & Morse-to-speech American cryptanalyst David H. Shepard and Harvey Cook Jr. build "Gismo", a machine able to read aloud letter by letter and interpret Morse code (U.S. Patent 2,663,758).
1952 Company The Intelligent Machines Research Corporation is founded by David H. Shepard and William Lawless Jr. to commercialise Gismo (later renamed the "Analysing Reader").[16]
1954 Application American magazine Reader's Digest becomes the first business to install an OCR reader, used to convert typewritten sales reports into punched cards.[1]
1962 Invention Portability Stanford professor John Linvill develops the Optacon, the first portable reading device for the blind.[17]
1965 Application Reader's Digest expands its OCR use to digitise the serial numbers of coupons, using an RCA 501 computer.[citation needed]
1965 Invention American inventor Jacob Rabinow develops an OCR machine to sort mail for the US Post Office.[3]
1966 Invention Handwriting scanner The IBM Rochester lab develops the IBM 1287, the first scanner capable of reading handwritten numbers.[18]
1966 Patent Linvill is granted the patent for the Optacon, described as "Reading aid for the blind" (U.S. patent 3229387).
1968 Invention Typefaces American Type Founders and Swiss designer Adrian Frutiger introduce OCR-A and OCR-B respectively, typefaces designed to facilitate OCR operations.[2][19]
1969 The US Army implements what may be one of the first major applications of OCR technology, converting its manual allotment program to a centralized system using IBM 360 computers. The process involves purchasing IBM Selectric typewriters using Time Roman font 12 for all of its finance offices around the world. The application allows all military personnel to allot portions of their paycheck through automated payroll deductions to pay bills, send to savings, and so on, which eliminates monthly processing. The success of this program paves the way for all military services to follow and eventually leads to the conversion to a fully automated pay system years later.[citation needed]
1971 Application Postal scanner Canadian postal operator Canada Post starts using OCR systems to read the names and addresses on envelopes and to print barcodes in ultraviolet ink (U.S. Patent 5420403).[20]
1974 Company Omni-font American inventor Ray Kurzweil creates Kurzweil Computer Products Inc., which develops the first omni-font OCR software, able to recognize text printed in virtually any font.[4]
1976 Company Dallas company Recognition Equipment Inc. is founded to read credit card receipts from gasoline purchases (U.S. Patent 4027141).[8]
1977 Company Commercialisation Robert Noyce founds the Caere Corporation (now Nuance Communications), and introduces the first commercial handheld OCR reader.[21]
1978 Product Kurzweil Computer Products begins selling a commercial version of the OCR computer program, called the "Kurzweil Reading Machine".[5]
1980 Sale Kurzweil's company is sold to Xerox, which renames it ScanSoft (now merged with Nuance Communications).[8]
1984 Product Passport scanner Caere Corporation develops the first passport scanner for the U.S. State Department.[22]
1987 Application Price tag scanner American retailers Sears, Kmart and J.C. Penney start using OCR to scan price tags.[20]
1989 Company OCR Russian company ABBYY is founded by David Yang, and starts selling products intended to simplify converting paper files to digital data.[23]
1992 Invention The first program that recognizes Cyrillic is invented by Russian company OKRUS.[1]
2000 Application Online service OCR technology is made available online as a service (WebOCR) in a cloud computing environment, as well as in mobile applications such as real-time translation of foreign-language signs on a smartphone.[24]
2005 Application Software The free cross-platform OCR engine Tesseract is published by Hewlett Packard and the University of Nevada, Las Vegas.
2008 Application Adobe Acrobat starts including support for OCR on any PDF file.[7]
2011 Application Word-frequency lookup Google Ngram Viewer is developed to chart the frequencies of words in sources printed between 1500 and 2008.[25][26]
2013 Application The MNIST database is created to train machine learning models in pattern recognition.[27]
2015 Application Open access Google offers OCR tools to scan any Google Drive files in over 200 languages for free.[6]

References

  1. ^ a b c d e f g Schantz, H. F. (1982) The history of OCR: optical character recognition, Recognition Technologies Users Association.
  2. ^ a b Frutiger, Adrian. Type. Sign. Symbol. ABC Verlag, Zurich, 1980. p. 50
  3. ^ a b "Optical character recognition - History". ABBYY Technology. Retrieved 18 September 2016.
  4. ^ a b J. Scott Hauger, Reading Machines for the Blind (PDF), Blacksburg, Virginia, Faculty of the Virginia Polytechnic Institute and State University, April 1995, pp. I-II, 11-13.
  5. ^ a b "Kurzweil Computer Products". www.kurzweiltech.com. Retrieved 2016-09-18.
  6. ^ a b "Paper to Digital in 200+ languages". 6 May 2015. Retrieved 2016-09-18.
  7. ^ a b "Press Room". Adobe Systems. 14 July 2009. Retrieved 4 December 2010.
  8. ^ a b c "The History of OCR". Data processing magazine. 12: 46. 1970.
  9. ^ E. E. Fournier, The Type-Reading Optophone, Our Surplus, Our Ships, and Europe's Need, and more (PDF), in Scientific American, vol. 123, no. 19, New York, Scientific American Publishing Co., November 6, 1920, pp. 463-465.
  10. ^ d'Albe, E. E. Fournier (1914-07-01). "On a Type-Reading Optophone". Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 90 (619): 373–375. Bibcode:1914RSPSA..90..373D. doi:10.1098/rspa.1914.0061. ISSN 1364-5021.
  11. ^ La macchina che legge e che scrive (PDF), in La scienza per tutti, Year XXIII, no. 11, Milano, Casa Editrice Sonzogno, 1 June 1916, p. 166. (in Italian)
  12. ^ Macchina per leggere pei ciechi (PDF), in La scienza per tutti, Year XXVIII, no. 2, Milano, Casa Editrice Sonzogno, 15 January 1921, p. 20. (in Italian)
  13. ^ "History of Computers and Computing, Birth of the modern computer, The bases of digital computers, OCR". history-computer.com. Retrieved 2016-09-09.
  14. ^ Buckland, Michael Keeble (2006-01-01). Emanuel Goldberg and His Knowledge Machine: Information, Invention, and Political Forces. Greenwood Publishing Group. ISBN 9780313313325.
  15. ^ "Reading Machine Speaks Out Loud", February 1949, Popular Science.
  16. ^ Douglas Martin (December 11, 2007). "David H. Shepard, 84, Dies; Optical Reader Inventor". New York Times. Retrieved June 5, 2010.
  17. ^ "The Reading Machine That Hasn't Been Built Yet". AccessWorld. Retrieved 18 September 2016.
  18. ^ "Rochester chronology". IBM. 23 January 2003. Archived from the original on March 28, 2008. Retrieved 18 September 2016.
  19. ^ "OCR-A Std | Typekit". typekit.com. Retrieved 2016-09-18.
  20. ^ a b "Overview of OCR and Its Applications" (PDF). Understanding Optical Character Recognition. Retrieved 18 September 2016.
  21. ^ "History of Caere Corporation – FundingUniverse". www.fundinguniverse.com. Retrieved 2016-09-23.
  22. ^ Jacobson, Gary. "No grudges, Bill Moore says, but he still seeks justice". Dallas News. Retrieved 18 September 2016.
  23. ^ "Mixergy interview: How A Bulletin Board Post Changed Everything – with David Yang". Retrieved 22 August 2013.
  24. ^ "Understanding Optical Character Recognition" (PDF). Bar Code & Data Acquisition. Retrieved 18 September 2016.
  25. ^ "Google Ngram Database Tracks Popularity Of 500 Billion Words" Huffington Post, 17 December 2010, webpage: HP8150.
  26. ^ "Culturomics, Ngrams and new power tools for Science". 10 August 2011. Retrieved 2016-09-18.
  27. ^ "MNIST handwritten digit database, Yann LeCun, Corinna Cortes and Chris Burges". yann.lecun.com. Retrieved 2016-09-18.