DATASET: Handwritten Language and Writer ID Dataset
This collection contains handwritten document images scanned at 300dpi in 9 different languages, and written by different writers. The documents were taken from pre-existing student notes. The documents can be used for both page classification and writer identification. There is no content level ground truth, on language and writer information.
This dataset has been used by the University of Maryland in some or all of the publications listed below. It is being distributed through this site for research purposes only, and should not be redistributed. Any reference to, or use of the data should include the following citation:
University of Maryland, Laboratory for Language and Media Processing (LAMP) , Handwritten Language and Writer ID Dataset, http://lamp.cfar.umd.edu, (year of download)
David Doermann: email@example.com
Last Updated: Tuesday 20 December, 2011