V11 N2 Paper 1
|
Annals of the MS in Computer Science and Information Systems at
UNC Wilmington
|
Fall 2017
|
Enhance Handwriting Script Recognition with Language Modeling
Yunkai Xiao
Committee
Abstract
Current technology could already treat the handwriting-script recognition problem very well in certain domain. On other domains, such as recognizing damaged, faded, or stained scripts, the recognition rate are still not perfect. This paper intends to compare two language models (n-gram and LSTM network) that could potentially be used in enhancing the recognition rate for such domain by making prediction to the missing word. We found that with limited context given, the traditional n-gram model with Katz smoothing performs around 10 times better than the LSTM network. As a conclusion when the context given is limited, one should avoid using LSTM network on word prediction, and only use it when enough context is given. In the case of enhancing handwriting script recognition, this means to avoid using LSTM network when too many words are damaged or missing from the script.
download
(pdf)
Recommended Citation:
Xiao, Y., Guinn, C, Kline, D., Ricanek, K. (2017) Enhance Handwriting Script Recognition with Language Modeling. Annals of the Master of Science in Computer Science and Information Systems at UNC Wilmington, 11(2) paper 1. http://csbapp.uncw.edu/data/mscsis/full.aspx.
V11 N2 Paper 1
|
Annals of the MS in Computer Science and Information Systems at
UNC Wilmington
|
Fall 2017
|