Truecasing is the problem in natural language processing (NLP) of determining the proper capitalization of words where such information is unavailable. This commonly comes up because of the standard practice (in English and many other languages) of automatically capitalizing the first word of a sentence, which hides whether that word would be capitalized elsewhere. It also arises in badly cased or uncased text (for example, all-lowercase or all-uppercase text messages). Truecasing aids many other NLP tasks, such as named entity recognition, machine translation and Automatic Content Extraction.
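As a rough illustration of the task, the following is a minimal sketch of one simple baseline: restore each word to the surface form most often seen for it in well-cased training text. This is an assumption chosen for illustration, not a method described in this article. The function names `train_truecaser` and `truecase` are hypothetical, tokens are split on whitespace only, sentence-initial tokens are skipped during training because their capitalization is uninformative, and unseen words are simply lowercased.

```python
from collections import Counter, defaultdict


def train_truecaser(sentences):
    """Map each lowercased word to its most frequent surface form.

    The first token of every sentence is skipped, since it is
    capitalized regardless of its true case.
    """
    counts = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.split()
        for token in tokens[1:]:
            counts[token.lower()][token] += 1
    return {word: forms.most_common(1)[0][0] for word, forms in counts.items()}


def truecase(text, model):
    """Recase whitespace-separated tokens; unseen words fall back to lowercase."""
    return " ".join(model.get(tok.lower(), tok.lower()) for tok in text.split())


# Tiny hypothetical training corpus, for demonstration only.
corpus = [
    "We flew to Paris in May",
    "They visited Paris and London",
    "Yesterday the weather in London was mild",
]
model = train_truecaser(corpus)
result = truecase("THEY FLEW TO PARIS AND LONDON", model)
```

With this toy corpus, `result` is `they flew to Paris and London`. The sketch does not recapitalize the start of the sentence and ignores context entirely, so an ambiguous word such as "may" versus "May" always receives its majority form.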
Truecasing is unnecessary in languages whose scripts do not distinguish between uppercase and lowercase letters. This includes languages written in scripts other than the cased ones (chiefly the Latin, Greek, Cyrillic and Armenian alphabets), such as Japanese, Chinese, Thai, Hebrew, Arabic and Hindi.
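One way a pipeline might decide whether truecasing applies at all is to test whether the text contains any character with distinct uppercase and lowercase forms. The sketch below assumes Python's built-in Unicode case mappings are an adequate proxy for "the script has case"; the helper name `has_case_distinction` is hypothetical.

```python
def has_case_distinction(text):
    """Return True if any character changes under upper/lower case mapping."""
    return any(ch.lower() != ch.upper() for ch in text)


latin = has_case_distinction("truecasing")    # Latin script has case
greek = has_case_distinction("Ελληνικά")      # Greek script has case
japanese = has_case_distinction("日本語")      # no case distinction
hebrew = has_case_distinction("שלום")         # no case distinction
```

Here `latin` and `greek` are `True`, while `japanese` and `hebrew` are `False`, so a truecaser could skip the latter two inputs.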