This article will be permanently flagged as inappropriate and made unaccessible to everyone.
Are you certain this article is inappropriate?
Political / Social
A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions. In Speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognition engine). In Linguistics, spoken corpora are used to do research into Phonetic, Conversation analysis, Dialectology and other fields.
A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases).
There are two types of Speech Corpora:
A special kind of speech corpora are non-native speech databases that contain speech with foreign accent.
Relational model, ACID, Database normalization, SQL, Parallel computing
Machine learning, Chinese language, Speech recognition, Corpus linguistics, English language
Natural Language Processing, Corpus linguistics, Linguistics, Parsing, Speech recognition
Communication, Language acquisition, CHILDES, Speech corpus, Brian MacWhinney
Telephony, Speech recognition, Voice over IP, Speech corpus, Language model
Natural Language Processing, Machine learning, Latent Dirichlet allocation, Bioinformatics, University of Massachusetts Amherst