Corpus

usgb/ˈkɔːrpəs/
noun

A collection of written or spoken material in machine-readable form, for example for linguistic analysis.

We used a large linguistic corpus to train our language model.
Visual representation of "corpus" - A collection of written or spoken material in machine-readable form, for example for linguistic analysis.

Often appears as...

  • linguistic corpus
  • corpus of texts

Usage tips

Technical

Definition 1 of 4
Visual representation of "corpus"
LampPro Tip 1/2

Language Research

A 'corpus' in computing is often used for analyzing languages, like forming grammar rules.

Illustration for Language Research
Researchers compiled a corpus of spoken English to study regional dialects.
LampPro Tip 2/2

Machine Learning

In computing, corpora are crucial for training language models in AI, like chatbots.

Illustration for Machine Learning
The AI's understanding improved after feeding it a more diverse language corpus.
Visual representation of the word "Corpus"

Never forget "Corpus"

Humans forget easily. That's why you should download WordUp: Smart reminders, word games, AI practice, and much more!

Download on the App StoreGet it on Google PlayGet it from MicrosoftGet it on AppGallery
Chrome

WordUp Chrome Extension

As you browse the web instantly look up words you don’t know.

Get Chrome Extension