paperlined.org
academics
>
linguistics
document updated 2 years ago, on Nov 17, 2022
Linguistics corpora
at english-corpora.org
— News on the Web (NOW), iWeb: The Intelligent Web-based Corpus, Global Web-Based English (GloWbE), Wikipedia Corpus, Coronavirus Corpus, Corpus of Contemporary American English (COCA), etc
Syed, Munira, et al. "
Unified Representation of Twitter and Online News Using Graph and Entities.
"
Frontiers in big Data
4 (2021).
github.com/niderhoff/nlp-datasets
(list at Linguistic Data Consortium)