paperlined.org
academics
>
data_mining
document updated 11 years ago, on Dec 19, 2012
text corpus
given names
Census.gov
Given Name Frequency Project
data.govt.nz
wiktionary
(downloadable
here
)
CommonCrawl.org
— web-crawling data
more at
Wikipedia's Category:Open_data