COW

Free state-of-the-art web corpora, frequency lists, and link data

Menu

Skip to content
  • Corpora
    • Dutch
    • English
    • French
    • German
    • Spanish
    • Swedish
  • Access
    • Web interface
    • RStudio and Python
    • Corpus download
    • Frequency lists (CC-BY)
    • Link data sets (CC-BY)
  • Research
  • People
    • Current Contributors
    • Former Contributors
  • Research with COW
  • Impressum (DE)
  • Datenschutz (DE)

Roland Schäfer (since 2011)

Founding member if the COW initiative. Areas of expertise:

  • crawling (ClaraX random walker with texrex)
  • linguistic web characterization
  • document classification (COReCo and COReX frameworks)
  • web page cleaning/processing (texrex software suite)
  • linguistic annotation (COW toolchain)
  • languages: English, German, Swedish

Roland Schäfer’s personal homepage

Post navigation

← Dutch Linguistics Department, Freie Universität Berlin Felix Bildhauer (since 2011) →

If you publish or present research based on COW, please notify us.

Recent Posts

  • RanDECOW17 March 6, 2019
  • COReX 2018 feature set and databases March 5, 2019
  • DECOW16 (A and B) September 4, 2018
  • Linguistic web characterisation June 24, 2017
  • Correct citation of COW corpora June 24, 2017
 
Funded by the German Research Council (DFG)