Web Corpus Construction
Bildhauer, Felix / Schäfer, Roland![Web Corpus Construction](https://support.digitalhusky.com/media/annotations/sorted/390/39048785/CHSBZCOP0339048785.jpg)
The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranki...