Link data sets (CC-BY)

You can download a growing number of link databases derived from COW14 corpora from this repository:

FU Berlin COW link databases

As opposed to the corpora, the ngram databases can be used freely under a permissive Creative Common CC-BY license. Notice that the BY in CC-BY implies that you have to cite the appropriate papers specified here (always check this link immediately before you publish anything based on COW data):

http://corporafromtheweb.org/category/cow-citation/