We have completed the full research corpus by adding another 10 journals to the already compiled corpus of the Global Environmental Change (GEC) research articles. In the same manner as with GEC, only the main body of each paper has been included into the corpus. Thus, all the contents of abstracts, footnotes, boxes, references and appendices have been excluded from the data set.
In total, the corpus consists of 11 journals, and includes 11462 journal articles which amount to more than 53 million word tokens making it of the largest specialised corpora. In comparison, general language corpora like, for example, the British National Corpus (BNC) and Bank of English (BoE), consist of 100 and 450 million word tokens respectively. Thus, our corpus represents a significant body of both monodisciplianry and interdisciplinary discourse. Continue Reading