This data package contains 2 millions english collocations with example sentences.
Extensive database of 2 million English collocations for 43 thousand English words.
Collocations have been extracted from a dependency-parsed corpus with more than 2 billion words. Main text sources were:
- 20,000 books from Project Gutenberg
- full text of the English Wikipedia
- British National Corpus
Each collocation includes the following information:
- Collocation (e.g. heavy smoker)
- 3 English examples
- Basis word
- Syntactic relation
|verb-direct object-indirect object||lend drilling machine neighbour|
|verb-prepositional object||wobble onto floor|
|verb-direct object-prepositional object||drive nail into wall|
|verb-subclause verb||let move|
|verb-subclause verb with „to||force to resign|
|noun with genitive attribute||man’s friend|
|noun compound||mega prize|
|noun with prepositional phrase||cloud of smoke|
You can query all 4 millions collocations in this online demo tool: http://linguatools.de/kollokationen-en/
You can test the free API.
If you license the data package of the English Collocations you will recieve the data package as XML, CSV, sqlite3 or in another desired format as a file for download.
Licensing conditionsOnly for commercial use.
Please contact Peter Kolb (email@example.com) for more information.
English collocations as API
For the english collocations we provide an API.
Description and testing of the API: https://linguatools.org/language-apis/linguatools-collocation-api/
Other language APIs by linguatools: https://linguatools.org/language-apis/
An overview of all collocation databases: https://linguatools.org/online-projects/collocation-database/