gramorgan/collocations

collocations

Collocation extraction using mutual information

This repo contains my term project for CS 250, completed with my partner Zhengqi Liu. It implements two applications of mutual information to collocation extraction: collocations.py extracts the set of collocations for a fixed head and dependency, and similarity.py extracts the most similar head words from a given text corpus. To view the help output for these scripts, run ./collocations.py -h or ./similarity.py -h.

Both scripts are based on algorithms outlined in "Extracting Collocations from Text Corpora" by Dekang Lin.
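As a rough illustration of the idea, here is a minimal sketch of scoring dependency triples by mutual information. It is not the code from collocations.py. It assumes triples of the form (head, relation, modifier) have already been extracted by a parser, and it assumes the head and modifier are treated as independent given the relation. The function name `triple_mutual_information` and the sample triples are hypothetical.

```python
import math
from collections import Counter


def triple_mutual_information(triples):
    """Score each distinct (head, relation, modifier) triple.

    Assumed scoring (a sketch, not the repo's implementation):
        log2( count(h, r, m) * count(*, r, *)
              / (count(h, r, *) * count(*, r, m)) )
    """
    triple_counts = Counter(triples)
    head_rel = Counter()  # count(h, r, *)
    rel_mod = Counter()   # count(*, r, m)
    rel = Counter()       # count(*, r, *)
    for (h, r, m), n in triple_counts.items():
        head_rel[(h, r)] += n
        rel_mod[(r, m)] += n
        rel[r] += n

    scores = {}
    for (h, r, m), n in triple_counts.items():
        ratio = n * rel[r] / (head_rel[(h, r)] * rel_mod[(r, m)])
        scores[(h, r, m)] = math.log2(ratio)
    return scores


# Hypothetical toy input; real triples would come from a dependency parser.
sample_triples = [
    ("drink", "obj", "water"),
    ("drink", "obj", "water"),
    ("drink", "obj", "tea"),
    ("eat", "obj", "bread"),
]
scores = triple_mutual_information(sample_triples)

# List the modifiers for a fixed head and relation, highest score first.
ranked = sorted(
    ((m, s) for (h, r, m), s in scores.items() if (h, r) == ("drink", "obj")),
    key=lambda pair: pair[1],
    reverse=True,
)
for modifier, score in ranked:
    print(f"drink -obj-> {modifier}: {score:.3f}")
```

A triple scores high when its head and modifier co-occur under a relation more often than their individual frequencies under that relation would predict. On real corpora, rare triples tend to get inflated scores, so some frequency cutoff or count adjustment is usually needed.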

These scripts require Python 3 and the textacy and spaCy libraries.
