Digital Philologist's toolbox.
- proc: Processing tools and classifiers.
- clf: General classification utilities.
- authorship clf: Classifier for determining likely authorship of texts.
- freq matrix: Computes frequency matrices from text data.
- metaphor sentence clf: Identifies metaphorical expressions in sentences.
- semantic models: Implements semantic vector space models (e.g., word embeddings).
- sentence clf: Classifies sentences based on predefined criteria.
- sim matrix: Calculates sentence or word similarity matrices.
- tfidf sentence clf: Applies TF-IDF methods for sentence-level classification.
- token clf: Classifies individual tokens (e.g., for part-of-speech tagging).
- vis: Visualization tools for data, models, or classification results.
- io: Handles input/output operations.
- misc: Miscellaneous utilities.
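
As a rough illustration of what the freq matrix and sim matrix entries describe, here is a minimal sketch in plain Python. It does not use the toolbox's own API: the function names `freq_matrix`, `cosine`, and `sim_matrix`, the whitespace tokenizer, and the choice of cosine similarity are all assumptions made for this example.

```python
from collections import Counter
import math


def freq_matrix(sentences):
    """Hypothetical helper: return (vocab, matrix), where matrix[i][j]
    is the count of vocab[j] in sentence i."""
    # Naive tokenization: lowercase and split on whitespace (an assumption).
    tokenized = [s.lower().split() for s in sentences]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    counts = [Counter(toks) for toks in tokenized]
    matrix = [[c[word] for word in vocab] for c in counts]
    return vocab, matrix


def cosine(u, v):
    """Cosine similarity of two equal-length count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    # Define the similarity as 0.0 when either vector is all zeros.
    return dot / norm if norm else 0.0


def sim_matrix(matrix):
    """Pairwise cosine similarity between the rows of a frequency matrix."""
    return [[cosine(u, v) for v in matrix] for u in matrix]


sentences = ["arma virumque cano", "arma cano", "gallia est omnis"]
vocab, freqs = freq_matrix(sentences)
sims = sim_matrix(freqs)
```

Here `sims[0][1]` is high because the first two sentences share two of their words, while `sims[0][2]` is zero because the first and third share none. A real pipeline would likely use a proper tokenizer and sparse matrices; this sketch only shows the shape of the data.
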
See the documentation for more. The API documentation can be generated with pydoctor and converted to a Dash docset with doc2dash:
- pip install pydoctor
- pydoctor --add-package .
- pip install doc2dash
- doc2dash apidocs