TextAnalysis.jl

Julia package for text analysis
Popularity
352 Stars
Updated Last
12 Months Ago
Started In
May 2012

TextAnalysis

A Julia package for working with text.

CI version docs

Introduction

TextAnalysis provides support for standard tools and models for working with textual data and natural languages in the Julia language.

Features

  • Container type for Document and Corpus
  • DocumentTermMatrix and TF/IDF
  • LSA/LDA
  • Vocabulary and statistical Language Model
  • Co-occurance matrix
  • NaiveBayes classifier
  • ROUGE evaluation metrics

This package also incorporates features from the Languages and WordTokenizers packages within the JuliaText ecosystem.

TextModels

The TextModels package enhances this library with the additon of practical neural network based models. Some of that code used to live in this package, but was moved to simplify installation and reduce the number of dependencies.

Installation

pkg> add TextAnalysis

Contributing and Reporting Bugs

Contributions, in the form of bug-reports, pull requests, additional documentation are encouraged. They can be made to the Github repository.

All contributions and communications should abide by the Julia Community Standards.

Support

Feel free to ask for help on the Julia Discourse forum, or in the #natural-language channel on julia-slack. (Which you can join here). You can also raise issues in this repository to request new features and/or improvements to the documentation and codebase.