Word and phrase counting software
A parallel corpora (bitext) aligning tool. Create TMX databases
Drug name extraction
Experimental Java library for reading and writing GrAF/XML files.
Extract title and creation time from web page.
An automatic restoration of Arabic diacritic marks
Calculate semantic similarity for any human and human-like languages
An Arabic Corpora Processing Tool
A polylingual dictionary/ontology system
A Single Click Language Changer and Publishing System for Web and DTP
JSON based text search Java Project
No more support for this project - TAKE A LOOK AT FALCONSEARCH
Generator for textual models by applying different techniques
Lemmatization tool for morphological analysis of biomedical literature
automatic alignment pipeline for parallel treebanks
Java API and tools for performing NLP and other AI tasks
Simply convert your PDF files into audio books
Additional dictionary files for the NetBeans spellchecker.
This implements a phrased-based hidden semi-Markov Model for SMT
Similarity Word-Sequence Kernels for Sentence Clustering toolkit