Stream-oriented Java library and a set of command line tools for high quality sentence boundary detection. (Sentence segmentation / splitting / disambiguation). Currently has one model for German (trained on general text and Wikipedia lynx dumps).

Features

  • model for German (trained on general text and wikipedia lynx dumps)
  • highly accurate
  • handles a wide range of potential boundaries
  • can cope with headlines, lists, tables
  • preserves whitespace
  • stream-oriented

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow Sentrick

Sentrick Web Site

Other Useful Business Software
EasySend is a no-code platform that transforms customer journeys Icon
EasySend is a no-code platform that transforms customer journeys

Defy form limits. 
Create digital experiences.

Evolve forms into smart, AI-powered digital workflows that streamline your data intake and elevate customer experiences.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Sentrick!

Additional Project Details

Programming Language

Java, Prolog

Registered

2010-01-31