Standalone. Small. Language-neutral. BudouX is the successor to Budou, the machine learning-powered line break organizer tool. It is standalone. It works with no dependency on third-party word segmenters such as Google cloud natural language API. It is small. It takes only around 15 KB including its machine learning model. It's reasonable to use it even on the client-side. It is language-neutral. You can train a model for any language by feeding a dataset to BudouX’s training script.

Features

  • BudouX supports HTML inputs
  • Documentation available
  • Examples available
  • You can get a list of phrases by feeding a sentence to the parser
  • BudouX supports HTML inputs and outputs HTML strings
  • BudouX uses the AdaBoost algorithm to segment a sentence into phrases

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

Apache License V2.0

Follow BudouX

BudouX Web Site

Other Useful Business Software
The full-stack observability platform that protects your dataLayer, tags and conversion data Icon
The full-stack observability platform that protects your dataLayer, tags and conversion data

Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of BudouX!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2024-08-19