Showing 49 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

View related business solutions
  • Premier Construction Software Icon
    Premier Construction Software

    Premier is a global leader in financial construction ERP software.

    Rated #1 Construction Accounting Software by Forbes Advisor in 2022 & 2023. Our modern SAAS solution is designed to meet the needs of General Contractors, Developers/Owners, Homebuilders & Specialty Contractors.
    Learn More
  • Rezku Point of Sale Icon
    Rezku Point of Sale

    Designed for Real-World Restaurant Operations

    Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
    Learn More
  • 1
    XLM (Cross-lingual Language Model)

    XLM (Cross-lingual Language Model)

    PyTorch original implementation of Cross-lingual Language Model

    ...Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence labeling tasks such as XNLI, NER, and POS without target-language supervision. The repository provides preprocessing pipelines, training code, and fine-tuning scripts so you can reproduce benchmark results or adapt models to your own multilingual corpora. Pretrained checkpoints cover dozens of languages and multiple model sizes, balancing quality and compute needs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP), the goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    NLP.js

    NLP.js

    An NLP library for building bots

    NLP.js is an NLP library for building bots, with entity extraction, sentiment analysis, automatic language identifier, and much more. "NLP.js" is a general natural language utility for nodejs. Search the best substring of a string with less Levenshtein distance to a given pattern. Get stemmers and tokenizers for several languages. Sentiment Analysis for phrases (with negation support). Named Entity Recognition and management, multi-language support, and acceptance of similar strings, so the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PyTorch Natural Language Processing

    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    ...It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. For example, check out this example code for training on the Stanford Natural Language Inference (SNLI) Corpus. Now you've setup your pipeline, you may want to ensure that some functions run deterministically. Wrap any code that's random, with fork_rng and you'll be good to go. Now that you've computed your vocabulary, you may want to make use of pre-trained word vectors to set your embeddings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • The AI workplace management platform Icon
    The AI workplace management platform

    Plan smart spaces, connect teams, manage assets, and get insights with the leading AI-powered operating system for the built world.

    By combining AI workflows, predictive intelligence, and automated insights, OfficeSpace gives leaders a complete view of how their spaces are used and how people work. Facilities, IT, HR, and Real Estate teams use OfficeSpace to optimize space utilization, enhance employee experience, and reduce portfolio costs with precision.
    Learn More
  • 5
    Seq2Seq Chatbot

    Seq2Seq Chatbot

    Chatbot in 200 lines of code using TensorLayer

    Seq2Seq Chatbot is an implementation of a sequence-to-sequence chatbot model using TensorLayer, demonstrating how to build conversational agents with minimal code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    GT NLP Class

    GT NLP Class

    Course materials for Georgia Tech CS 4650 and 7650

    ...Clear organization and self-contained examples make it possible to follow along outside the classroom, using the repo as a self-study resource. For learners and instructors alike, the course provides a coherent path from foundational linguistics to current techniques, with reproducible code that makes concepts concrete.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    This ohnlp project has released "pipelines" that were contributed by members of the OHNLP Consortium. The pipelines are based on the Apache UIMA framework. medKAT/P, MedCoref, MedTagger, MedXN, and cTAKES are licensed under Apache License V2.0. MedTime is licensed under GNU General Public License version 3.0 (GPLv3). cTAKES development has moved to apache.org. See http://ctakes.apache.org/
    Downloads: 1 This Week
    Last Update:
    See Project
  • The full-stack observability platform that protects your dataLayer, tags and conversion data Icon
    The full-stack observability platform that protects your dataLayer, tags and conversion data

    Stop losing revenue to bad data today. and protect your marketing data with Code-Cube.io.

    Code-Cube.io detects issues instantly, alerts you in real time and helps you resolve them fast. No manual QA. No unreliable data. Just data you can trust and act on.
    Learn More
  • 10
    Osman Arabic Text Readability

    Osman Arabic Text Readability

    Open Source tool for Arabic text readability

    ...This makes the tool useful for researchers and educators working with Arabic text. All the readability metrics mentioned in Section \ref{calcRead} are included within the open source code, they all work with vocalised and non-vocalised text but based our results presented here we recommend adding the diacritics in by using the addTashkeel() method. See the files sections for the vocalised version of UN Arabic English parallel paragraphs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Darkbot

    The IRC's Talking Robot

    [ Please read https://sourceforge.net/p/darkbot/news/2014/01/darkbots-revitalization/ ] Darkbot is a portable IRC chat robot written in the C language that can be taught responses to user inquiries, and even have conversations with them. Darkbot was originally created by Jason Hamilton as an aid for help channels on Intenet Relay Chat.
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12

    cosd_java

    A Tool for Searching and Demonstrating Code Structure

    CoSD is a tool for searching and demonstrating code structure based on graph database. Since some commercial code is involved in the industrial version of CoSD , we just give out a baseline version of it. We also provide a quick demo for this baseline version. To use CoSD, please read the file readme.txt at first.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Graphical Grammar Studio

    Graphical Grammar Studio

    An user friendly grammar tool for natural language processing tasks

    ...Nodes are used for consuming words from the input, for executing jumps to other graphs in the grammar or for creating annotations etc. GGS has a unique feature: It allows the user to write JavaScript code to be executed for nodes of the grammar. This is useful for checking grammatical agreements but not only. The user can: declare variables (including complex js structures), check for boolean conditions, use variables in annotations etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    neural network designer

    neural network designer

    a dbms for neural nets. Chatbots, DTrees, random forests, n-grams,...

    This project consists out of a windows based designer application and a library (that can run on multiple platforms, including android) together with several demo applications (including an MVC3 chatbot client and an android application). It is probably best compared to a database management system, but for neural networks instead of relational data. As such, the library is optimized for handling any type of data-size by using advanced streaming and caching algorithms. With the designer,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15

    TextProcessor

    A Java package to preprocess text datasets for posterior text analysis

    The TextProcessor Java package is a text processing toolkit, which provides some frequently used text processing functions such as stemming, removing stop-words, generating a term vocabulary, and calculating the term-doc frequency matrix. Basic topic mining models such as LDA and sparse NMF are also supported. The package can also generate feature files from a given text dataset with LDA and LIBSVM format for posterior procedures such as classification or clustering. The toolkit is also...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    miac-p

    Code for syntactic parsing and other NLP apps.

    Code for syntactic parsing and other natural language processing applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ...Calvo (2012) “Using Information Extraction to Generate Trigger Question for Academic Writing Support”, 11th International Conference on Intelligent Tutoring Systems, Crete, Springer LNCS 7315, p.360-369. 2.Ming Liu, Rafael A. Calvo, Anindito Aditomo and Luiz Augusto Pizzato (2012), “Using Wikipedia and Conceptual Graph Structures to Generate Questions for Academic Writing Support”, IEEE Transactions on Learning Technologies, vol. 5, no. 3, pp. 251-263.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ...As an output of the analysis, we have a highly informative table mainly containing vocalization of the stem, its grammatical category, its possible roots associated with corresponding patterns, proclitics and enclitics. A new version is available on the following link: http://oujda-nlp-team.net/?p=1299&lang=en How to cite the project: Boudlal, A., Lakhouaja, A., Mazroui, A., Meziane, A., Bebah, M. O. A. O., & Shoul, M. (2010). Alkhalil morpho sys1: A morphosyntactic analysis system for arabic texts. In International Arab conference on information technology (pp. 1-6). Benghazi Libya.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    Foma is a compiler, programming language, and C library for constructing finite-state automata and transducers for various uses. It has specific support for many natural language processing applications such as producing morphological analyzers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project is contains implementations of algorithms to integrate the output of different NLP tools (part of speech taggers, morphologies, parsers, etc.) in order to obtain more accurate, more robust and more fine-grained linguistic analyses. Note that the code is outdated, but left here for documentation purposes. Its functionality may be reimplemented within the NLP2RDF project (http://code.google.com/p/nlp2rdf).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A library of Natural Language Processing (NLP) functions for use in REALbasic (RB) programs, all implemented in pure RB code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Some NLP experiments starting with a tokenization attempt in Python. The code tokenite.py reads a text file "blog1.txt" and tries to tokenize it. The code doesnot work as is, but is almost on the verge of working. Any suggestions will be greatly appreciated. I define a class called text and define methods inside it. The method count defines a generator which I use in the method named t_tok. But if you look closely at 66 to 72 you will see that I am modifying the outer limit of the for loop while in the loop. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    NLP-Template-Engine

    NLP template engine using QAS

    Natural Language Processing (NLP) template engine. (Using question answering systems and machine learning classifiers.)
    Downloads: 0 This Week
    Last Update:
    See Project