A HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi structured data generated by the scraper component.
A Python function library to extract EEG feature from EEG time series in standard Python and numpy data structure. Features include classical spectral analysis, entropies, fractal dimensions, DFA, inter-channel synchrony and order, etc.
ConDEnSE (Confidential Data Enabled Statistical Exploration) will be a web-based environment for statistical analysis of confidential data from various database sources, based on Plone and R, and using the Jackknife method of confidentiality protection.
Helps compatibility with Vernier's Graphical Analysis and Logger Pro software. Includes a converter to extract important data out of Vernier .ga3 and .cmbl data files, and spreadsheet to analyze the data with tables, graphs, and curve fitting.
PMML-compliant scoring engine and analytic toolkit
Augustus development has moved to google code. The new project page is augustus.googlecode.com. New releases of the project are not currently being released to sourceforge.
Augustus is designed for statistical and data mining models and produces and consumes models with 10,000s of segments.
Versions of Augustus support PMML 3, 4.0.1, and 4.1.
The Simple Versatile Plotting (SVP) tools create camera-ready plots of performance analysisdata gathered from high performance computing (HPC) applications.
A lightweight, browsing-based, 100% Python, federated data integration framework. Users may create custom schemas for disparate sources, query and expand results across sources to find related data; for use in fields such as bioinformatics and datamining
A windrose, also know as polar rose plot, is a special diagram for representing the distribution of meteorological datas, typically wind speeds by class and direction.
A set of services to allow for polling remote sites for data, displaying it on the local machine requesting said data, and moving that data anywhere else it needs to go.
The Serial Data Acquisition is a lightweight data acquisition system able to parse a vast majority of mostly unidirectional streams. Results are saved in a SQLite DB and accessible over XML-RPC or plain HTTP. Its design is modular and easily extendable.
The Model Interaction Environment for Neuroscience provides tools for development, searching, editing, execution, and visualization of biophysical models, abstract mathematical models, and experimental protocols used in neuroscience research.
sarface is a user-interface to the sysstat/sar database which inputs data from sar and plots to a live X11 graph via gnuplot. It mimics the cmd-line options from sar but can cross-plot any two or more stats and apply simple mathematical functions them.
the pIG bulk-processes data files generated by 'popular' scientific behavioural programs, including Med Associates MedIV-PC, Coulburn TruScan and Kinder Scientific.
KML is a knowledge base with support of logical modeling. Advanced model is used to represent knowledge as a set of statements similar to natural language sentences. This project hosts a set of model storage library and server (vrb-ols) and clients.
Ocular is a spreadsheet written entirely in python. Cell contents are evaluated by python after any standard spreadsheet coordinates are parsed. This allows the full Monty from Python to be implemented in a visual environment.
PyPotrace is a Python programming language binding for Peter Selinger's Potrace raster to vector conversion algorithm. No separate Potrace library or application required!
Crawl a LiveJournal-based blog hosting for friends data. LiveJournal is a blog hosting engine that allows its users to list others as friends. This tool can download most of the data about friendship relations between users for later processing.
Benetech Human Rights Program Utility Library: Collection of utilities used by the Benetech Human Rights programming for processing human rights data in preparation for statistical analysis. Library handles DSV files, name canonicalization, binarray, et
clusterviz allows to cluster three-dimensional data. The clustering process is visualized using OpenGL. As clustering algorithms the family of k-means algorithms is implemented, including mixture models.
PyVisi is the Python visualisation interface: a generic Python-based interface to many different scientific visualisation packages. PyVisi is intended for use in both standard and high-performance computing environments, and with the escript package.