Search Results for "faculty evaluation system"

Sort By:

Showing 324 open source projects for "faculty evaluation system"

View related business solutions

AestheticsPro Medical Spa Software
Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.

Learn More
Collect! is a highly configurable debt collection software
Everything that matters to debt collection, all in one solution.

The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.

Learn More
1

Quantitative Trading System

A comprehensive quantitative trading system with AI-powered analysis

Quantitative Trading System is a comprehensive quantitative trading platform that integrates artificial intelligence, financial data analysis, and automated strategy execution within a unified software system. The project is designed to provide an end-to-end infrastructure for building and operating algorithmic trading strategies in financial markets. It includes tools for collecting and processing market data from multiple sources, performing statistical and machine learning analysis, and...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
2

Gorse Recommender System Engine

An open source recommender system service written in Go

An open-source recommender system service written in Go. Recommend items from Popular, latest, user-based, item-based and collaborative filtering. Search the best recommendation model automatically in the background. Support horizontal scaling in the recommendation stage after single node training. Support Redis, MySQL, Postgres, MongoDB, and ClickHouse as its storage backend. Expose RESTful APIs for data CRUD and recommendation requests. Analyze online recommendation performance from...

Downloads: 7 This Week

Last Update: 6 days ago
See Project
3

Typst

A new markup-based typesetting system that is powerful and easy

...Whether in the classroom, the faculty office, or at home. Typst runs in your browser, so everyone on the team can just start writing.

Downloads: 22 This Week

Last Update: 2025-12-12
See Project
4

VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs)

VLMEvalKit is an open-source evaluation toolkit designed for benchmarking large vision-language models that combine visual understanding with natural language reasoning. The toolkit provides a unified framework that allows researchers and developers to evaluate multimodal models across a wide range of datasets and standardized benchmarks with minimal setup. Instead of requiring complex data preparation pipelines or multiple repositories for each benchmark, the system enables evaluation through simple commands that automatically handle dataset loading, model inference, and metric computation. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime

CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.

Learn More
5

Opik

Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI

Confidently evaluate, test, and monitor LLM applications. Opik is an open-source platform for evaluating, testing, and monitoring LLM applications. Built by Comet. Record, sort, search, and understand each step your LLM app takes to generate a response. Manually annotate, view, and compare LLM responses in a user-friendly table. Log traces during development and in production. Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation...

Downloads: 10 This Week

Last Update: 16 hours ago
See Project
6

Hallucination Leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations

...By focusing on hallucination rates rather than traditional metrics such as accuracy or fluency, the benchmark highlights an important aspect of AI system safety and trustworthiness. The leaderboard is regularly updated as new models are released and evaluation methods evolve.

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
7

Easy DataSet

A powerful tool for creating datasets for LLM fine-tuning

...The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. Beyond dataset creation, Easy-dataset also provides a built-in evaluation system with model testing and blind-test features, helping teams validate model performance using curated test sets.

Downloads: 9 This Week

Last Update: 2026-04-10
See Project
8

darwin-skill

Autoresearch-inspired autonomous skill optimization for Claude Code

darwin-skill is an experimental framework designed to automatically improve AI agent “skills” through iterative evaluation and optimization loops inspired by machine learning training processes. Instead of treating prompts or skill definitions as static assets, the system applies a continuous improvement cycle that evaluates performance, proposes changes, tests outcomes, and either retains or reverts modifications. The framework introduces a scoring system across multiple dimensions, enabling quantitative assessment of skill quality and ensuring that only improvements are preserved over time. ...

Downloads: 0 This Week

Last Update: 13 hours ago
See Project
9

DeepSeek-OCR 2

Visual Causal Flow

...The repository provides model code and inference scripts that let researchers and developers run and benchmark the system on both images and PDFs, with support for batch evaluation and optimized pipelines leveraging vLLM and transformers.

Downloads: 13 This Week

Last Update: 2026-02-03
See Project
Iris Powered By Generali - Iris puts your customer in control of their identity.
Increase customer and employee retention by offering Onwatch identity protection today.

Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.

Learn More
10

i-Educar

Launching the most free educational software in Brazil

Accessible from anywhere and with single student registration available for the entire education network. Time-saving for everyone. Get current quantitative, financial and statistical data on all processes, at the time and place you want. Evaluation system and reports adapted to the different realities of the country, with numerical, conceptual or descriptive evaluation notes. Management of allocations, removals, substitutions, absences and delays, offering an integrated view of all professionals. Time frame management for analysis of demands and availability of professionals in the education network in each school period. ...

Downloads: 2 This Week

Last Update: 2025-07-01
See Project
11

RecBole

A unified, comprehensive and efficient recommendation library

...We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms. RecBole is developed based on Python and PyTorch for reproducing and developing recommendation algorithms in a unified, comprehensive and efficient framework for research purpose. It can be installed from pip, conda and source, and is easy to use. We have implemented more than 100 recommender system models, covering four common recommender system categories in RecBole and eight toolkits of RecBole2.0, including General Recommendation, Sequential Recommendation, Context-aware Recommendation, and Knowledge-based Recommendation and sub-packages.

Downloads: 0 This Week

Last Update: 2025-02-23
See Project
12

Agent Behavior Monitoring

The open source post-building layer for agents

Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance analysis. Judgeval transforms agent interaction trajectories into structured evaluation datasets that can be used for reinforcement learning, supervised fine-tuning, or other forms of post-training improvement. ...

Downloads: 0 This Week

Last Update: 2026-04-09
See Project
13

FIT Framework

An enterprise-level AI development framework

FIT Framework is an open-source infrastructure designed to support the development, training, and evaluation of machine learning and AI models through a modular and scalable architecture. It aims to streamline the lifecycle of AI systems by providing standardized components for data processing, model training, evaluation, and deployment. The framework is particularly useful for research and production environments where reproducibility and consistency are critical, as it enforces structured workflows and configurable pipelines. ...

Downloads: 0 This Week

Last Update: 2026-03-19
See Project
14

autoresearch for AMD

AI agents running research on single-GPU nanochat training

autoresearch for AMD is a framework for autonomous scientific experimentation in machine learning, enabling AI agents to iteratively improve models through a continuous loop of hypothesis generation, experimentation, and evaluation. The system is built around a minimal structure that includes a data preparation module, a training script that can be modified, and a program specification that guides the agent’s decision-making process. During each iteration, the agent edits the training code, runs an experiment within a fixed time budget, evaluates performance metrics, and decides whether to retain or discard the changes. ...

Downloads: 1 This Week

Last Update: 2026-03-30
See Project
15

LangWatch

The platform for LLM evaluations and AI agent testing

LangWatch is an open-source observability and monitoring platform designed to help developers evaluate and improve applications built with large language models. The platform provides tools for tracking model interactions, analyzing prompt behavior, and identifying issues such as hallucinations, latency problems, or unexpected responses. By collecting telemetry data from AI applications, LangWatch allows developers to understand how their systems perform in real-world usage scenarios. The...

Downloads: 1 This Week

Last Update: 5 days ago
See Project
16

Auto-Deep-Research

Your Fully-Automated Personal AI Assistant

Auto-Deep-Research is a system designed to fully automate deep research workflows using language models, retrieval, planning, and multi-stage reasoning to produce structured research artifacts such as surveys, benchmarks, reports, and even prototypes without heavy human intervention. Users provide a research topic or multifaceted goal, and the system autonomously breaks the objective down into subtasks like literature collection, critical summarization, cross-comparison, citation extraction, metric evaluation, and structured writing. ...

Downloads: 2 This Week

Last Update: 2026-02-03
See Project
17

Output

TypeScript framework for building AI workflows and agents

Output is an open-source TypeScript framework designed to build, orchestrate, and manage AI workflows and agents within a single unified system. It consolidates multiple aspects of AI development, including prompt management, evaluation, tracing, cost tracking, and orchestration, into a file-based architecture that lives entirely within the codebase. The framework is built specifically to work with AI coding agents, enabling them to read, modify, and execute workflows directly from structured project folders. ...

Downloads: 0 This Week

Last Update: 2 hours ago
See Project
18

Kiln

Open source platform for managing, testing, and deploying AI apps

Kiln is an open source platform designed to help developers build, evaluate, and deploy AI-powered applications with greater structure and reliability. It provides a unified environment for managing prompts, datasets, and evaluation workflows, allowing teams to iterate on AI behavior in a controlled and measurable way. Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic testing of AI systems by defining evaluation criteria and running experiments to assess performance over time. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
19

GrowthBook

Open source feature flagging and AB testing platform

GrowthBook is an open-source platform for feature flagging and AB testing built to give teams the power of a fully-featured experimentation system without building it entirely from scratch. It supports both self-hosted and cloud-hosted deployment models, giving organizations the flexibility to own their infrastructure or consume it as a managed service. The platform is designed for performance and scale: its SDKs are lightweight, supporting local evaluation to minimize latency, and it integrates deeply with existing data stacks so you can use your warehouse or analytics system as the source of truth. ...

Downloads: 1 This Week

Last Update: 2026-02-04
See Project
20

Rogue

AI Agent Evaluator & Red Team Platform

Rogue is an open-source evaluation and red-team framework designed to test the reliability, safety, and policy compliance of AI agents. The platform automatically interacts with an AI agent by generating dynamic scenarios and multi-turn conversations that simulate real-world interactions. Instead of relying solely on static test scripts, Rogue uses an agent-as-a-judge architecture where one agent probes another agent to detect failures or unexpected behaviors.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
21

NFH Self-Improvement Loop

Minimal adversarial framework for AI agent self-modification

NFH Self-Improvement Loop is a conceptual framework and implementation designed to model continuous self-improvement cycles using AI systems. It focuses on creating feedback loops where outputs are evaluated, refined, and reintroduced into the system for further improvement. The project emphasizes iterative learning, allowing systems to evolve over time through repeated evaluation and adjustment. It can be applied to areas such as content generation, decision-making, and personal productivity systems. The framework encourages structured reflection and optimization, ensuring that each iteration builds upon previous results. ...

Downloads: 0 This Week

Last Update: 12 hours ago
See Project
22

autoresearch-macos

AI agents running research on single-GPU nanochat training

autoresearch-macos is a macOS-focused adaptation of autonomous research loop systems inspired by the autoresearch paradigm, enabling AI agents to iteratively improve machine learning models through self-directed experimentation. The system follows a structured loop in which an agent modifies a training script, executes a fixed-duration experiment, evaluates performance metrics, and decides whether to keep or revert changes. It is designed to operate efficiently within macOS environments,...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
23

Prompt flow

Build high-quality LLM apps

Prompt flow is a suite of development tools designed to streamline the end-to-end development cycle of LLM-based AI applications, from ideation, prototyping, testing, and evaluation to production deployment and monitoring. It makes prompt engineering much easier and enables you to build LLM apps with production quality.

Downloads: 0 This Week

Last Update: 2025-01-09
See Project
24

Conjure

Interactive evaluation for Neovim

Interactive evaluation for Neovim (Clojure, Fennel, Janet, Racket, Hy, MIT Scheme, Guile). Conjure is an interactive environment for evaluating code within your running program. The core features of Conjure are language agnostic (although it’s targeted at Lisps for now), with each language client providing their own extra tools. Here are the currently supported languages, contributions, and 3rd party plugins that add clients are highly encouraged! You can find a comparison table for all...

Downloads: 2 This Week

Last Update: 2026-04-09
See Project
25
$DeepSeek Math$

DeepSeek Math

Pushing the Limits of Mathematical Reasoning in Open Language Models

DeepSeek-Math is DeepSeek’s specialized model (or dataset + evaluation) focusing on mathematical reasoning, symbolic manipulation, proof steps, and advanced quantitative problem solving. The repository is likely to include fine-tuning routines or task datasets (e.g. MATH, GSM8K, ARB), demonstration notebooks, prompt templates, and evaluation results on math benchmarks. The goal is to push DeepSeek’s performance in domains that require rigorous symbolic steps, calculus, linear algebra, number theory, or multi-step derivations. ...

Downloads: 1 This Week

Last Update: 2025-10-03
See Project