yolov4.weights free download

bert4torch

An elegent pytorch implement of transformers

An elegant PyTorch implement of transformers.

Downloads: 0 This Week

Last Update: 2026-01-14

See Project

AirLLM

AirLLM 70B inference with single 4GB GPU

...This layer-wise inference approach allows models with tens of billions of parameters to run on devices with only a few gigabytes of VRAM. AirLLM preprocesses model weights so that each transformer layer can be loaded independently during computation, reducing the memory footprint while still performing full inference. As a result, developers can experiment with models that previously required specialized high-end GPUs.

Downloads: 1 This Week

Last Update: 2026-03-10

See Project

Code World Model (CWM)

Research code artifacts for Code World Model (CWM)

CWM (Code World Model) is a 32-billion-parameter open-weights language model. It is developed by Meta for enhancing code generation and reasoning about programs. It is explicitly trained on execution traces, action-observation trajectories, and agentic interactions in controlled environments. It has been developed to better capture how code, actions, and state interact over time. The repository provides inference code, reproducibility scripts, prompt guides, and more.

Downloads: 0 This Week

Last Update: 2025-09-26

See Project

LLaMA 3

The official Meta Llama 3 GitHub site

This repository is the former home for Llama 3 model artifacts and getting-started code, covering pre-trained and instruction-tuned variants across multiple parameter sizes. It introduced the public packaging of weights, licenses, and quickstart examples that helped developers fine-tune or run the models locally and on common serving stacks. As the Llama stack evolved, Meta consolidated repositories and marked this one deprecated, pointing users to newer, centralized hubs for models, utilities, and docs. Even as a deprecated repo, it documents the transition path and preserves references that clarify how Llama 3 releases map into the current ecosystem. ...

Downloads: 15 This Week

Last Update: 2025-10-08

See Project

Qwen

The official repo of Qwen chat & pretrained large language model

...These models, which range from smaller to larger configurations, are designed for a wide range of natural language processing tasks. They are openly available for research and commercial use, with Qwen's code and model weights shared on GitHub. Qwen's capabilities include text generation, comprehension, and conversation, making it a versatile tool for developers looking to integrate advanced AI functionalities into their applications.

1 Review

Downloads: 13 This Week

Last Update: 2026-03-05

See Project

EmoLLM

Pre & Post-training & Dataset & Evaluation & Depoly & RAG

...Its repository includes multiple model variants and training configurations spanning several underlying model families, including InternLM, Qwen, DeepSeek, Mixtral, LLaMA, and others, which shows that the initiative is structured as a broad ecosystem rather than a single release. The project also covers more than just model weights, with material for datasets, fine-tuning, evaluation, deployment, demos, RAG, and related subprojects such as its psychological digital assistant work.

Downloads: 1 This Week

Last Update: 2026-03-06

See Project

GLM-4-Voice

GLM-4-Voice | End-to-End Chinese-English Conversational Model

...GLM-4-Voice builds upon the bilingual strengths of the GLM architecture, supporting both Chinese and English, and is designed to handle long-form conversations with context retention. The repository provides model weights, inference demos, and setup instructions for deploying speech-enabled AI systems.

Downloads: 2 This Week

Last Update: 2 days ago

See Project

Coconut

Training Large Language Model to Reason in a Continuous Latent Space

...It supports training across multiple reasoning paradigms—including standard Chain-of-Thought (CoT), no-thought, and hybrid configurations—using configurable training stages and latent representations. The repository is built with Hugging Face Transformers, PyTorch Distributed, and Weights & Biases (wandb) for logging, supporting large-scale experiments on mathematical and logical reasoning datasets such as GSM8K, ProntoQA, and ProsQA.

Downloads: 0 This Week

Last Update: 2 days ago

See Project

Ling

Ling is a MoE LLM provided and open-sourced by InclusionAI

Ling is a Mixture-of-Experts (MoE) large language model (LLM) provided and open-sourced by inclusionAI. The project offers different sizes (Ling-lite, Ling-plus) and emphasizes flexibility and efficiency: being able to scale, adapt expert activation, and perform across a range of natural language/reasoning tasks. Example scripts, inference pipelines, and documentation. The codebase includes inference, examples, models, documentation, and model download infrastructure. As more developers and...

Downloads: 0 This Week

Last Update: 2025-09-30

See Project

Text-to-LoRA (T2L)

Hypernetworks that adapt LLMs for specific benchmark tasks

...The project provides a reference implementation of the Doc-to-LoRA method, which allows language models to quickly encode factual information or contextual constraints into lightweight LoRA modules. Developers and researchers can experiment with how textual task descriptions can generate LoRA weights that modify model behavior in real time.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

TigerBot

TigerBot: A multi-language multi-task LLM

...The project provides both base models and chat-optimized variants that can be used for dialogue systems, question answering, and general language understanding tasks. In addition to model weights, the repository includes training scripts, inference tools, and configuration files that allow researchers and developers to reproduce experiments or fine-tune the models for specific applications.

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

Agents 2.0

An Open-source Framework for Data-centric Language Agents

...In this framework, each node in the pipeline represents a step in the reasoning or action process, while prompts and tools act as adjustable parameters analogous to neural network weights. During training, the system performs a forward execution where the agent completes a task and records the trajectory of prompts, outputs, and tool usage. A prompt-based loss function is then applied to evaluate the quality of the outcome, generating language-based gradients that guide improvements to the agent pipeline.

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

Gemma

Gemma open-weight LLM library, from Google DeepMind

Gemma, developed by Google DeepMind, is a family of open-weights large language models (LLMs) built upon the research and technology behind Gemini. This repository provides the official implementation of the Gemma PyPI package, a JAX-based library that enables users to load, interact with, and fine-tune Gemma models. The framework supports both text and multi-modal input, allowing natural language conversations that incorporate visual content such as images.

Downloads: 0 This Week

Last Update: 2025-11-18

See Project

VibeThinker

Diversity-driven optimization and large-model reasoning ability

VibeThinker is a compact but high-capability open-source language model released by WeiboAI (Sina AI Lab). It contains about 1.5 billion parameters, far smaller than many “frontier” models, yet it is explicitly optimized for reasoning, mathematics, and code generation tasks rather than general open-domain chat. The innovation lies in its training methodology: the team uses what they call the Spectrum-to-Signal Principle (SSP), where a first stage emphasizes diversity of reasoning paths (the...

Downloads: 0 This Week

Last Update: 2025-11-19

See Project

Qwen2-Audio

Repo of Qwen2-Audio chat & pretrained large audio language model

Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models. It is evaluated on many benchmarks (speech recognition, translation, sound...

Downloads: 0 This Week

Last Update: 2025-09-23

See Project

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to...

Downloads: 0 This Week

Last Update: 2025-12-01

See Project

Grok-1

Open-source, high-performance Mixture-of-Experts large language model

Grok-1 is a 314-billion-parameter Mixture-of-Experts (MoE) large language model developed by xAI. Designed to optimize computational efficiency, it activates only 25% of its weights for each input token. In March 2024, xAI released Grok-1's model weights and architecture under the Apache 2.0 license, making them openly accessible to developers. The accompanying GitHub repository provides JAX example code for loading and running the model. Due to its substantial size, utilizing Grok-1 requires a machine with significant GPU memory. ...

1 Review

Downloads: 33 This Week

Last Update: 2025-02-27

See Project

YAYI

Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM

YAYI is an open-source large language model project developed to provide a multilingual conversational AI system capable of performing a wide variety of natural language processing tasks. The model is trained on diverse datasets covering multiple languages and domains so that it can support applications ranging from dialogue systems to text analysis and knowledge retrieval. The architecture is based on transformer-style language models optimized for conversational understanding and...

Downloads: 0 This Week

Last Update: 2026-03-05

See Project

InternLM

Official release of InternLM series

...The repository highlights multiple model sizes intended to serve different needs, from efficient research and prototyping to more capable deployments for complex scenarios. Beyond model weights, the project emphasizes an ecosystem view, pointing developers to compatible tools and projects across training and inference so teams can build end-to-end workflows. InternLM’s direction includes strong general-purpose capabilities and ongoing iterations that target improved reasoning, coding, and tool-use behaviors. The broader InternLM ecosystem also includes training tooling and guidance aimed at making fine-tuning and adaptation more accessible across hardware setups, including smaller single-GPU environments and larger multi-node configurations.

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

Chinese Llama 2 7B

The first Chinese LLaMA2 model in the open source community

...The project provides a version of LLaMA-2 that has been further trained on Chinese data so it can better understand and generate text in Chinese while maintaining compatibility with the original model ecosystem. In addition to the model weights, the repository also includes supervised fine-tuning datasets and training resources that help developers build chat-optimized versions of the model. The project follows the input format used by the LLaMA-2 chat architecture, ensuring compatibility with existing optimization techniques and tools built for the LLaMA-2 ecosystem. ...

Downloads: 0 This Week

Last Update: 2026-03-06

See Project

LLaMA

Inference code for Llama models

“Llama” is the repository from Meta (formerly Facebook/Meta Research) containing the inference code for LLaMA (Large Language Model Meta AI) models. It provides utilities to load pre-trained LLaMA model weights, run inference (text generation, chat, completions), and work with tokenizers. Tokenizer utilities, download scripts, shell helpers to fetch model weights with correct licensing/permissions. Includes example scripts for chat completions and text completions to show how to call the models in code. This repo is a core piece of the Llama model infrastructure, used by researchers and developers to run LLaMA models locally or in their infrastructure. ...

Downloads: 0 This Week

Last Update: 2025-09-14

See Project

bert4keras

Keras implement of transformers for humans

...This is the keras version of the transformer model library re-implemented by the author and is committed to combining transformer and keras with as clean code as possible. The original intention of this project is for the convenience of modification and customization, so it may be updated frequently. Load the pre-trained weights of bert/roberta/albert for fine-tune. Implement the attention mask required by the language model and seq2seq. Pre-training code from zero (supports TPU, multi-GPU, please see pertaining). Compatible with keras, tf.keras.

Downloads: 0 This Week

Last Update: 2023-03-25

See Project

Search Results for "yolov4.weights"

Showing 22 open source projects for "yolov4.weights"

bert4torch

AirLLM

Code World Model (CWM)

LLaMA 3

Qwen

EmoLLM

GLM-4-Voice

Coconut

Ling

Text-to-LoRA (T2L)

TigerBot

Agents 2.0

Gemma

VibeThinker

Qwen2-Audio

MiniMax-M1

Grok-1

YAYI

InternLM

Chinese Llama 2 7B

LLaMA

bert4keras

Search Results for "yolov4.weights"

Showing 22 open source projects for "yolov4.weights"

bert4torch

AirLLM

Code World Model (CWM)

LLaMA 3

Qwen

EmoLLM

GLM-4-Voice

Coconut

Ling

Text-to-LoRA (T2L)

TigerBot

Agents 2.0

Gemma

VibeThinker

Qwen2-Audio

MiniMax-M1

Grok-1

YAYI

InternLM

Chinese Llama 2 7B

LLaMA

bert4keras

Related Searches

Related Categories