Page 3 | Best Open Source AI Image Generators 2026

AI Image Generators

View 485 business solutions

AI Image Generators Clear Filters

MicroStation by Bentley Systems is the trusted computer-aided design (CAD) software built specifically for infrastructure design.
Microstation enables architects, engineers, and designers to create precise 2D and 3D drawings that bring complex projects to life.

MicroStation is the only computer-aided design software for infrastructure design, helping architects and engineers like you bring their vision to life, present their designs to their clients, and deliver their projects to the community.

Learn More
Collect! is a highly configurable debt collection software
Everything that matters to debt collection, all in one solution.

The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.

Learn More
1

x-unet

Implementation of a U-net complete with efficient attention

Implementation of a U-net complete with efficient attention as well as the latest research findings. For 3d (video or CT / MRI scans).

Downloads: 1 This Week

Last Update: 2024-05-03
See Project
2

Deface GUI - Face Anonymization Tool

Graphical User Interface Face Anonymization Tool

This application is a professional tool with a graphical user interface that enables anonymization of faces using the Deface Engine. Cross-Platform Compatible (Linux-Windows) NOTE: To use on Windows, first install Python. Then, if necessary, install “pip install deface” (only if necessary).

1 Review

Downloads: 8 This Week

Last Update: 2025-10-13
See Project
3

FLUX.1 Krea

Powerful open source image generation model

FLUX.1 Krea [dev] is an open-source 12-billion parameter image generation model developed collaboratively by Krea and Black Forest Labs, designed to deliver superior aesthetic control and high image quality. It is a rectified-flow model distilled from the original Krea 1, providing enhanced sampling efficiency through classifier-free guidance distillation. The model supports generation at resolutions between 1024 and 1280 pixels with recommended inference steps between 28 and 32 for optimal balance of speed and quality. FLUX.1 Krea is fully compatible with the FLUX.1 architecture, making it easy to integrate into existing workflows and pipelines. The repository offers easy-to-use inference scripts and a Jupyter Notebook example to facilitate quick experimentation and adoption. Users can run the model locally after downloading weights from Hugging Face and benefit from a live demo available on krea.ai.

1 Review

Downloads: 4 This Week

Last Update: 2025-08-05
See Project
4

Free AI Watermark Remover - FreeRepair

AI-powered tool to quickly remove watermarks from images flawlessly

AI Watermark Remover (Free And Open-Source) & Make Blurry Images Clearer Or Larger Tool - FreeRepair, Simulation IOPaint Based On The Django Of Python With No Sign-Up. As a free, open-source, AI-powered tool, FreeRepair makes it easy to remove watermarks, logos, text or clutter from images, and blurry images can be made clearer or larger. No installation, no internet connection, it works out of the box, safe and secure, unlimited.

1 Review

Downloads: 4 This Week

Last Update: 2026-03-30
See Project
Data management solutions for confident marketing
For companies wanting a complete Data Management solution that is native to Salesforce

Verify, deduplicate, manipulate, and assign records automatically to keep your CRM data accurate, complete, and ready for business.

Learn More
5

Image Augment Generator

Dataset Image Augmentation Generator is a desktop application

Dataset Image Augmentation Generator is a desktop application that creates multiple variations of images to expand machine learning datasets. Users select a folder of images, choose from 30 augmentation techniques, and the tool generates new images with one click, then packages them into a downloadable ZIP file. Key Feature: 30 Augmentation Techniques 1. Geometric Transformations: Rotation (-3° to +3°), Horizontal/Vertical Flip, Random Shear, Zoom, Translation, Perspective Transform 2. Noise & Blur: Gaussian Noise, Salt & Pepper Noise, Motion Blur, Gaussian Blur 3. Color Adjustments: Brightness, Contrast, Color Jitter, Gamma Correction, Channel Shuffle, Posterize 4. Advanced Techniques: Elastic Deformation, Cutout, CLAHE, Edge Enhancement, Histogram Equalization, Fourier Noise 5. Deep Learning Methods: Mixup, CutMix, Random Occlusion Target Audience 1. Machine Learning Engineers 2. Data Scientists 3. Computer Vision Researchers 4. Students learning ML/CV 5. Anyone

Downloads: 2 This Week

Last Update: 2026-02-21
See Project
6

macara

A converter for seamless transformation of files, data, and media ...

This application consolidates various scripts, including an AI feature (rembg), into a singular platform. The design of this software is evolutionary, allowing for the seamless integration of additional scripts, menus, or windows as needed. Serving as a versatile tool, it facilitates efficient file management, especially when handling a substantial volume of images, whether sorting by name or other attributes. These scripts are crafted to complement generative art AI technologies like Dall-e or stable diffusion.

Downloads: 1 This Week

Last Update: 2023-10-23
See Project
7

AI Atelier

Based on the Disco Diffusion, version of the AI art creation software

Based on the Disco Diffusion, we have developed a Chinese & English version of the AI art creation software "AI Atelier". We offer both Text-To-Image models (Disco Diffusion and VQGAN+CLIP) and Text-To-Text (GPT-J-6B and GPT-NEOX-20B) as options. Making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. When a modified version is used to provide a service over a network, the complete source code of the modified version must be made available. Create 2D and 3D animations and not only still frames (from Disco Diffusion v5 and VQGAN Animations). Input audio and images for generation instead of just text. Simplify tool setup process on colab, and enable ‘one-click’ sharing of the generated link to other users. Experiment with the possibilities for multi-user access to the same link.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
8

BCI

BCI: Breast Cancer Immunohistochemical Image Generation

Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix. We have released the trained model on BCI and LLVIP datasets. We host a competition for breast cancer immunohistochemistry image generation on Grand Challenge. Project pix2pix provides a python script to generate pix2pix training data in the form of pairs of images {A,B}, where A and B are two different depictions of the same underlying scene, these can be pairs {HE, IHC}. Then we can learn to translate A(HE images) to B(IHC images). The evaluation of human epidermal growth factor receptor 2 (HER2) expression is essential to formulate a precise treatment for breast cancer. The routine evaluation of HER2 is conducted with immunohistochemical techniques (IHC), which is very expensive. Therefore, for the first time, we propose a breast cancer immunohistochemical (BCI) benchmark attempting to synthesize IHC data directly with the paired hematoxylin and eosin (HE) stained images.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
9

Big Sleep

A simple command line tool for text to image generation

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Ryan Murdock has done it again, combining OpenAI's CLIP and the generator from a BigGAN! This repository wraps up his work so it is easily accessible to anyone who owns a GPU. You will be able to have the GAN dream-up images using natural language with a one-line command in the terminal. User-made notebook with bug fixes and added features, like google drive integration. Images will be saved to wherever the command is invoked. If you have enough memory, you can also try using a bigger vision model released by OpenAI for improved generations. You can set the number of classes that you wish to restrict Big Sleep to use for the Big GAN with the --max-classes flag as follows (ex. 15 classes). This may lead to extra stability during training, at the cost of lost expressivity.

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
Failed Payment Recovery for Subscription Businesses
For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.

Learn More
10

CLIP Guided Diffusion

A CLI tool/python module for generating images from text

A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI. Text to image generation (multiple prompts with weights). Non-square Generations (experimental) Generate portrait or landscape images by specifying a number to offset the width and/or height. Uses fewer timesteps over the same diffusion schedule. Sacrifices accuracy/alignment for quicker runtime. options: - 25, 50, 150, 250, 500, 1000, ddim25,ddim50,ddim150, ddim250,ddim500,ddim1000 (default: 1000) Prepending a number with ddim will use the ddim scheduler. e.g. ddim25 will use the 25 timstep ddim scheduler. This method may be better at shorter timestep_respacing values. Multiple prompts can be specified with the | character. You may optionally specify a weight for each prompt.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
11

CogView

Text-to-Image generation. The repo for NeurIPS 2021 paper

CogView is a large-scale pretrained text-to-image transformer model, introduced in the NeurIPS 2021 paper CogView: Mastering Text-to-Image Generation via Transformers. With 4 billion parameters, it was one of the earliest transformer-based models to successfully generate high-quality images from natural language descriptions in Chinese, with partial support for English via translation. The model incorporates innovations such as PB-relax and Sandwich-LN to enable stable training of very deep transformers without NaN loss issues. CogView supports multiple tasks beyond text-to-image, including image captioning, post-selection (ranking candidate images by relevance to a prompt), and super-resolution (upscaling model-generated images). The repository provides pretrained models, inference scripts, and training examples, along with a Docker environment for reproducibility.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
12

DALL-E in Pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image

Implementation / replication of DALL-E (paper), OpenAI's Text to Image Transformer, in Pytorch. It will also contain CLIP for ranking the generations. Kobiso, a research engineer from Naver, has trained on the CUB200 dataset here, using full and deepspeed sparse attention. You can also skip the training of the VAE altogether, using the pretrained model released by OpenAI! The wrapper class should take care of downloading and caching the model for you auto-magically. You can also use the pretrained VAE offered by the authors of Taming Transformers! Currently only the VAE with a codebook size of 1024 is offered, with the hope that it may train a little faster than OpenAI's, which has a size of 8192. In contrast to OpenAI's VAE, it also has an extra layer of downsampling, so the image sequence length is 256 instead of 1024 (this will lead to a 16 reduction in training costs, when you do the math).

Downloads: 0 This Week

Last Update: 2023-05-24
See Project
13

Deep Daze

Simple command line tool for text to image generation

Simple command-line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). In true deep learning fashion, more layers will yield better results. Default is at 16, but can be increased to 32 depending on your resources. Technique first devised and shared by Mario Klingemann, it allows you to prime the generator network with a starting image, before being steered towards the text. Simply specify the path to the image you wish to use, and optionally the number of initial training steps. We can also feed in an image as an optimization goal, instead of only priming the generator network. Deepdaze will then render its own interpretation of that image. The regular mode for texts only allows 77 tokens. If you want to visualize a full story/paragraph/song/poem, set create_story to True.

Downloads: 0 This Week

Last Update: 2022-03-13
See Project
14

Deep Feature Rotation Multimodal Image

Implementation of Deep Feature Rotation for Multimodal Image

Official implementation of paper Deep Feature Rotation for Multimodal Image Style Transfer [NICS'21] We propose a simple method for representing style features in many ways called Deep Feature Rotation (DFR), while still achieving effective stylization compared to more complex methods in style transfer. Our approach is a representative of the many ways of augmentation for intermediate feature embedding without consuming too much computational expense. Prepare your content image and style image. I provide some in the data/content and data/style and you can try to use them easily. We provide a visual comparison between other rotation angles that do not appear in the paper. The rotation angles will produce a very diverse number of outputs. This has proven the effectiveness of our method with other methods.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
15

Diffusers-Interpret

Model explainability for Diffusers

diffusers-interpret is a model explainability tool built on top of Diffusers. Model explainability for Diffusers. Get explanations for your generated images. Install directly from PyPI. It is possible to visualize pixel attributions of the input image as a saliency map. diffusers-interpret also computes these token/pixel attributions for generating a particular part of the image. To analyze how a token in the input prompt influenced the generation, you can study the token attribution scores. You can also check all the images that the diffusion process generated at the end of each step. Gradient checkpointing also reduces GPU usage, but makes computations a bit slower.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
16

Diffusion WebUI Colab

Choose your diffusion models and spin up a WebUI on Colab in one click

The most simplistic Colab with most models included by default. Custom models can be added easily. Stable Diffusion 2.0 in testing phase. Choose your diffusion models and spin up a WebUI on Colab in one click. Share your generations in our mastodon server - (This is hosted by a third party. I am not associated with the instance in any way.) The instructions are on the Colab.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
17

Dynacover

Dynamic Twitter images and banners

Dynacover is a PHP GD + TwitterOAuth CLI app to dynamically generate Twitter header images and upload them via the API. This enables you to build cool little tricks, like showing your latest followers or GitHub sponsors, your latest content created, a qrcode to something, a progress bar for a goal, and whatever you can think of. You can run Dynacover in three different ways. As a GitHub action: the easiest way to run Dynacover is by setting it up in a public repository with GitHub Actions, using repository secrets for credentials. Follow this step-by-step guide to set this up - no coding is required. With Docker: you can use the public erikaheidi/dynacover Docker image to run Dynacover with a single command, no PHP is required. To further customize your cover, you can clone the dynacover repo to customize banner resources (JSON template and header images, both located at app/Resources), then build a local copy of the Dynacover Docker image to use your custom changes.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
18

G-Diffuser Bot

Discord bot and Interface for Stable Diffusion

The first release of the all-in-one installer version of G-Diffuser is here. This release no longer requires the installation of WSL or Docker and has a systray icon to keep track of and launch G-Diffuser components. The infinite zoom scripts have been updated with some improvements, notably a new compositer script that is hundreds of times faster than before. The first release of the all-in-one installer is here. It notably features much easier "one-click" installation and updating, as well as a systray icon to keep track of g-diffuser programs and the server while it is running. Run run.cmd to start the G-Diffuser system. You should see a G-Diffuser icon in your systray/notification area. Click on the icon to open and interact with the G-Diffuser system. If the icon is missing be sure it isn't hidden by clicking the "up" arrow near the notification area.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
19

GANformer

Generative Adversarial Transformers

This is an implementation of the GANformer model, a novel and efficient type of transformer, explored for the task of image generation. The network employs a bipartite structure that enables long-range interactions across the image, while maintaining computation of linearly efficiency, that can readily scale to high-resolution synthesis. The model iteratively propagates information from a set of latent variables to the evolving visual features and vice versa, to support the refinement of each in light of the other and encourage the emergence of compositional representations of objects and scenes. In contrast to the classic transformer architecture, it utilizes multiplicative integration that allows flexible region-based modulation and can thus be seen as a generalization of the successful StyleGAN network. Using the pre-trained models (generated after training for 5-7x less steps than StyleGAN2 models! Training our models for longer will improve the image quality further).

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
20

Hunyuan3D-1

A Unified Framework for Text-to-3D and Image-to-3D Generation

Hunyuan3D-1 is an earlier version in the same 3D generation line (the unified framework for text-to-3D and image-to-3D tasks) by Tencent Hunyuan. It provides a framework combining shape generation and texture synthesis, enabling users to create 3D assets from images or text conditions. While less advanced than version 2.1, it laid the foundations for the later PBR, higher resolution, and open-source enhancements. (Note: less detailed public documentation was found for Hunyuan3D-1 compared to 2.1.). Community and ecosystem support (e.g. usage via Blender addon for geometry/texture). Integration into user-friendly tools/platforms.

Downloads: 0 This Week

Last Update: 2025-11-19
See Project
21

OpenAI DALL·E AsyncImage SwiftUI

OpenAI swift async text to image for SwiftUI app using OpenAI

SwiftUI views that asynchronously loads and displays an OpenAI image from open API. You just type in your idea and AI will give you an art solution. DALL-E and DALL-E 2 are deep learning models developed by OpenAI to generate digital images from natural language descriptions, called "prompts". You need to have Xcode 13 installed in order to have access to Documentation Compiler (DocC) OpenAI's text-to-image model DALL-E 2 is a recent example of diffusion models. It uses diffusion models for both the model's prior (which produces an image embedding given a text caption) and the decoder that generates the final image. In machine learning, diffusion models, also known as diffusion probabilistic models, are a class of latent variable models. They are Markov chains trained using variational inference. The goal of diffusion models is to learn the latent structure of a dataset by modeling the way in which data points diffuse through the latent space.

Downloads: 0 This Week

Last Update: 2025-08-14
See Project
22

OpenAI PicGen x23

OpenAI Picture Generator - experimental 23

'Picture Generator' is a console (command-prompt) desktop application developed using python 3.6.8 and other add-on libaries. The experimental application uses OpenAI api from OpenAI account to create pictures on a PC Apps. User is prompt to save the API Key when app starts-up (a one-time process, the api key is saved in api-key json file for later use). 'Generated pictures' are saved 'Gen Pics' folder in the same app folder. Compatible only for windows OS.

Downloads: 0 This Week

Last Update: 2023-07-03
See Project
23

OpenAI Web Application

A web application that allows users to interact with OpenAI's models

A web application that allows users to interact with OpenAI's modles through a simple and user-friendly interface. This app is for demo purpose to test OpenAI API and may contain issues/bugs. User-friendly interface for making requests to the OpenAI API. Responses are displayed in a chat-like format. Select Models (Davinci, Codex, DALL·E, Whisper) based on your needs. Create AI Images (DALL·E). Audio-Text Transcribe (Whisper). Highlight code syntax. Type in the input field and press enter or click on the send button to make a request to the OpenAI API. Use control+enter to add line breaks in the input field. Responses are displayed in the chat-like format on top of the page. Generate code, including translating natural language to code. Take advantage of DALL·E models to generate AI images. Utilize Whisper Model to transcribe audio into text.

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
24

PRESENTA Lib

The javascript presentation library for the automation era

PRESENTA Lib is a config-driven presentation library that creates modern web documents for the automation era. PRESENTA Lib requires a serializable object on purpose, to facilitate interoperability, and data transformation as well as fostering novel tools to create presentational documents. PRESENTA Lib is a javascript library without external dependencies. It comes as UMD, thus, you can install it in several ways. A PRESENTA Lib document contains a list of scenes that can be displayed one at a time. Each scene contains one or more block of content. The scene is responsible to keep blocks together. A block is a minimum unit that renders specific content from a given config object. PRESENTA Lib is designed to be extensible by using external plugins. Each scene can include one or more blocks. A block is responsible to render a specific content or media, such as text, image or video.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
25

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on. PaddleGAN provides developers with high-performance implementation of classic and SOTA Generative Adversarial Networks, and supports developers to quickly build, train and deploy GANs for academic, entertainment, and industrial usage. GAN-Generative Adversarial Network, was praised by "the Father of Convolutional Networks" Yann LeCun (Yang Likun) as [One of the most interesting ideas in the field of computer science in the past decade]. It's the one research area in deep learning that AI researchers are most concerned about.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project