Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Updated 19.7K runs

Updated 2 runs

Updated 9 runs

Updated 47 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 39 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 32 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 323 runs

Updated 49 runs

Updated 36 runs

Updated 24 runs

Updated 11 runs

An experimental model for testing out different failure modes

Updated 16 runs

black-forest-labs/flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 13.4M runs

Run Wan2.1 14b or 1.3b with a lora

Updated 823 runs

Photomaker V1 optimized with Lightning 8steps

Updated 22 runs

Inpainting and video2video experiments with Wan 2.1

Updated 102 runs

Updated 52 runs

This model generates pose variation of a cartoon character. It preserves the cartoon identity. Use this model to augment training dataset for any cartoon character created through AI. The augmented dataset can be used to train a LoRA model.

Updated 3.2K runs

PNG Generation Model https://hipng.com/

Updated 33 runs

Updated 13.6K runs

Updated 35 runs

Updated 10.1K runs

Updated 266 runs

SOTA Open Source TTS

Updated 157 runs

"DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion"

Updated 64 runs

Updated 81 runs

Updated 20 runs

Microsoft Magma: A Foundation Model for Multimodal AI Agents

Updated 15 runs

Updated 49 runs

Updated 14 runs

Updated 24 runs

Updated 269 runs

Updated 477 runs

Updated 6 runs

Updated 23 runs

Updated 50 runs

CogView-4 model, which has 6B parameters, supports native Chinese input, and Chinese text-to-image generation.

Updated 61 runs

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning

Updated 78 runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 2.5M runs

Updated 858 runs

Updated 67 runs

Updated 26 runs

Updated 23 runs

ibm-granite/granite-vision-3.2-2b

Granite-Vision-3.2-2B is a compact and efficient vision-language model, specifically designed for visual document understanding.

Updated 5.9K runs

Updated 79 runs