Vui
NotebookLM conversational speech model
Generate lip-synced videos from images and audio
Transcribe audio files with timestamps and download transcripts
Align text to your predefined text corpus with whisper-bidec
Generate images from text prompts
Ultra fast high quality image generation
FLUXllama Multilingual(to be add more languages)
Structure-Preserving Style Transfer with Canny, Depth & Flux
Generate artwork in a chosen style from your image
Wan: Open and Advanced Large-Scale Video Generative Models
Generate custom scenes with your own character image
Generate customized images using text and multiple images
Chat with Kimi-VL: respond to text, images, video, PDFs
Generate 3D video from input images
Large Animatable Human Model
Chat with AI using text, audio, images, and video
New Ghibli EasyControl model is now released!!