via “image and video generation with provider-specific model support”
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, DeepSeek, Perplexity, Grok, Bielik, chat, vision, voice, RAG, image and video generation, agents, tools, MCP, plugins, speech synthesis and recognition, web search, memory, presets, assistants,and more. Linux, Windows, Mac
Unique: Provides a unified Image Generation mode supporting multiple providers (DALL-E, Imagen, Sora) with consistent parameter handling and local asset management; integrates video generation (Sora) alongside image generation in a single mode.
vs others: Compared to single-provider tools (DALL-E web, Midjourney), py-gpt supports multiple image models in one interface; compared to ChatGPT's image generation (OpenAI-only), py-gpt offers provider flexibility and local asset control.