diff --git a/1-planning/midjourney-alternatives-bn-blog.md b/2-outline/midjourney-alternatives-bn-blog.md similarity index 54% rename from 1-planning/midjourney-alternatives-bn-blog.md rename to 2-outline/midjourney-alternatives-bn-blog.md index 1a5a146..c512643 100644 --- a/1-planning/midjourney-alternatives-bn-blog.md +++ b/2-outline/midjourney-alternatives-bn-blog.md @@ -2,11 +2,12 @@ slug: midjourney-alternatives-bn-blog title: "Best Midjourney Alternatives in 2026" author: banatie -status: planning +status: outline created: 2026-01-12 updated: 2026-01-12 content_type: comparison channel: banatie.app/blog +assets_folder: assets/midjourney-alternatives-bn-blog/ primary_keyword: "midjourney alternative" primary_volume: 1300 primary_kd: 3 @@ -32,21 +33,33 @@ Comprehensive comparison of AI image generation tools as Midjourney alternatives --- -## Assets +# Outline -- `assets/midjourney-alternatives-bn-blog/brief.md` — full brief with structure and requirements +See [outline.md](assets/midjourney-alternatives-bn-blog/outline.md) --- -## Log +# Draft -### 2026-01-12 — @strategist -Created brief. Consolidated from three inbox ideas. Keyword research completed via DataForSEO ($0.35 spent). +*Pending — waiting for @writer* -Categories defined: -1. Models with Native Service (UI-First) -2. Open Source / Self-Hosted -3. API-First Platforms ← Banatie here -4. Aggregators (Multi-Model) +--- -Next: @architect creates outline. +# SEO + +*Pending — waiting for @seo* + +--- + +# Activity Log + +See [log-chat.md](assets/midjourney-alternatives-bn-blog/log-chat.md) + +--- + +# Assets + +- [brief.md](assets/midjourney-alternatives-bn-blog/brief.md) — full brief with structure and requirements +- [research-complete.md](assets/midjourney-alternatives-bn-blog/research-complete.md) — research on 19 services +- [outline.md](assets/midjourney-alternatives-bn-blog/outline.md) — article structure +- [log-chat.md](assets/midjourney-alternatives-bn-blog/log-chat.md) — agent activity log diff --git a/assets/midjourney-alternatives-bn-blog/log-chat.md b/assets/midjourney-alternatives-bn-blog/log-chat.md new file mode 100644 index 0000000..8c6b710 --- /dev/null +++ b/assets/midjourney-alternatives-bn-blog/log-chat.md @@ -0,0 +1,39 @@ +# Activity Log + +## 2026-01-12 @strategist + +**Action:** Created brief and initiated article + +**Changes:** +- Created `brief.md` with full structure and requirements +- Created main article file with metadata + +**Notes:** +- Consolidated from three inbox ideas +- Keyword research completed via DataForSEO ($0.35 spent) +- Ultra-low KD (3), solid volume (1,300) — quick win opportunity + +**For next agent:** @architect creates outline + +--- + +## 2026-01-12 @architect + +**Action:** Research phase + outline creation + +**Changes:** +- Created `research-complete.md` — comprehensive research on 19 services +- Created `outline.md` — article structure with word budgets + +**Notes:** +- Research covered 4 categories: API-First (6), UI-First (8), Open Source (2), Aggregators (3) +- Services researched via Perplexity with primary source verification +- Pricing data current as of January 2026 +- Validation Request included — 5 claims for optional verification (funding rounds, user counts) +- Most claims already verified during research, validation can be skipped if time-constrained + +**For next agent:** +- Option A: @validator for 5 low-priority claims (optional) +- Option B: @writer can proceed directly — research is comprehensive + +--- diff --git a/assets/midjourney-alternatives-bn-blog/outline.md b/assets/midjourney-alternatives-bn-blog/outline.md new file mode 100644 index 0000000..1036790 --- /dev/null +++ b/assets/midjourney-alternatives-bn-blog/outline.md @@ -0,0 +1,476 @@ +# Outline: Midjourney Alternatives + +## Article Structure + +**Type:** Comparison / Listicle hybrid +**Total target:** 2,800 words +**Reading time:** 12-14 min +**Services covered:** 19 (Runway removed, Reve added) + +--- + +## Badge System + +**Available badges:** +- `Free tier` — free access available (not just trial) +- `API` — programmatic access +- `Video` — video generation +- `Text` — strong text rendering in images +- `Vector` — native SVG/vector output +- `Commercial safe` — trained on licensed content, IP indemnification, content credentials +- `Chatbot interface` — conversational/chat-based interaction + +**Editing features (list individually where applicable):** +- `Inpaint` — edit specific areas +- `Outpaint` — extend image boundaries +- `Canvas` — freeform editing workspace +- `Live editing` — real-time generation while drawing +- `Object selection` — select and modify objects +- `Zoom out` — extend composition outward +- `Upscaling` — enhance resolution + +**Image reference features (list individually where applicable):** +- `Style ref` — match aesthetic/style from reference image +- `Pose ref` — match character pose from reference +- `Character ref` — maintain character identity across generations +- `Content ref` — match composition/layout from reference +- `Depth ref` — match 3D depth information + +--- + +## "Commercial Safe" Definition + +**For the article, explain briefly:** +"Commercial safe" means the AI is trained on licensed/public domain content (not scraped from the web), provides IP indemnification against copyright claims, and includes content credentials (metadata showing AI origin). Key examples: Adobe Firefly (Content Credentials, trained on Adobe Stock), Getty Images AI ($50k indemnification per image). + +--- + +## Introduction (100 words) + +**Goal:** Set context, acknowledge Midjourney's dominance, promise comprehensive alternatives. + +- Hook: Midjourney defined AI art but has limitations (no API, Discord-first history, no free tier) +- 2026 landscape: dozens of alternatives for different needs +- What this guide covers: UI-first, open source, API-first, aggregators +- Badge system explanation (quick reference) + +**NO:** Long history of AI image generation, "in today's digital landscape..." + +--- + +## Section 1: UI-First Platforms (850 words) + +**Goal:** Cover services with native web/app interfaces. Best for non-developers who want easy access. + +**Section intro (50 words):** These services have their own interfaces. No coding required. Best for quick generation and iteration. + +### 1.1 Midjourney — The Baseline (100 words) +- **Users:** 21M Discord members, 1.2-2.5M daily active, ~1.4M paying subscribers +- **Market share:** 26.8% (leading platform) +- **Pricing:** $10/mo (Basic, 3.3 GPU hrs) → $120/mo (Mega, 60 GPU hrs) +- **Cost per image:** ~$0.03-0.05 in Fast mode +- **Key features:** + - V7 model with video generation (5-21 sec clips) + - --sref (style reference) with versions --sv 1-6 + - --cref (character reference) with --cw weight 0-100 + - Omni-reference system for consistency + - Web app + Discord interface +- **Best for:** Artistic quality, community, consistent aesthetic +- **Badges:** `Style ref` `Character ref` `Video` `Upscaling` + +### 1.2 Leonardo AI (100 words) +- **Users:** 18M+ creators, ~1.2M monthly active +- **Free tier:** 150 tokens/day (resets daily) +- **Paid:** $12-60/mo (Artisan has unlimited Relax mode) +- **API:** $299/mo +- **Key features:** + - Image Guidance suite: Style Reference, Content Reference, Character Reference, Pose, Depth, Edge + - Real-time Canvas with inpaint/outpaint + - Motion 2.0 for video + - Elements (style LoRAs with adjustable strength) + - Phoenix model for quality +- **Best for:** Game assets, concept art, professional control, character consistency +- **Badges:** `Free tier` `API` `Video` `Style ref` `Pose ref` `Character ref` `Content ref` `Depth ref` `Inpaint` `Outpaint` `Canvas` `Upscaling` + +### 1.3 Adobe Firefly (100 words) +- **Free tier:** Limited via web app +- **Paid:** Creative Cloud subscription, IP indemnification on qualifying plans +- **Key features:** + - Firefly 5 model (4MP native resolution) + - Partner models: FLUX.2, Gemini, GPT + - Content Credentials on all images (C2PA standard) + - Trained only on Adobe Stock, public domain, licensed content + - Photoshop, Illustrator, Creative Cloud integration + - Style Kits for brand consistency +- **Best for:** Commercial projects, Adobe users, brand-safe content +- **Badges:** `Free tier` `API` `Commercial safe` `Style ref` `Inpaint` `Upscaling` + +### 1.4 ChatGPT / GPT-4o (100 words) +- **Free tier:** Limited access for free users +- **Paid:** ChatGPT Plus $20/mo +- **Key features:** + - GPT-4o native multimodal generation + - Best-in-class text rendering + - Anatomical accuracy (hands, faces) + - Conversational editing ("make the sky bluer") + - ~1 min per image generation time +- **Best for:** Conversational editing, text in images, iterative refinement +- **Badges:** `Free tier` `Text` `Chatbot interface` `Inpaint` + +### 1.5 Ideogram (80 words) +- **Free tier:** Yes, credit-based +- **Paid:** Credit packs +- **Cost per image:** 0.25-1 credit +- **Key features:** + - Ideogram 3.0 model + - **Best-in-class text rendering** (~90% accuracy vs Midjourney's 30%) + - Founded specifically to solve typography in AI images + - Magic Fill and Extend editing + - Multiple style modes (Realistic, Design, 3D, Anime) +- **Best for:** Logos, branding, text-heavy designs, marketing materials +- **Badges:** `Free tier` `Text` `Inpaint` + +### 1.6 Google Gemini / Imagen (120 words) +- **Models:** + - **Gemini 2.5 Flash Image** (codename: "Nano Banana") — speed-optimized + - **Gemini 3 Pro Image** (codename: "Nano Banana Pro") — quality-optimized + - **Imagen 3/4** — enterprise via Vertex AI +- **Free tier:** Gemini app (with watermark), AI Studio free prototyping (2.5 Flash) +- **Paid:** Nano Banana Pro requires payment in AI Studio; API ~$0.03/image +- **Key features:** + - Character and style consistency across edits + - Multi-image fusion (blend multiple photos) + - Search-grounded generation (Nano Banana Pro) + - Natural language precision edits + - Strong text rendering (especially Nano Banana Pro) +- **Best for:** Google ecosystem, conversational editing, multi-image workflows +- **Badges:** `Free tier` `API` `Text` `Chatbot interface` `Character ref` `Style ref` + +### 1.7 Recraft AI (100 words) +- **Users:** 4M+ +- **Free tier:** 50 generations/day +- **Paid:** $10-48/mo +- **Key features:** + - **Native SVG vector output** — one of only two AI tools with true vector generation (with Adobe Firefly) + - V3 model with strong prompt adherence + - Pattern generation, product mockups + - Brand consistency tools + - Accurate text rendering + - AI Eraser, Inpainting, Outpainting, Mockuper +- **Best for:** Logos, branding, vector graphics, icons, patterns +- **Badges:** `Free tier` `API` `Vector` `Text` `Inpaint` `Outpaint` `Upscaling` + +### 1.8 Reve AI (100 words) +- **Launched:** March 2025 +- **Free tier:** 100 credits on signup + 20/day +- **Paid:** $5 for 500 images (~$0.01/image) +- **Key features:** + - 12B parameter hybrid model + - #1 quality ranking (ELO 1167 in benchmarks) + - **Full commercial rights** on all images, including free tier + - Natural language editing + - Image remixing (combine multiple images) + - Drag-and-drop editor (beta) + - Enhanced text rendering +- **Best for:** Budget-conscious creators, commercial projects, high-quality output +- **Badges:** `Free tier` `Commercial safe` `Text` `Object selection` + +--- + +## Section 2: Open Source / Self-Hosted (400 words) + +**Goal:** Cover options for developers who want control, privacy, or cost savings at scale. + +**Section intro (50 words):** Run models on your hardware. Higher setup cost, lower per-image cost at scale. Full control over the pipeline. + +### 2.1 FLUX (Black Forest Labs) (150 words) +- **Models:** + - Schnell — speed optimized + - Dev — balanced (community favorite) + - Pro — commercial license + - Kontext — editing/context-aware +- **Self-hosting requirements:** + - Full: 16-24GB VRAM + - Quantized (GGUF): 6-8GB VRAM, 4GB possible with Q2 + - RAM: 16GB min, 32GB recommended +- **Key features:** + - ComfyUI as primary interface + - ControlNet: Flux Tools (Canny, Depth), XLabs collections + - LoRA training: FluxGym, Replicate trainer, fal.ai + - Top-tier prompt understanding +- **Best for:** Self-hosting, maximum control, cost optimization at scale +- **Badges:** `API` (via providers) `Style ref` `Pose ref` `Depth ref` `Inpaint` + +### 2.2 Stable Diffusion 3.5 (100 words) +- **License:** Community License (permissive, open source) +- **Models:** + - Large (8.1B params) + - Turbo (4-step fast generation) + - Medium (9.9GB VRAM requirement) +- **Hosted options:** DreamStudio (official), Stability AI API, many third-party UIs +- **Key features:** + - Superior prompt adherence + - Diverse styles + - Huge ecosystem of fine-tunes, LoRAs, ControlNets + - Foundation for many other tools +- **Best for:** Local deployment, customization, building custom pipelines +- **Badges:** `API` (via providers) `Style ref` `Pose ref` `Depth ref` `Inpaint` + +### 2.3 Civitai (150 words) +- **Type:** Model marketplace + web generator +- **Free tier:** Yes, Buzz credits +- **Key features:** + - Thousands of checkpoints: SD families, FLUX, video models + - On-site generation: txt2img, img2img, ControlNet + - LoRA trainer built-in + - Community: Bounties, Creator Program monetization + - Per-model licensing, Usage Control mode +- **Note:** 2025 changes include stricter moderation, some payment disruptions +- **Best for:** Model discovery, community fine-tunes, niche styles +- **Badges:** `Free tier` `Inpaint` + +--- + +## Section 3: API-First Platforms (900 words) + +**Goal:** Cover services designed for developers. Programmatic access, SDKs, infrastructure focus. + +**Section intro (80 words):** +Midjourney has no official API. These platforms fill the gap for developers who need programmatic image generation. + +Key considerations: +- Pricing model (per-image vs GPU-time) +- SDK support (Python, TypeScript, etc.) +- Model selection +- Latency and reliability + +### 3.1 Replicate (120 words) +- **Models:** 100+ official (FLUX, SDXL, GPT-Image-1), thousands community +- **Pricing:** Pay-per-output, varies by model + - Cheap models: ~$0.003/image + - Premium models (like Imagen): $0.03+/image +- **SDK:** Python, JavaScript +- **Key features:** + - Official Models program with quality guarantees + - Cog tool for custom model deployment + - Zero-scale economics (pay only when used) + - Acquired by Cloudflare (2025) — infrastructure play +- **Gotcha:** Stripe payment issues for some regions +- **Best for:** Model variety, serverless deployment, zero-scale economics +- **Badges:** `API` + +### 3.2 fal.ai (120 words) +- **Users:** 2M+ developers +- **Models:** 600+ including FLUX.2, day-zero access to new models +- **Pricing:** $0.03-0.04/image (Seedream, Kontext), GPU hourly available +- **SDK:** TypeScript (@fal-ai/client), Python, Swift +- **Key features:** + - Claims 4x faster than competitors + - Sub-second for Schnell + - Funding: $140M Series D (Dec 2025), $4.5B valuation +- **Best for:** Speed, TypeScript developers, latest models first +- **Badges:** `API` + +### 3.3 Runware (120 words) +- **Models:** 400,000+ via unified API (SD, FLUX, Imagen) +- **Pricing:** **Cheapest in market** + - $0.0006/image (FLUX Schnell) = 1,666 images per $1 + - $10 free credits (~1,000+ images) +- **SDK:** REST API, WebSocket +- **Key features:** + - Sonic Inference Engine (proprietary) + - Sub-second inference + - 0.1s LoRA cold starts + - 90% lower cost claim vs competitors +- **Best for:** Cost optimization, high volume production +- **Badges:** `API` + +### 3.4 Segmind (100 words) +- **Models:** 500+ including FLUX, Seedream, Ideogram, GPT-Image +- **Pricing:** Per-second billing, ~$0.002/s on A100 +- **Free tier:** $5 free credits +- **SDK:** JavaScript, Python, Swift +- **Key features:** + - PixelFlow workflow builder + - Workflow-to-API publishing + - Fine-tuning support +- **Best for:** Complex workflows, custom pipelines +- **Badges:** `Free tier` `API` + +### 3.5 Novita AI (100 words) +- **Models:** 10,000+ image models +- **Pricing:** $0.0015/image baseline +- **SDK:** Python +- **Key features:** + - Serverless GPU + - Hugging Face integration + - Startup Program ($10k credits) +- **Best for:** Budget projects, startups +- **Badges:** `API` + +### 3.6 Together AI (100 words) +- **Models:** 40+ (FLUX.2, SD3, Imagen, SeeDream) +- **Free tier:** 3 months free FLUX.1 Schnell +- **SDK:** OpenAI-compatible (Python, JS) +- **Key features:** + - Unified platform (text + image + video) + - Familiar API format for OpenAI users +- **Best for:** OpenAI SDK users, unified AI platform +- **Badges:** `Free tier` `API` + +### 3.7 Banatie (150 words) + +**Developer-native image generation for AI coding workflows.** + +Built for developers who use Claude Code, Cursor, and similar tools. The problem: generating images means leaving your IDE, using external tools, downloading files, organizing them manually. + +**Integration methods:** +- MCP Server — direct Claude Code / Cursor integration +- REST API — standard HTTP +- Prompt URLs — generate via URL parameters +- SDK/CLI — automation tools + +**Key features:** +- Prompt enhancement (AI improves prompts) +- Built-in CDN (global delivery) +- @name references (consistency across project) +- Project organization (automatic) + +**Differentiators vs alternatives:** +- MCP integration (unique) +- Built-in CDN (unique) +- Prompt URLs for on-demand generation (unique) +- Focus on developer workflow, not just API + +**Best for:** Developers using AI coding tools who want images without context-switching. + +**Badges:** `API` + +--- + +## Section 4: Aggregators (350 words) + +**Goal:** Cover platforms that provide access to multiple models through one interface/subscription. + +**Section intro (50 words):** One subscription, multiple models. Compare outputs side-by-side. Good for exploration and finding the right model for your use case. + +### 4.1 Poe (Quora) (120 words) +- **Models:** 100+ including FLUX-pro, GPT-Image, Imagen 3/4, DALL-E 3, Gemini +- **Free tier:** 3,000 pts/day (resets daily, doesn't roll over) +- **Paid:** $4.99-249.99/mo +- **API:** Released July 2025, OpenAI-compatible +- **Key features:** + - Multi-model comparison in one interface + - Custom bot creation + - App Creator +- **Best for:** Model exploration, one subscription for everything +- **Badges:** `Free tier` `API` `Chatbot interface` + +### 4.2 Krea.ai (120 words) +- **Models:** Flux, Veo 3, Kling, Runway, 20+ total +- **Free tier:** Yes +- **Key features:** + - **Real-time generation** — <50ms (industry leader) + - Real-time canvas: draw and see AI respond instantly + - 22K resolution upscaling + - In/out-painting +- **Best for:** Real-time iteration, concept artists, interactive co-creation +- **Badges:** `Free tier` `Live editing` `Canvas` `Inpaint` `Outpaint` `Upscaling` + +### 4.3 Freepik AI (110 words) +- **Models:** Mystic (proprietary), Flux, Ideogram +- **Key features:** + - Mystic: Fine-tuned on Flux/SD/Magnific, 2K default resolution + - Strong text rendering (outperforms Midjourney, DALL-E) + - All-in-one: stock assets + generation + editing + - AI Video (Veo), Sketch-to-Image, Custom Characters +- **Best for:** All-in-one creative workflow, marketing materials, text in images +- **Badges:** `Text` `Inpaint` `Upscaling` + +--- + +## Section 5: FAQ (250 words) + +**Goal:** Answer People Also Ask questions for SEO. Direct answers, no padding. + +### Is there an AI better than Midjourney? (50 words) +Depends on use case. For text rendering: Ideogram, Recraft, GPT-4o. For API access: fal.ai, Replicate, Banatie. For free tier: Leonardo AI, Gemini, Reve. For commercial safety: Adobe Firefly. For vectors: Recraft. Midjourney excels at artistic quality but lacks API and has no free tier. + +### What is similar to Midjourney but free? (50 words) +Leonardo AI (150 tokens/day), Gemini (unlimited in app with watermark), Reve (100 credits + 20/day), Ideogram (free tier), Poe (3,000 points/day). For unlimited free: self-host FLUX with ComfyUI (requires GPU). + +### Which AI image generator has no restrictions? (50 words) +Most services have content policies. Self-hosted options (FLUX, Stable Diffusion via Civitai) offer most freedom. Civitai has community models with varied restrictions. Note: "no restrictions" often means NSFW content — check individual model licenses. + +### Is Midjourney better than Stable Diffusion? (50 words) +Midjourney: easier to use, consistent artistic style, no setup required. Stable Diffusion: free, customizable, self-hostable, huge model ecosystem. For developers: SD/FLUX via API gives more control. For artists: Midjourney's quality-per-prompt is hard to beat. + +### Does Midjourney have an API? (50 words) +**No official API.** Third-party wrappers exist but violate ToS and risk account bans. For programmatic image generation, use: Replicate, fal.ai, Runware, Together AI, or Banatie. These provide similar quality models (FLUX) with proper API access. + +--- + +## Conclusion (50 words) + +**Goal:** Wrap up, no "best" declaration, direct to relevant option. + +- No single best alternative — depends on needs +- Quick decision guide: + - UI → Leonardo, Reve, or Firefly + - API → fal.ai, Runware, or Banatie + - Self-host → FLUX + - Explore → Poe or Krea +- Link to Banatie for developer workflow + +--- + +## Visual Assets Needed + +| Type | Description | Section | +|------|-------------|---------| +| Screenshots | Each service homepage or generation UI | All services | +| Badge icons | Feature badges visual system | Throughout | +| Diagram | Decision flowchart (optional) | Conclusion | + +## SEO Notes + +- H2 for section titles: UI-First, Open Source, API-First, Aggregators, FAQ +- H3 for individual services: Midjourney, Leonardo AI, etc. +- FAQ answers PAA directly for featured snippet potential +- "midjourney api" addressed in intro, FAQ, and API-First section +- Internal link to Banatie docs from Banatie section + +--- + +## Validation Request + +**Status:** Low priority — most claims verified during research + +### Claims to Verify (Optional) + +1. "Ideogram achieves ~90% text accuracy vs Midjourney's 30%" + - **Section:** 1.5 Ideogram + - **Type:** statistical / benchmark + - **Source found:** pxz.ai review, wavespeed.ai + - **Priority:** Low (already validated in research) + +2. "Reve Image 1.0 ranked #1 with ELO 1167" + - **Section:** 1.8 Reve AI + - **Type:** benchmark + - **Source found:** Artificial Analysis + - **Priority:** Low (already validated) + +3. "fal.ai raised $140M Series D at $4.5B valuation (Dec 2025)" + - **Section:** 3.2 fal.ai + - **Type:** factual / financial + - **Priority:** Medium + +4. "Midjourney has 21M Discord users, 26.8% market share" + - **Section:** 1.1 Midjourney + - **Type:** statistical + - **Source found:** Multiple (demandsage, quantumrun, etc.) + - **Priority:** Low (well-documented) + +### Recommended Approach + +Most claims verified via Perplexity research. Financial claims (funding rounds) are nice-to-have but not critical for a comparison guide. Add "as of January 2026" disclaimer for all pricing. diff --git a/assets/midjourney-alternatives-bn-blog/pricing-collection-prompt.md b/assets/midjourney-alternatives-bn-blog/pricing-collection-prompt.md new file mode 100644 index 0000000..dad68b2 --- /dev/null +++ b/assets/midjourney-alternatives-bn-blog/pricing-collection-prompt.md @@ -0,0 +1,63 @@ +# Pricing Collection Prompt for Claude Browser Extension + +**Purpose:** Collect current pricing data for all AI image generation services in the comparison article. + +**Date created:** 2026-01-12 + +--- + +## Task: Collect pricing data for AI image generation services + +Visit each URL below and extract current pricing information: + +### Services to check: + +**UI-First:** +1. midjourney.com/account/billing (or find pricing page) +2. leonardo.ai/pricing +3. firefly.adobe.com (find pricing) +4. openai.com/chatgpt/pricing (ChatGPT Plus for image generation) +5. ideogram.ai/pricing +6. gemini.google.com (check if image generation requires subscription) +7. recraft.ai/pricing +8. runwayml.com/pricing + +**Open Source (hosting costs):** +9. Check typical cloud GPU costs for running Stable Diffusion / Flux + +**API-First:** +10. replicate.com/pricing +11. fal.ai/pricing +12. runware.ai/pricing +13. segmind.com/pricing +14. novita.ai/pricing + +**Aggregators:** +15. krea.ai/pricing +16. freepik.com/ai (find pricing for AI tools) +17. together.ai/pricing + +### For each service, collect: +- Free tier (if exists): limits, features +- Paid tiers: name, price/month, key limits (credits, images, fast hours) +- Pay-as-you-go rate (if applicable): $ per image or $ per credit + +### Output format: + +``` +**[Service Name]** +- Free: [yes/no] — [limits if yes] +- Plans: + - [Plan name]: $X/mo — [key limits] + - [Plan name]: $X/mo — [key limits] +- Pay-per-use: $X per image (if available) +- Last checked: [date] +``` + +Note: Some sites may require scrolling or clicking "Pricing" link. If pricing is not publicly available, note that. + +--- + +## After collection + +Save results to: `assets/midjourney-alternatives-bn-blog/pricing-data.md` diff --git a/assets/midjourney-alternatives-bn-blog/research-complete.md b/assets/midjourney-alternatives-bn-blog/research-complete.md new file mode 100644 index 0000000..52d9c8d --- /dev/null +++ b/assets/midjourney-alternatives-bn-blog/research-complete.md @@ -0,0 +1,577 @@ +# Complete Research: Midjourney Alternatives for Developers + +**Research completed:** January 12, 2026 +**Scope:** 19 AI image generation services across 4 categories + +--- + +## CATEGORY 1: API-FIRST PLATFORMS + +### 1. Replicate + +**Models:** FLUX (Pro, Dev, 1.1), SDXL, Google Nano-Banana, ByteDance Seedream-4/4.5, Ideogram V3-Turbo, Stable Diffusion variants, OpenAI GPT-Image-1, 100+ official models. Community models available. + +**Pricing:** Pay-as-you-go, billed by output. SDXL: ~$0.012/prediction. SDXL Lightning: ~$0.42/run (~238 images/$1). Typical image: ~$0.003 (30 images/$1). Hardware: CPU $0.000100/sec to 8x H100 $0.012200/sec. Failed runs not charged. + +**SDK:** Python (extensive), JavaScript/Node.js. No CLI or MCP documented. + +**Image Delivery:** URLs returned in API response. No details on permanence/CDN. + +**Features:** Fine-tuning (LoRA training), bring-your-own-key for OpenAI, upscaling, background removal, image restoration. Reference images via FLUX Kontext Pro. Inpainting, seed control. + +**Batch:** GPT-Image-1 supports num_images parameter. "Thousands of images per second" capability claimed. + +**Gotchas:** Payment gateway issues for international users (Stripe). 204 outages tracked since April 2024. **Acquired by Cloudflare in 2025.** + +**Unique:** Official Models program (stable, predictably priced). Cog tool for custom model deployment. Zero-scale economics. + +--- + +### 2. fal.ai + +**Models:** 600+ generative media models. FLUX.2, FLUX.1 (schnell), SDXL (fast-sdxl), GPT-Image 1.5, Recraft V3, Stable Diffusion variants. + +**Pricing:** Serverless (per-output) or Compute (hourly GPU). +- GPU: H100 $1.89/h, H200 $2.10/h, A100 $0.99/h +- Seedream V4: $0.03/image (33 images/$1) +- Flux Kontext Pro: $0.04/image (25 images/$1) +- FLUX.2 Pro: $0.03/MP (first MP), $0.015/MP (additional) +- Free tier available + +**SDK:** JavaScript/TypeScript (@fal-ai/client), Python, Swift. No CLI/MCP documented. + +**Image Delivery:** URLs returned (WebP format on v3.fal.media CDN). sync_mode for data URIs. + +**Features:** Image-to-image, mask-based inpainting, upscaling (clarity upscaler), format control, aspect ratio, reference images with strength parameter. + +**Speed:** Claims "4x faster" than standard approaches. No specific benchmarks. + +**Unique:** FLUX.2 [dev] Turbo (10x cheaper, 6x more efficient). Day-zero model access. $140M Series D (Dec 2025), $4.5B valuation. 2M+ developers. Backed by Sequoia, NVIDIA, a16z. + +--- + +### 3. Runware + +**Models:** Thousands of models via unified API. Stable Diffusion, FLUX, Google Imagen 3.0/4.0 Ultra, Gemini Flash Image 2.5. 400,000+ models supported. + +**Pricing:** Pay-as-you-go per image (not GPU time). Range: $0.0006-$0.24/image. +- FLUX Schnell: $0.0006/image (1,666 images/$1, 0.6s) +- FLUX Dev: $0.0038/image (263 images/$1, 2s) +- SD 1.5: $0.0006/image (1,666 images/$1, 0.8s) +- SDXL: $0.0026/image (384 images/$1) +- $10 free credits for new users (~1,000 free images) + +**SDK:** REST API and WebSocket. No specific language SDKs documented. + +**Image Delivery:** Async API returns taskUUID + URL. + +**Features:** Text-to-image (sub-second latency), image-to-image, style transfer, captioning, background removal, upscaling, inpainting, outpainting, ControlNet, PhotoMaker, LayerDiffuse (alpha channels), up to 6 LoRAs simultaneously. + +**Speed:** Claims 20x faster than traditional cloud GPUs. 0.1s LoRA cold starts. Sub-second inference. + +**Gotchas:** Limited online discussion/user reviews. "Hasn't generated significant buzz." + +**Unique:** Sonic Inference Engine® (proprietary hardware). GPUs at ~100% utilization. Claims up to 90% lower cost. Renewable energy powered. SOC 2 compliant. $50M Series A (Dec 2025). + +--- + +### 4. Segmind + +**Models:** 500+ image and video models. FLUX.1 (multiple versions), Seedream 3.0/4.0, Ideogram 3.0, GPT-Image 1/Mini, Imagen 3. + +**Pricing:** Per-second billing. GPU: A100 $0.002/s, H100 $0.0043/s, L40S $0.0015/s. Flux-Pro fine-tuning: $3-9 based on steps. $5 free credits for new users. + +**SDK:** JavaScript/TypeScript, Python, Swift SDKs available. No CLI/MCP documented. + +**Image Delivery:** URLs returned. + +**Features:** Multimodal editing, image inpainting, img2img, upscaling, batch output (up to 15 images/prompt), reference images (up to 3). + +**Speed:** FLUX.1 Schnell: ~1.8s for 2K resolution. Consistent 3-5 second generation times. + +**Unique:** PixelFlow (custom multi-step workflow builder), VoltaML infrastructure, workflow-to-API publishing, fine-tuning for brand consistency. + +--- + +### 5. Novita AI + +**Models:** 200+ pre-integrated APIs with 10,000+ image models. Stable Diffusion SDXL 1.0, Qwen-Image-Edit. + +**Pricing:** Freemium. Pay-as-you-go primary. **$0.0015 per standard image** baseline. Startup Program: up to $10,000 credits. $0.50 starting credits for new users. + +**SDK:** Python SDK (`pip install novita-sdk`). JavaScript not confirmed. + +**Image Delivery:** Via API response. Storage details not specified. + +**Features:** Text-to-image, img2img, image refinement, background elimination, inpainting, upscaling & super-resolution. + +**Unique:** Serverless GPU infrastructure, custom model upload, rapid open-source model integration, Hugging Face integration. Dual-service: Model Inference API + GPU Cloud. + +--- + +### 6. Together AI + +**Models:** 40+ image and video models. FLUX.2 (dev, pro, flex), Stable Diffusion 3 Medium, HiDream-I1-Full, Google Imagen, Nano Banana, ByteDance SeeDream. + +**Pricing:** 3-month unlimited free access for FLUX.1 [schnell]. Per-model pricing (not detailed in sources). + +**SDK:** OpenAI-compatible SDKs in Python and JavaScript. + +**Image Delivery:** URLs returned via `response.data.url`. + +**Features:** Image-to-image, multi-reference consistency (FLUX.2 supports 4-reference inputs), brand compliance controls (hex code color matching), reliable text rendering. + +**Batch:** "n" parameter for up to 4 images per request. + +**Unique:** Unified platform (text, image, video in single API), OpenAI-compatible endpoints, production-grade infrastructure. + +--- + +## CATEGORY 2: UI-FIRST PLATFORMS + +### 7. Leonardo AI + +**Models:** Leonardo Phoenix, GPT Image 1.5, Lucid Origin. Hosted models include WAN, SVD. + +**Pricing:** +- Free: 150 tokens/day (5-8 tokens per image), watermarked images +- Apprentice: $12/mo ($10 annual) - 8,500 tokens/month +- Artisan: $30/mo ($24 annual) - 25,000 tokens/month +- Maestro: $60/mo ($48 annual) - higher limits +- API: Separate credits, Pro plan $299/mo for 200,000 credits + +**Features:** Text-to-image, img2img, style morphing, real-time inpainting/outpainting, AI upscaling, real-time canvas, Flow State (no-prompt generation), batch generation, video generation (Motion 2.0). + +**API:** Available for developers. Credit-based. + +**Unique:** "Relaxed Generation" mode for unlimited generations (slower, hosted models only). Custom model training. 18M+ creators. + +**Comparison to Midjourney:** Free tier available (MJ has none). More customization/control options. Leonardo: 5 tiers starting $12; MJ: 4 tiers starting $10. + +--- + +### 8. Adobe Firefly + +**Models:** +- Firefly Image Model 5 (public beta) - native 4MP, photorealistic, portraits, complex compositions +- Firefly Image Model 4 and 4 Ultra - up to 2K +- Firefly Video Model - up to 1080p +- Partner models: FLUX.1 Kontext, FLUX.2, Google Gemini 2.5 Flash Image, Imagen 3, OpenAI GPT, Runway, ElevenLabs, Topaz Labs, Luma AI, Veo3 + +**Pricing:** Free tier available through web app. Creative Cloud integration. "Unlimited generations" mentioned in Dec 2025 update. + +**API:** Firefly Services APIs: Text-to-Image (GA), Avatar (GA), Text-to-Video (beta). + +**Features:** Style references, Prompt to Edit (conversational editing), camera motion reference, video transitions, layered image editing (in dev), generative text edit. + +**Commercial Use:** All Adobe Firefly models marketed as "commercially safe." Content credentials attached to all generated images. + +**Integration:** Photoshop (Generative Fill with multiple models), Generative Upscale (Topaz), Adobe Express. + +**Unique:** Multi-model platform with choice across providers. All-in-one AI creative platform. Partner model integration. + +--- + +### 9. Ideogram + +**Models:** +- Ideogram 3.0 (March 2025) - highest visual fidelity, best text rendering +- Ideogram 2.0 (Aug 2024) - enhanced realism, multiple styles +- Ideogram 2a - fastest, speed-optimized + +**Pricing:** Credit-based. Free to start. +- 3.0: 4 credits/generation (4 images) = 1 credit/image +- 2.0: 2 credits/generation (4 images) = 0.5 credits/image +- 2a: 1 credit/generation (4 images) = 0.25 credits/image + +**API:** Not documented in sources. + +**Features:** Superior text rendering (biggest strength), auto style feature, multiple artistic styles (Realistic, 3D, Anime, Design), custom aspect ratios, color palette control, magic prompt algorithm. + +**Unique:** Best-in-class text rendering. Professional design focus (logos, branding, infographics). Vector-style graphics, layout elements. + +**Known Issues:** Sometimes incorrect subject counts. May require re-prompting for surreal/abstract art. + +--- + +### 10. OpenAI (DALL-E / GPT-4o) + +**Models:** +- GPT-4o - default image generator in ChatGPT (native multimodal integration) +- DALL-E 3 - separate tool within ChatGPT + +**Pricing:** Available to ChatGPT Plus ($20/mo), Pro, Team, Free users. API rolling out. + +**Features:** +- GPT-4o: Sophisticated editing, image-to-image transformation, accurate text rendering (even paragraphs), anatomically correct figures, precise prompt adherence, conversational refinement +- Upload images and request edits with contextual understanding + +**Comparison GPT-4o vs DALL-E 3:** +- Text rendering: GPT-4o handles complex layouts; DALL-E 3 struggles with longer passages +- Anatomical accuracy: GPT-4o consistent; DALL-E 3 has hand/pose errors +- Prompt adherence: GPT-4o more precise + +**Limitations:** Generation speed ~1 minute per image (improving over time). + +--- + +### 11. Google Gemini / Imagen + +**Models:** +- Gemini 2.5 Flash Image (aka "Nano Banana") - text-to-image, conversational editing, multi-image fusion +- Imagen 3 - enterprise via Vertex AI, higher quality +- Imagen 4 - Google's top offering as of 2025 + +**Pricing:** +- Gemini App: Free access for consumers +- Imagen API: ~$0.03/image (~33 images/$1) +- Vertex AI: Enterprise pricing + +**Access Methods:** +- Gemini App (Consumer) - free +- Gemini API via Google AI Studio (Developer) +- Vertex AI (Enterprise) - full governance, SynthID watermarks + +**Features:** Object removal, relighting, background changes, multi-image fusion, character/style consistency, conversational image edits. + +**Quality Issues:** Independent testing: DALL-E 13.5/15, Stable Diffusion 11/15, Gemini 3/15. Generation time 10+ seconds (vs 4-8s competitors). Struggles with complex prompt adherence. + +**Limitations:** +- Bias toward photorealism - often refuses edits on human photos +- No on-device generation (cloud required) +- Model in public preview status +- Cannot prevent model from generating text alongside images + +**Commercial:** Enterprise protections via Vertex AI: SynthID verification, tenancy controls, quotas. + +--- + +### 12. Recraft AI + +**Models:** Recraft V3 (aka "Red Panda") - proprietary model. Benchmark: ELO 1172 (vs DALL-E 984). + +**Pricing:** +| Plan | Cost | Monthly Credits | +|------|------|-----------------| +| Free | $0 | 50 daily (~1,500/mo) | +| Basic | $10/mo | 1,000 | +| Advanced | $27/mo | 4,000 | +| Pro | $48/mo | 8,400 | + +**Key Differentiator:** Native SVG vector output - direct scalable vector files from prompts. Essential for print, branding, logos. + +**Features:** +- Photorealistic + style consistency across assets +- Seamless pattern generation (textiles, washi tape) +- Background removal/replacement +- Image upscaling +- Product mockups (t-shirts, mugs, billboards) +- Real-time inpainting, color correction +- Drag-and-drop editor + +**Speed:** Under 10 seconds. Low-res previews near-instant. + +**API:** Listed as available, but no detailed docs in sources. + +**User Sentiment:** Overwhelmingly positive. G2 rating 4.6. "Best AI generator" quotes. 4M+ users, 700% growth, $30M Series B (May 2025). + +**Limitations:** +- No outpainting +- No bulk-download/batch export +- Blocked in some countries (sanctions) +- Limited mobile functionality +- Free tier depletes quickly + +**Best For:** Logo/brand design, graphic design, print/pattern design, product mockups, agencies with multiple client brands. + +--- + +### 13. Runway + +**Models:** +- Gen-3 Alpha: 10 credits/second +- Gen-3 Alpha Turbo: 5 credits/second (7x faster, half price, requires input image) +- Gen-4 Video: 12 credits/second +- Gen-4 Turbo: 5 credits/second +- Gen-4.5: Text-to-video (Standard+ plans) + +**Pricing:** +| Plan | Cost | Credits/mo | Best For | +|------|------|------------|----------| +| Free | $0 | 125 (one-time) | Testing | +| Standard | $12/mo | 625 | Freelancers | +| Pro | $28/mo | 2,250 | Professionals | +| Unlimited | $76/mo | 2,250 + unlimited relaxed | High-volume | + +**Image vs Video Costs:** +- Gen-4 Image 720p: 5 credits (~$0.05) +- Gen-4 Image 1080p: 8 credits +- Gen-4 Image Turbo: 2 credits +- 5-sec video: 25-60 credits +- 20-sec Gen-4 video: 240 credits (Turbo: 100) + +**Resolution:** Free/Standard = 720p-1080p. Pro+ = 4K. + +**Features:** Aleph (video editing), Act-Two (performance capture), upscaling to 4K. Watermark-free on paid plans. + +**API:** Not documented in sources. + +**Best For:** Video-first workflows. Freelancers, agencies, studios. + +--- + +### 14. Stability AI (Stable Diffusion 3.5) + +**Models:** +- SD 3.5 Large: 8.1B parameters, up to 1MP resolution +- SD 3.5 Large Turbo: 4-step distilled version, prioritizes speed +- SD 3.5 Medium: 2.5B parameters, 9.9 GB VRAM, consumer hardware + +**Licensing:** Stability AI Community License (permissive). + +**Features:** Superior prompt adherence, diverse outputs without extensive prompting, versatile styles (3D, photography, painting, line art), Query-Key Normalization for stability. + +**DreamStudio:** Status in 2025 not detailed in sources. + +--- + +## CATEGORY 3: OPEN SOURCE + +### 15. FLUX (Black Forest Labs) + +**Models:** +- FLUX.1 (foundational family) +- FLUX.1 Schnell (speed-optimized) +- FLUX.1 Dev (balanced) +- FLUX.1 Pro (commercial) +- FLUX.1 Kontext [dev/pro/max] (May 2025) - image editing + generation +- FLUX1.1 Pro, FLUX1.1 Pro Ultra (4MP/2K, Ultra + Raw modes) +- FLUX.2 + +**Licensing:** +- FLUX.1 Kontext [dev]: Open-weight (private beta) +- FLUX.1 Pro, Kontext [pro/max]: Proprietary, API only + +**Self-Hosting Requirements:** +- Original: 16-24GB VRAM recommended, 8-12GB minimum +- GGUF quantized: 6GB minimum, can run on 4-6GB with Q2-Q4 +- System RAM: 16GB minimum, 32GB recommended +- Full unquantized: 20GB+ VRAM + +**ComfyUI Integration:** Full support. GGUF loader custom node. Multiple workflow options. + +**ControlNet:** Flux Tools includes Canny and Depth models. XLabs-AI flux-controlnet-collections. InstantX FLUX.1-dev-Controlnet-Union-alpha. + +**LoRA Support:** Yes. Training tools: FluxGym, Replicate flux-dev-lora-trainer, fal.ai flux-lora-general-training. + +**Quality vs Midjourney:** Top-tier prompt understanding, strong photorealism. "Midjourney still has a slight edge in some photorealism tests." + +**Prompt Style:** Verbose, natural language narrative works best. Forgiving, responds well to experimentation. + +--- + +### 16. Civitai + +**What is it:** Model marketplace + integrated web-based generator. Hub for Stable Diffusion and Flux models. + +**Buzz Credits System (2025):** +- Resource surcharges for LoRA/LyCORIS/embeddings (increased GPU load) +- Vidu video: 600 Buzz/generation +- Credit card payments paused; alternative methods introduced + +**Models:** SD families, Flux models, Vidu, Wan 2.1, Hunyuan (video). Tens of thousands of checkpoints supported. On-site LoRA trainer. + +**Features:** txt2img, img2img, ControlNet preprocessors (Canny, Depth, Pose), upscalers, weighted LoRA attachments, video generation (T2V, I2V, R2V). + +**Community:** Model marketplace, content showcase, review system, Bounties marketplace, Creator Program monetization. + +**2025 Issues:** +- Stricter moderation (April 2025) - payment processor pressure +- Real-person likeness removal (May 2025) +- Payment disruptions (credit cards paused, ZKP2P paused) + +**API:** Not documented in sources. + +**Commercial Use:** Per-model licensing. Usage Control mode (on-site only, no downloads). + +--- + +## CATEGORY 4: AGGREGATORS + +### 17. Poe (Quora) + +**Image Models Available:** +- FLUX-pro-1.1 (photorealism) +- GPT-Image-1 (painterly, artistic) +- Imagen3, Imagen 4 +- DALL-E 3 +- Google Gemini 2.5 Flash Image (48% of image gen usage) +- Flux Kontext, Seedream 3.0 +- Runway Gen 4 Turbo, Veo 3 +- 100+ models total (text, image, voice, video) + +**Pricing (2025):** +- Free: 3,000 points/day (resets daily), ~150 messages/day +- $4.99/mo: 10,000 points/day +- $19.99/mo: 1 million points/month +- $49.99/mo: 2.5 million points/month +- $99.99/mo: 5 million points/month +- $249.99/mo: 12.5 million points/month +- Add-on: $30 per 1 million tokens + +**Image Generation Cost:** GPT-4o low-quality 1024x1024: 328 points + +**API:** Released July 2025. Uses existing point-based subscription. OpenAI-compatible chat format. + +**Features:** Multi-model comparison in one interface, custom bot creation without coding, App Creator for building image gen apps. + +**User Complaints:** Credits don't roll over (daily reset), price increases, payment issues for bot creators, bugs. + +**Unique:** All-in-one aggregator - one subscription for multiple premium AI models. Compare outputs side-by-side. + +--- + +### 18. Krea.ai + +**What is it:** Multi-functional creative AI suite with real-time generation. Changes creative workflow from "prompt-wait-revise" to active co-creation. + +**Models:** Flux, Veo 3, Kling, Hailuo, Wan, Runway. 1000+ styles, 20+ models total. + +**Pricing:** Free and paid plans available. Free: multiple images/day. Specific tiers not detailed. + +**Key Features:** +- **Real-time Canvas:** Split interface - canvas for input, AI render on other side. Images evolve as you draw/modify. "AI Strength" slider for control. +- **Speed:** Images in <50ms, sets in ~7 seconds. Flux generates 1024px in 3 seconds. +- **Enhancer:** Upscale images/videos up to 22K resolution. Premium: 4K/8K. +- **Generative Editing:** In/out-painting, object add/remove, style transfer. +- **Real-time Video:** Dynamic clips from text, images, or webcam. Abstract motion backgrounds, cinemagraphs. + +**User Sentiment:** Overwhelmingly positive. "Best AI imaging yet." "Outstanding real-time generation." Professional users praise controllability. + +**Commercial Use:** Confirmed for commercial purposes. Supports professional team workflows. + +**Best For:** Designers (rapid iteration), AI artists (precise control), concept artists (sketch to textured art in seconds), teams (moodboard to final in minutes). + +**Unique:** Real-time interactive workflow. Industry leader in real-time engine. + +--- + +### 19. Freepik AI + +**What is it:** All-in-one creative platform combining AI generation with stock assets, templates, and editing tools. + +**Models:** +- Mystic (Mystic 2.5) - proprietary, fine-tuned on Flux/SD/Magnific.ai. 2K resolution default. +- Flux and Flux 1.1 +- Ideogram +- Classic + +**Key Differentiator:** Excellent text rendering in images - outperforms Midjourney and DALL-E 3. + +**Features:** +- **Generation:** Text-to-image, multiple styles (photorealistic, 3D, illustration) +- **Editing:** Reimagine (4 variations), Resize/outpainting, Retouch, Background remover, Upscaler (to 4K) +- **Additional Tools:** AI Video (powered by Google Veo), AI Voice/Audio, Sketch-to-Image, Custom Characters, Custom Style (LoRA), Mockup Generator, AI Icon Generator, Video Upscaler + +**Pricing:** Mystic requires paid subscription. Specific tiers not detailed. + +**Quality:** Photorealistic results, especially portraits. "National Geographic quality" for realistic scenes. Not as refined as Firefly or Midjourney's cinematic style in some cases. + +**Best For:** Photorealistic content, professional marketing, 3D visualization, text-inclusive designs, all-in-one design workflows. + +**API:** Not documented in sources. + +--- + +## MIDJOURNEY STATUS (January 2026) + +**Confirmed:** +- Web interface operational at midjourney.com +- Mobile apps available (iOS, Android) +- Discord still available but NOT required +- **NO official API exists** + +**Pricing:** +- Basic: $10/mo (limited GPU time) +- Standard: $30/mo +- Pro: $60/mo +- Mega: $120/mo + +--- + +## KEY INSIGHTS FOR ARTICLE + +### Pricing Comparison (Cost per Image - API) +| Service | Cheapest Option | Notes | +|---------|-----------------|-------| +| Runware | $0.0006/image (FLUX Schnell) | 1,666 images/$1 | +| Novita AI | $0.0015/image | Baseline rate | +| Replicate | ~$0.003/image | 30 images/$1 | +| fal.ai | $0.03/image (Seedream V4) | 33 images/$1 | +| Gemini/Imagen | ~$0.03/image | Via API | + +### Pricing Comparison (Subscriptions) +| Service | Free Tier | Paid Starting | +|---------|-----------|---------------| +| Recraft | 50/day | $10/mo | +| Leonardo AI | 150 tokens/day | $12/mo | +| Runway | 125 one-time | $12/mo | +| Poe | 3,000 pts/day | $4.99/mo | +| Adobe Firefly | Yes (web) | Creative Cloud | +| Ideogram | Yes | Credit-based | +| Krea.ai | Yes | Not specified | + +### Free Tiers Summary +- Leonardo AI: 150 tokens/day +- Runware: $10 free credits (~1,000 images) +- Segmind: $5 free credits +- fal.ai: Free tier available +- Together AI: 3 months unlimited FLUX.1 Schnell +- Poe: 3,000 points/day +- Adobe Firefly: Free web access +- Ideogram: Free to start +- Recraft: 50 daily credits +- Runway: 125 credits one-time +- Krea.ai: Multiple images/day +- Gemini: Free in Gemini app + +### Best for Developers (API) +1. **Replicate** - Official Models program, Cog tool, zero-scale +2. **fal.ai** - TypeScript SDK, fastest speeds, day-zero models +3. **Runware** - Cheapest per-image, unified API for 400K models +4. **Together AI** - OpenAI-compatible, unified text/image/video + +### Best for Text in Images +1. **Ideogram** (best-in-class) +2. **Freepik Mystic** (outperforms MJ/DALL-E) +3. **FLUX models** +4. **GPT-4o** +5. **Recraft** (especially for branding) + +### Best for Vector Graphics +1. **Recraft** - Native SVG output + +### Best for Real-Time Generation +1. **Krea.ai** - Industry leader, <50ms generation + +### Best for Commercial Safety +1. **Adobe Firefly** - "Commercially safe" models, content credentials + +### Self-Hosting Options +- FLUX: 6-24GB VRAM depending on quantization +- SD 3.5 Medium: 9.9GB VRAM +- ComfyUI: Most popular interface +- Civitai: Model marketplace + generator + +### Aggregators Value Proposition +- **Poe:** One subscription for FLUX, GPT-Image, Imagen, DALL-E, etc. API available. +- **Krea.ai:** Real-time canvas + multiple models (Flux, Veo 3, Kling, Runway) +- **Freepik AI:** Multiple models + stock assets + editing tools +- **Adobe Firefly:** Partner models (FLUX.2, Gemini, GPT) + Adobe ecosystem + +### Video Capabilities +- **Runway:** Primary focus, Gen-3/Gen-4 models +- **Leonardo AI:** Motion 2.0 +- **Krea.ai:** Real-time video from text/images/webcam +- **Adobe Firefly:** Video model (1080p) +- **Poe:** Access to Veo 3, Runway Gen 4, Kling