Web2Comics User Manual
Complete guide for configuring providers, generating comics, understanding the UI, and obtaining API credentials for supported providers.
Last updated: March 2026 (aligned with current extension behavior).
Popup / Generator Wizard Overview
The popup is the main entry point from the extension toolbar icon. It contains the action launcher, the comic generator wizard, progress view, and a lightweight history modal.
Launcher
- Create Comic: opens the generator wizard for the current page.
- My Collection: opens your saved comics list in the popup.
Top-right buttons
- Settings (gear): opens the Options page.
- ? help icon: opens this manual at the popup section.
Right-click selection actions
- Generate comic from selected text (Default): starts generation immediately using default settings.
- Open Create Comic with selected text: opens popup composer prefilled with your selected text so you can adjust options first.
- Toolbar context menu also provides quick actions (open composer / open collection) from the extension icon menu.
Create Comic (Popup Generator Wizard)
Source Text
Auto (full page): extracts readable page content from the current tab.My highlighted text: uses text you selected on the page.Re-scan: retries extraction.Extracted Source Textpreview: verify what is used for generation.AI Input Preview: first 1,000 chars that are sent to storyboard generation.Article Sectiondropdown: appears when multiple extraction candidates are available.
Customize
| Control | Meaning |
|---|---|
| Comic Length | Number of comic panels to generate (3–12). |
| Story Goal | Objective preset (for example Summary, ELI5, Timeline, Meeting Recap). |
| Language | Output language for captions and narrative text. |
| AI Provider | Primary text/storyboard provider. Some providers are hidden until configured and validated. |
| Advanced Options | Shows narrative detail level, style presets, and custom style creation/selection. |
| Narrative Detail | Controls storyboard richness (low/medium/high). |
| Visual Style | Preset art direction or a custom saved style. |
| Custom (One-off) | Uses current custom-style defaults without creating a saved style entry. |
| Create New Style… | Opens a modal to enter style name + description, saves it, and adds it to the style list for future sessions. |
Readiness and Generate
- The wizard shows readiness chips for Content, Provider, and Generate.
- Grounding Confidence appears before generation with High/Medium/Low guidance and an
Auto-pick best story sectionaction. - The
Generate Comicbutton is disabled until content and provider settings are ready. - The readiness box explains what to fix and can provide a
Fix next stepshortcut. - When full-page extraction fails, selection fallback guidance is shown with quick actions (
Show Selection Hint,Retry Full Page).
Popup Progress View
After generation starts, Web2Comics can attempt to open the side panel while the popup shows progress (Chrome may block auto-open unless it is triggered by a direct user gesture).
- Status text: current phase (storyboard or image rendering).
- Progress bar: panel-level completion progress.
- Panel progress list: per-panel statuses such as
Pending,Sent,Receiving,Rendering,Completed. - Debug progress log (when debug flag is enabled): prompt/response step entries.
- Cancel: requests cancellation of the current generation job.
Popup My Collection
The popup history modal is a quick-access list of previously generated comics.
- Shows recent items with thumbnails and source metadata.
- Each item has an individual Delete action (with confirmation).
- Useful when you want to reopen a comic quickly without opening the side panel browser.
- The full browsing experience is available in the side panel My Collection view.
Options / Settings Overview
The Options page manages defaults, providers, prompt templates, local storage, and general extension information.
Options: General
Generation Defaults
- Default Panel Count: starting value used in popup.
- Detail Level: default storyboard detail level.
- Default Output Language: default language used for captions and narrative text.
- Default Style: default art style preset or saved custom style for new generations.
- Custom Style Name / Description: shown when creating a new reusable custom style entry from Options.
- Caption Length: preferred caption length for generated panels.
If you choose Custom... in Default Style, the inline custom-style editor appears with a Create Style button. After creating a style, it is selected in the list and the inline editor is hidden again.
Free-tier-first default setup: Web2Comics starts with Google Gemini selected for both text and image generation (when eligible/configured), plus a low-cost default generation profile (fewer panels and lower detail) to maximize the chance that a first run succeeds on free-tier limits.
Behavior
- Automatically open comic viewer after generation: attempts to open the side panel on completion (Chrome may block auto-open without a user gesture).
- Enable character consistency mode: provider hint to keep characters visually consistent across panels.
- Debug flag: shows detailed provider/panel errors and enables richer local debugging behavior.
Image Generation → Content Filtering Handling
Controls what happens when an image provider blocks a panel prompt.
| Mode | Behavior |
|---|---|
rewrite_and_retry | Rewrites the prompt into a more neutral editorial framing and retries once. |
replace_people_and_triggers | Replaces real-person names and sensitive trigger terms, then retries once. |
show_blocked | No retry. A blocked placeholder panel is inserted so the comic still completes. |
- Show "Rewritten" badge: marks panels that succeeded after a retry rewrite/sanitize pass.
- Log original + rewritten prompt locally: stores prompt details in local debug logs/panel metadata for troubleshooting.
Options: Providers
Configure and validate provider credentials. Web2Comics supports mixed-provider setups and provider-specific model selection for both text and image generation where available.
Provider Cards
- Google Gemini: text + image generation.
- Cloudflare Workers AI: text + image generation (account-scoped auth).
- OpenRouter: text + image generation (image support depends on your OpenRouter account/model availability and may route/fallback to a different backend model).
- Hugging Face Inference API: text + image generation (chat for text,
hf-inferenceimage tasks for images). - OpenAI: text + image generation (GPT + DALL-E).
Validation
- Use each provider card’s Validate button after entering credentials (buttons show a spinner while waiting).
- Web2Comics persists provider validation state in local storage and uses it to gate popup provider visibility/readiness.
- If a selected provider fails due to quota/budget/billing limits during generation, Web2Comics can automatically fall back to other configured providers (free-tier-first) for text and/or image generation.
- Google Gemini is the default first-run provider because it can cover both text and image generation with one key, but free-tier availability depends on Google account/project eligibility and region.
OpenAI Model / Speed Controls
| Control | Purpose |
|---|---|
| Text Model | Selects GPT model for storyboard generation. |
| Image Model | Selects DALL-E 2 / DALL-E 3 for panel image generation. |
| Image Quality | standard is faster; hd is slower (DALL-E 3 only). |
| Image Size | Smaller images are faster; invalid combinations are auto-normalized. |
Options: Prompts
Prompt templates let you customize storyboard and image prompts per provider scope.
Current support
- Provider scopes in the UI: OpenAI, Google Gemini, Cloudflare Workers AI, OpenRouter, and Hugging Face.
- Runtime template consumption (current build): provider templates are consumed for all configured providers during storyboard/image generation flows.
- Template types: Storyboard and Image.
- Prompt Library presets can be applied by use-case group to current provider or all providers.
- You can import prompt-library presets from a JSON array using
Import Prompt Library JSON. Imported presets are saved and available in future sessions.
Validation behavior
- Required placeholder checks prevent invalid saves.
- Unknown placeholder warnings are shown but can be used for future expansion.
Common placeholders
- Storyboard:
{{source_title}},{{source_url}},{{panel_count}},{{detail_level}},{{objective}},{{objective_label}},{{objective_guidance}},{{output_language}},{{output_language_label}},{{output_language_instruction}},{{style_prompt}},{{content}} - Image:
{{panel_caption}},{{panel_summary}},{{style_prompt}},{{panel_index}},{{panel_count}},{{output_language}},{{output_language_label}},{{output_language_instruction}}
Options: Storage
- Storage Usage: shows history count and rough storage data indicators.
- Maximum Cache Size: intended cap for local image/cache data handling.
- History Retention: target retention duration in days.
- Clear History: removes saved comics history.
- Clear Image Cache: clears cached image data (if used by current build).
- Export All Data: exports extension storage to JSON.
- Export Debug Logs: exports local debug logs to JSON.
- Google Drive Backup (Optional): connect OAuth and auto-save generated comics as interactive HTML into Drive folder
MyComics. - Connections (Optional): Google Drive, Facebook, and X use built-in OAuth flows; additional share-target connection toggles exist for Instagram and other targets such as LinkedIn, Reddit, and Email.
Options: About
Displays product/version information and informational links. This is also the safest place to confirm branding and extension identity when testing multiple local builds.
Comic Viewer Side Panel Overview
The side panel is the main viewing surface for generated comics, in-progress generation, and historical browsing.
Primary views
- Comic View: single-comic display and generation progress shell.
- My Collection: dedicated collection browser with keyboard support.
Header actions
- New Comic: opens the popup.
- Open in Tab: opens the current comic in a full browser tab view.
- Download: exports a single composite PNG image.
- Share: opens target-specific sharing flow (social/email/copy targets).
- ? help icon: opens this manual at the side panel section.
The header also shows live viewer counters for generated comics, panels, pages processed, and current local storage usage.
Side Panel: Comic View
Display modes
- Strip View: horizontal comic strip layout.
- Carousel: focused panel view with thumbnails.
- Panel View: grid/list view of all panels.
Layout Presets
Web2Comics includes a broad layout preset library (for example Single panel, 4-panel grid, Classic comic page, Manga page, Webtoon scroll, Masonry, Guided path, and Carousel). In the current implementation, presets are fully functional layout/view variants built on the side panel render engines and preset-specific styling rules.
Download Export
- Exports a single composite PNG image using the currently selected layout preset.
- Includes source title, source URL, source short-name (for example
cnn), panel thumbnails, captions, and attribution lineMade with Web2Comics.
Per-panel controls
- Regenerate panel, Regenerate caption, Make more factual, and Make simpler run scoped panel edits without rerunning the whole comic.
- Facts used blocks (when available) show entities, dates, numbers, and source snippet grounding.
- Jump to source opens the source and copies the source snippet for quick verification.
Policy handling indicators
- Rewritten panel succeeded after prompt rewrite/sanitization retry.
- Blocked panel is a provider-policy placeholder panel.
- View prompt button appears when debugging is enabled and prompt logging metadata is available.
Side Panel: Generation View
- Shows a comic-like placeholder shell during generation.
- Uses live panel statuses:
Pending,Sent,Receiving,Rendering,Completed,Error. - Matches the currently selected view mode/layout preset for continuity.
- Cancel stops the active job and shows a
Canceling...state while cancellation is being processed.
Side Panel: My Collection
- Dedicated browsing grid of previous comics.
- Cards include source short name and Original link.
- Keyboard support: tab focus,
Enter/Spaceto open, tablist navigation for primary view tabs. - Chunked loading with Show More for larger histories.
Appendix: How to Obtain Provider Keys / Tokens
Use local-only storage (Options page or local env files for tests). Do not commit secrets to git.
OpenAI (API Key)
- Open the OpenAI Platform API keys page: platform.openai.com/api-keys
- Sign in to the correct organization/project.
- Create a new secret key.
- Copy the key immediately (you may not be able to view it again).
- In Web2Comics Options → Providers → OpenAI, paste it and click Validate.
Reference: OpenAI Help: Where do I find my API key?
Google Gemini (Google AI Studio API Key)
- Open Google AI Studio: aistudio.google.com
- Open the API key page (Get API key / API keys).
- Select or create the correct Google Cloud project.
- Enable the Generative Language API for that project if prompted.
- Create an API key.
- Paste it into Web2Comics Options → Providers → Google Gemini and click Validate.
References:
Cloudflare Workers AI (Account ID + API Token)
- Sign in to Cloudflare Dashboard: dash.cloudflare.com
- Get your Account ID (Dashboard sidebar/account overview area).
- Create an API token with access to Workers AI / AI inference for that account.
- (Recommended) Use token auth instead of global key + email.
- For E2E tests, store
CLOUDFLARE_ACCOUNT_IDandCLOUDFLARE_API_TOKENin local env. - For extension runtime, configure Cloudflare credentials in the build/runtime path used by your current test setup.
Reference: Cloudflare Workers AI documentation
OpenRouter (API Key)
- Open OpenRouter keys page: openrouter.ai/keys
- Create a new API key.
- Copy the key and paste into Web2Comics Options → Providers → OpenRouter.
- Click Validate.
- Optional but recommended: test OpenRouter text and image models from the provider card using Test Text Model / Test Image Model because model availability varies by account and route.
Reference: OpenRouter Quickstart
Hugging Face Inference API (User Access Token)
- Sign in to Hugging Face: huggingface.co
- Open your access tokens page: huggingface.co/settings/tokens
- Create a token with appropriate inference permissions.
- Copy the token and paste into Web2Comics Options → Providers → Hugging Face.
- Click Validate.
- Select Hugging Face text and image models in the provider card (image generation uses Hugging Face
hf-inferenceimage endpoints).
Reference: Hugging Face Inference documentation
Testing Keys Locally (E2E)
For local E2E testing, Web2Comics supports loading secrets from a git-ignored file such as .env.e2e.local.
OPENAI_API_KEY=...
GEMINI_API_KEY=...
OPENROUTER_API_KEY=...
HUGGINGFACE_INFERENCE_API_TOKEN=...
CLOUDFLARE_ACCOUNT_ID=...
CLOUDFLARE_API_TOKEN=...