ASK KNOX
LESSON 98

Midjourney — The Creative Professional's Tool

Midjourney v6+ sets the quality benchmark every other image model is measured against. Understanding its parameter syntax, Vary (Region) in-painting, and style reference system makes you dangerous for creative direction — even without an API.


Midjourney is where every honest conversation about image quality starts. v6 photorealism has cleared the bar that separates AI-generated images from photography in the minds of most viewers. The aesthetic coherence — the way light behaves, the way materials feel, the way composition flows — is the benchmark everything else is measured against in 2026.

The tradeoff is that Midjourney is stubbornly a Discord product. No REST API. No programmatic access. No automation. What you gain in quality you give up in scalability. Understanding what Midjourney is actually good for — and where it stops being the right tool — is the entire lesson.

Midjourney Parameter Syntax

The v6 Parameter System

Midjourney parameters attach to the end of your prompt text with a -- prefix. Their order does not matter, but convention puts them after the full prompt string.

The core parameter set:

/imagine A lone samurai on a misty mountain ridge, cinematic, golden hour --ar 16:9 --v 6.1 --style raw --chaos 20 --q 2

Each parameter controls a distinct generation variable.

--ar (Aspect Ratio)

Aspect ratio is the first parameter to set because it shapes the composition before anything else. Common values: 16:9 (landscape, hero images), 9:16 (portrait, social vertical), 1:1 (square, thumbnails), 3:2 (editorial), 4:5 (Instagram portrait).

Midjourney composes the image natively to the specified ratio. Do not crop after the fact when you can specify correctly at generation time.
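If you know the pixel dimensions of the final placement, the matching --ar value is just the reduced width:height ratio. A minimal sketch (the helper name is hypothetical, not part of Midjourney):

```python
from math import gcd

def ar_from_pixels(width: int, height: int) -> str:
    """Reduce pixel dimensions to the simplest --ar ratio string."""
    d = gcd(width, height)
    return f"--ar {width // d}:{height // d}"

# A 1920x1080 hero image reduces to the 16:9 landscape ratio,
# and a 1080x1350 Instagram portrait reduces to 4:5.
print(ar_from_pixels(1920, 1080))  # --ar 16:9
print(ar_from_pixels(1080, 1350))  # --ar 4:5
```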

--v (Version)

--v 6.1 is the current stable release as of early 2026. v6 represents a significant quality jump over v5.x — better photorealism, better text rendering, more accurate prompt adherence. The default if you omit --v is whatever Midjourney considers current, which shifts with updates.

Specify the version explicitly on production creative work so your style references remain reproducible.

--style raw

By default, Midjourney applies its own aesthetic preferences on top of your prompt. It makes images "prettier" by its own standards — more saturated, more composed, more of a Midjourney aesthetic. --style raw disables this. The model executes your prompt more literally without the house style overlay.

For operators trying to hit a specific aesthetic that is not the Midjourney default, --style raw is almost always the right call. It trades some default polish for prompt fidelity.

--chaos

Chaos controls how varied the four initial image options are from each other. --chaos 0 produces four very similar interpretations — useful when you have a tight brief and want consistency to choose from. --chaos 100 produces four dramatically different interpretations — useful when you are exploring a concept space and want maximum variation to evaluate.

Default is 0. A value of 15–25 gives useful variety without losing coherence with the brief.

--q (Quality)

Quality controls render time and detail level. --q .25 renders fast and cheap (fewer GPU-minutes consumed from your subscription). --q 2 renders at maximum detail and takes proportionally longer.

Default is 1. For exploratory work, use --q .25 to iterate quickly. For final output, use --q 2.
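Because there is no API, parameters are just text appended to the prompt. If you iterate on many prompts, a small helper keeps the parameter formatting consistent before you paste into Discord. This is a hypothetical convenience, not a Midjourney integration; it only assembles the string:

```python
def build_prompt(prompt: str, **params: object) -> str:
    """Join a prompt with Midjourney-style --key value parameters.

    Keyword argument names map directly to parameter names,
    e.g. ar="16:9" becomes --ar 16:9.
    """
    flags = " ".join(f"--{key} {value}" for key, value in params.items())
    return f"{prompt} {flags}" if flags else prompt

cmd = build_prompt(
    "A lone samurai on a misty mountain ridge, cinematic, golden hour",
    ar="16:9", v="6.1", style="raw", chaos=20, q=2,
)
print(cmd)
```

Keyword order is preserved (Python 3.7+), so the output matches the convention of putting all parameters after the full prompt string.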

Style References — The --sref Parameter

--sref is one of the most powerful tools in the Midjourney v6 parameter set. It takes an image URL and uses that image as a style anchor for the generation.

/imagine A corporate office with modern furniture --sref https://example.com/your-reference-image.jpg --sw 80

The --sw (style weight) parameter controls how strongly the reference image influences the output. It ranges from 0 (minimal influence) to 1000, with a default of 100. Values of 60–80 typically produce the best balance of style fidelity and prompt adherence.

The practical workflow: generate 10-15 variations on your core visual concept, curate the one that best captures the target aesthetic, save that URL, and use it as --sref in every subsequent generation for that project. You have now created a soft visual style guide that the model can execute against.
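One way to operationalize that soft style guide is to keep the curated reference URL and weight per project, and attach them to every prompt. A sketch under that assumption (the registry and helper names are hypothetical):

```python
# Curated style anchors per project: (reference image URL, style weight).
STYLE_REFS: dict[str, tuple[str, int]] = {
    "brand-blog": ("https://example.com/your-reference-image.jpg", 80),
}

def with_style(prompt: str, project: str) -> str:
    """Append the project's saved --sref anchor and --sw weight to a prompt."""
    url, weight = STYLE_REFS[project]
    return f"{prompt} --sref {url} --sw {weight}"

print(with_style("A corporate office with modern furniture", "brand-blog"))
```

Every generation for a project then carries the same style anchor, which is the whole point of the curation step.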

Vary (Region) — Targeted In-Painting

Vary (Region) is Midjourney's in-painting tool. After a generation, clicking "Vary (Region)" opens a canvas where you can select an area of the image and describe a different subject for that region.

This is non-destructive editing at the model level. The surrounding image remains locked. Only the selected region is re-generated with your new prompt. The model attempts to blend the new content into the existing lighting, shadow, and compositional context.

Common use cases:

  • Replace a background element without changing the subject
  • Swap out text or signage in an image
  • Modify clothing or props on a subject
  • Add or remove environmental elements

The Discord Interface — Production Limitations

Midjourney's Discord-only interface is a feature for casual users and a constraint for operators. There is no way to trigger generation from code. You cannot schedule it, pipe it from another service, or integrate it into a deployment workflow without third-party workarounds (browser automation, unofficial API wrappers) that violate terms of service.

For production pipelines — blog autopilot systems, content flywheels, scheduled social media generation — Midjourney is out of the chain. Leonardo AI and gpt-image-1 handle that use case.

Where Midjourney earns its place in a serious operator's workflow:

Creative direction phase. Before building a content pipeline, spend a session in Midjourney defining the visual aesthetic. What does the lighting look like? What is the composition language? What style anchors work for this brand? Get this right in Midjourney, then carry the direction into your API providers using descriptive language and style references.

High-stakes one-off images. Book covers, hero images for major articles, brand identity visuals. Cases where quality ceiling matters more than automation, and you have the time to iterate manually.

Style vocabulary development. Use Midjourney to test which style, lighting, and composition descriptors produce the output you want. The vocabulary you develop translates directly to your Leonardo and gpt-image-1 prompts.

Negative Prompts in Midjourney

Midjourney uses the --no parameter for negative prompts rather than a separate field.

/imagine A portrait of a CEO --no text, watermarks, blurry, low quality, extra limbs, deformed hands

Keep negative prompts focused on quality issues and elements that commonly appear uninvited. Overloading --no with too many terms can reduce prompt adherence for the positive instruction.
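The "keep it focused" advice can be enforced mechanically: cap the negative list before appending it. The cap below is a hypothetical house convention, not a Midjourney limit:

```python
def add_negatives(prompt: str, negatives: list[str], max_terms: int = 6) -> str:
    """Append a --no clause, trimming the list to a focused maximum.

    Overlong negative lists can dilute adherence to the positive prompt,
    so anything past max_terms is dropped.
    """
    kept = negatives[:max_terms]
    return f"{prompt} --no {', '.join(kept)}" if kept else prompt

print(add_negatives(
    "A portrait of a CEO",
    ["text", "watermarks", "blurry", "low quality", "extra limbs", "deformed hands"],
))
```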

Lesson 98 Drill

Open Midjourney. Take a visual concept from your actual work — an article you are planning to publish, a product you are building, a brand aesthetic you are developing. Generate at least 20 variations using different values for --chaos, --style raw vs default, and at least two --sref references. Document which parameter combinations produce output closest to your intended aesthetic. You are building the vocabulary you will carry into the API providers in subsequent lessons.

Bottom Line

Midjourney is the quality benchmark and the creative direction tool. It is not your production pipeline. Operators who understand this distinction use Midjourney where it wins — establishing visual language, validating aesthetic direction, and producing reference images — and then replicate that direction in programmatic providers. The creative work happens here. The automation work happens in Leonardo and the OpenAI stack. Know which stage you are in.