Skip to content Skip to footer

Midjourney V6 is Unbelievable – My Preliminary Ideas and Prompts

For some time, I’ve felt just like the AI picture era area is turning into too stagnant. There weren’t any important developments after DALL-E 3, and newer fashions have left me upset or feeling a bit impartial.

One factor stored me going for months: the discharge of Midjourney V6.

And when it lastly got here, it got here as a sudden Christmas current. Nobody actually anticipated V6 earlier than 2024, so this early sneak peek of its base mannequin — even with out a few of its functionalities — was a welcome shock.

It has been a few days and I am now prepared to provide my preliminary ideas on this new mannequin. On this article, I am going to define each enchancment and alter that got here with V6.

What’s New With V6?

Midjourney V6 is the AI picture generator’s newest mannequin, increasing upon the capabilities of earlier iterations and altering a few of its core functionalities. Like with V5, this mannequin isn’t the ultimate model of V6, relatively simply the bottom mannequin which will likely be progressively fine-tuned within the subsequent months.

Nevertheless, it’s nonetheless essentially the most succesful Midjourney mannequin thus far, with additions equivalent to:

  • Improved output creativity.
  • Higher immediate comprehension.
  • Strong upscalers.
  • Textual content era.

V6 can now take lengthy and complicated prompts, and create correct photos. It comes at a price although, with loyal Midjourney customers being requested to relearn the best way to immediate because the mannequin is extraordinarily delicate. I’m having the identical subject myself however, on the finish of the day, it’s a small value to pay.

How Are They Bettering The Mannequin?

Other than increasing its coaching set, Midjourney can be gathering person opinion to enhance its mannequin by way of A/B Testing. That is voluntary however customers can get free hours in the event that they take part, driving extra customers to fee photos on their web site.

Midjourney V6 Output High quality

Now that you just’ve been launched to the enhancements Midjourney made on its mannequin, it’s time to see it in motion. Listed below are some output examples from V6, grouped neatly by picture class:

Realism (Portraits)

Immediate A: a younger girl attending a music pageant, backlighting, portrait
Immediate B: a physics professor instructing his class, academia, close-up

In case you’ve been following together with my Midjourney critiques, you understand that I’ve lengthy been annoyed about its tendency to create waxy faces and overemphasize sure options. With V6, that’s develop into a factor of the previous.

Each of those photos look extremely actual. Even in the event you look intently, there aren’t any clear indicators that these are generated with an AI in any respect. These examples actually communicate to how far Midjourney has come from V5.2 in simply a few months.

Private Rating: 5 out of 5

Realism (Panorama)

Immediate A: solitary stone cabin in an enormous alpine meadow, wildflowers in bloom, snow-capped peaks within the distance
Immediate B: misty autumn forest path, fallen leaves carpeting the bottom, daylight filtering by way of the bushes at dawn

That is one other excellent rating for me. I attempted to journey Midjourney up through the use of conversational language however its improved coherence allowed it to satisfy each phrase in my immediate. As for the standard, they’re vibrant and vivid with out going overboard, present no signal of rendering points, the shadows make sense, and the depth of discipline is constant.

Private Rating: 5 out of 5

3D Renders

Immediate A: a minimap diorama of a quiet stylish library adorned with indoor vegetation
Immediate B: business images, a handcrafted ceramic bowl, earth tones, smooth lighting, vegetation

The extra that I take advantage of Midjourney V6, the extra I’m satisfied that it has no weak factors. These are each extremely correct 3D renders of the immediate topics. I significantly just like the composition of the bowl shot, with the pure mild coming from the window. Then again, the diorama is so detailed but it surely doesn’t lose that miniature feeling to it.

Private Rating: 5 out of 5

Pastiche

Immediate A: Magneto in a dvd display screen seize of Dragon Ball Z, drawn by Akira Toriyama, animated by Toei animation studio, 1985 Japanese anime
Immediate B: rows upon rows of lavender stretch to the horizon underneath a full moon, within the model of vincent van gogh

I’ve by no means actually had any subject with Midjourney imitating different artists earlier than, however they’ve positively stepped up their sport in V6. Their most noticeable enchancment is subtlety. For instance, once I generated Van Gogh photos earlier than, it did so by copying Starry Night time intently which resulted in a whole lot of spirals and stars within the sky. In V6, it took essentially the most recognizable traits of each Van Gogh portray and created an approximation of how the mannequin thinks Van Gogh would paint the immediate.

Private Rating: 5 out of 5

Structure and Inside Design

Immediate A: inside, a shed purposed as an artwork studio, bohemian, cottagecore, pure mild, whimsy, biophilic
Immediate B: exterior, a cathedral by Antoni Gaudí throughout sundown, structure

The inside design picture is just about excellent, for my part. The structure shot, however, is fairly good itself however the intricacy of the topic led to some rendering points. It’s hidden from afar however in the event you zoom in, you’ll be able to see some spires bleeding into one another.

Private Rating: 4.5 out of 5

Textual content Era

Immediate A: a bohemian espresso store named “Nook Espresso”
Immediate B: a professor writing “The Idea of Relativity” in a blackboard

Textual content era continues to be a weak spot for AI picture turbines, even with V6. Nevertheless, it’s value noting that this new mannequin could be the most effective in its section for textual content. The nook espresso textual content appears a bit of funky, but it surely’s nonetheless readable for essentially the most half. In the meantime, the textual content on the blackboard has some errors, however you’ll be able to nonetheless see what it’s attempting to write down.

In my testing, Midjourney V6 has been unbelievable with brief texts (1-3 phrases) but it surely turns into unreadable past that.

Private Rating: 4 out of 5

Excessive Context

Immediate A: a panoramic and cinematic portrait of a lone astronaut gazing out on the swirling nebulas of the Horsehead Nebula, their helmet reflecting the cosmic spectacle, as their giant spaceship explodes behind them. smooth and dramatic lighting. evoking a way of awe, marvel, and hazard.
Immediate B: a hyper-realistic portrait of an aged girl, her face etched with the traces of time and expertise, however her eyes shining with knowledge and heat. she sits in a sunlit room, surrounded by mementos of a life well-lived. the portrait captures each the great thing about age and the enduring power of the human spirit. wide-angle. impressed by rembrandt. 

Midjourney isn’t nearly as good as DALL-E 3 with GPT-4 in terms of immediate coherence, but it surely’s positively up there. It missed some traces in each prompts, just like the exploding spaceship and mementos, however many of the parts are nonetheless current, which is greater than I may say for Midjourney V5.2.

Private Rating: 4 out of 5

Common Rating

When tallied, my common rating of Midjourney V6 is 4.64 out of 5. That’s lower than half some extent away from an ideal rating, which reveals how unbelievable Midjourney is at its present stage.

In order for you extra examples of Midjourney V6’s output, I extremely counsel that you just learn our comparability articles in opposition to V5 and different AI picture turbines.

Professionals & Cons of Utilizing Midjourney V6

  • Important mannequin enchancment throughout the board.

  • Superb at immediate comprehension and textual content era.

  • Can now do sensible pictures.

  • Ethically outsources information consumption to its customers by way of A/B testing.

  • The very best AI picture generator available in the market.

  • Sluggish era pace at its present part.

  • Eliminated a few of V5.2’s options like area variations and panning. (These will return later)

  • Regardless of the brand new mannequin, Midjourney nonetheless has no separate net software.

Midjourney V6 vs. Different AI Picture Mills

DALL-E 3

Launched in October 2023, DALL-E 3 is the third model of OpenAI’s picture generator. Like V6, it was a big evolution from its earlier iteration, with a deal with each comprehension and textual content era. It’s obtainable by way of ChatGPT Plus or with Bing Create.

High quality Comparability

Portraits

Panorama

3D Product Mockups

Textual content Era

Excessive Context Prompts

What Makes DALL-E 3 Higher Than Midjourney V6?

  • Nonetheless considerably higher at nuance.
  • It may be accessed by way of a browser.
  • Much less liable to AI hallucination and rendering points.
  • Quicker era time than the present Midjourney mannequin.
  • GPT-4 processes your conversations or prompts into ones that may be higher understood by DALL-E 3.

What Makes DALL-E 3 Worse Than Midjourney V6?

  • Midjourney can now do textual content higher than DALL-E 3.
  • Midjourney is healthier at each realism and digital artwork.
  • It doesn’t have the identical customization options as Midjourney.
  • You need to use artist names as prompts for Midjourney.
  • DALL-E doesn’t provide you with management over the output’s side ratio.

Meta

Meta’s AI picture generator is a text-to-image generative mannequin which makes use of a mannequin known as Emu. It’s utterly free but it surely’s additionally morally ambiguous, extra so than different picture turbines, as this mannequin makes use of information from Fb and Instagram customers as its coaching set.

High quality Comparability

Portraits

Panorama

3D Product Mockups

Textual content Era

Excessive Context Prompts

What Makes Meta Higher Than Midjourney V6?

  • Considerably quicker era pace.
  • Meta is free.

What Makes Meta Worse Than Midjourney V6?

  • It doesn’t save previous prompts and art work.
  • It doesn’t have any customization options.
  • Meta’s creativity isn’t nearly as good as Midjourney.
  • Meta can’t do textual content and doesn’t comply with lengthy prompts effectively.
  • Meta makes use of Fb and Instagram person information as its coaching set.

Wrapping Up

It’s a bit of too early to inform, but when that is how good Midjourney V6 already is even at its base mannequin, then I don’t see any level in investing in different AI picture turbines. It’s so good that it blows different fashions out of the water. Solely DALL-E can catch as much as Midjourney now, and so they’re not even remotely shut.

That stated, it nonetheless has a pair shortcomings, significantly in comprehension and lengthy textual content era. However then once more, so does each different AI generator. 

Sooner or later, a mannequin will attain the purpose of singularity in AI picture era, and we’ll be transferring on to newer frontiers like text-to-video or image-to-video. I actually imagine that Midjourney’s going to be on the pinnacle of AI picture era — the one that may usher this artistic future.

Leave a comment

0.0/5