OpenAI president shares first picture generated by GPT-4o

Be part of us in returning to NYC on June fifth to collaborate with govt leaders in exploring complete strategies for auditing AI fashions relating to bias, efficiency, and moral compliance throughout numerous organizations. Discover out how one can attend right here.

OpenAI’s president Greg Brockman has posted from his X account what seems to be the primary public picture generated utilizing the corporate’s model new GPT-4o mannequin.

As you’ll see within the picture under, it’s fairly convincingly photorealistic, exhibiting an individual sporting a black T-shirt with an OpenAI brand writing chalk textual content on a blackboard that reads “Transfer between Modalities. Suppose we directly model P (text, pixels, sound) with one big autoregressive transformer. What are the pros and cons?”

A GPT-4o generated picture — a lot to discover with GPT-4o’s picture era capabilities alone. Group is working laborious to deliver these to the world. pic.twitter.com/5mO5aQxbaK

— Greg Brockman (@gdb) Might 15, 2024

The brand new GPT-4o mannequin, which debuted on Monday, improves upon the prior GPT-4 household of fashions (GPT-4, GPT-4 Imaginative and prescient, and GPT-4 Turbo) by being quicker, cheaper, and retaining extra info from inputs corresponding to audio and imaginative and prescient.

It’s in a position to take action as a result of OpenAI took a unique method from its prior GPT-4 class LLMs. Whereas these chained a number of totally different fashions collectively and transformed different media corresponding to audio and visuals to textual content and again, the brand new GPT-4o was educated on multimedia tokens from the get-go, permitting it to immediately analyze and interpret imaginative and prescient and audio with out first changing it into textual content.

VB Occasion

The AI Impression Tour: The AI Audit

Be part of us as we return to NYC on June fifth to interact with high govt leaders, delving into methods for auditing AI fashions to make sure equity, optimum efficiency, and moral compliance throughout numerous organizations. Safe your attendance for this unique invite-only occasion.

Request an invitation

Based mostly on the above picture, the brand new method is a noticeable enchancment over OpenAI’s final picture era mannequin DALL-E 3 which debuted in September 2023. I ran an identical immediate by way of DALL-E 3 in ChatGPT and right here is the end result.

As you possibly can see, the picture shared by Brockman created with GPT-4o improves considerably in high quality, photorealism, and accuracy of textual content era.

Nevertheless, GPT-4o’s native picture era capabilities should not but publicly obtainable. As Brockman alluded to in his X put up by saying “Team is working hard to bring those to the world.”

VB Each day

Keep within the know! Get the most recent information in your inbox every day

By subscribing, you conform to VentureBeat’s Phrases of Service.

Thanks for subscribing. Try extra VB newsletters right here.

An error occured.

Trending →

Gamers Championship 25: Chris Dobey takes third title of the yr forward of World Grand Prix | Darts Information

Boeing sending first astronaut crew to space after years of delay By Reuters

7-to-7 is the new 9-to-5: Research shows that workers’ days in the office are fewer but longer than pre-pandemic

Japan’s yen had a rollercoaster week amid suspected intervention

US stands to lose Canadian natural gas when LNG Canada terminal starts up By Reuters

OpenAI president shares first picture generated by GPT-4o

VB Occasion

You Might Also Like ↷

Amazon tablets are getting AI instruments, like writing help and computerized web site summaries

OpenAI’s DevDay 2024: 4 main updates that can make AI extra accessible and reasonably priced

On the spot harkens again to a pre-Google Firebase

Southwest director buys 3.6 million shares, opposes extra management adjustments By Reuters