6Pages write-ups are some of the most comprehensive and insightful I’ve come across – they lay out a path to the future that businesses need to pay attention to.
— Head of Deloitte Pixel
At 500 Startups, we’ve found 6Pages briefs to be super helpful in staying smart on a wide range of key issues and shaping discussions with founders and partners.
— Thomas Jeng, Director of Innovation & Partnerships, 500 Startups
6Pages is a fantastic source for quickly gaining a deep understanding of a topic. I use their briefs for driving conversations with industry players.
— Associate Investment Director, Cambridge Associates
Read by
BCG
500 Startups
Used at top MBA programs including
Stanford Graduate School of Business
University of Chicago Booth School of Business
Wharton School of the University of Pennsylvania
Kellogg School of Management at Northwestern University
Reading Time Estimate
14 min read
Listen on:
Apple PodcastsSpotifyGoogle Podcasts
1. OpenAI and advanced image generation
  • On Tuesday, OpenAI turned on GPT-4o’s Image Generation, a model “capable of precise, accurate, photorealistic outputs” that industry watchers are calling “insane.” Unlike OpenAI’s DALL-E, 4o Image Generation is a native capability of the GPT-4o multimodal model, which means it can take more precise direction from users’ prompts and iterate on images with users to produce more useful results. The release – which is among a swath of advanced image-generating features recently introduced by leading AI players (e.g. Google, xAI) – signals an inflection point for image generation.
  • 4o Image Generation can produce, for instance, high-fidelity versions of images in the Studio Ghibli style popularized by movies like My Neighbor Totoro. (OpenAI CEO Sam Altman changed his X profile photo to a Studio Ghibli-style version, which is up as of this writing.) It can generate a comic strip in which Elon Musk explains quantum computing, a version of a user’s profile pic as a Muppet astronaut, an image restyled in the ‘80s era, detailed street signage for witches with accurate text rendering, scientific diagrams in an illustrated style, a mockup of a game controller that exists nowhere else, a Mughal portrait in the Rembrandt style, a new furniture design based on a color swatch, a Korean organic restaurant menu with illustrations, or a high-end wedding invitation with iconography, among many other possibilities.
  • Useful image generation,” as OpenAI calls it, has long been limited by models’ challenges in taking close direction from users while iterating on images. The non-deterministic nature of prior generations of the technology meant that achieving a desired specific outcome often required some measure of luck. While generative AI could create “surreal, breathtaking scenes,” it was much harder to create images that conveyed precise information and meaning.
  • Alternatively, 4o Image Generation can incorporate world knowledge into images so the user can prompt the AI with just a high-level description if they prefer. For instance, it can design an infographic on why San Francisco is so foggy, a graphic with recipes for cocktails, or an educational poster about whales – all without explicitly providing the model with the specific information to be displayed.
  • The images produced by 4o Image Generation can be photorealistic, in addition to being able to mimic certain styles. Examples provided by OpenAI include a paparazzi-style photo of Karl Marx with shopping bags walking through a mall parking lot, a time-stamped developed photograph of a girl drinking a smoothie at a Toronto farmer’s market, and a fruit bowl full of miniature planets. Like other ChatGPT-generated images, the images produced by 4o are owned by the user and can be used within the scope of OpenAI’s usage policies.
  • OpenAI’s new image-generation feature is available to ChatGPT users in OpenAI’s Plus, Pro, and Team tiers, and will become available to Enterprise and Edu users next week. The Free tier originally had access immediately but OpenAI had to pull back because its GPUs are melting” due to demand. Free users will “soon” be able to generate 3 images per day, the same as DALL-E 3. Other tiers will see limits as well, according to OpenAI CEO Sam Altman. Developers will also be able to generate images using GPT‑4o through OpenAI’s API within the next few weeks.
  • OpenAI isn’t the only player working on native image generation, although it reportedly has the most advanced model among those tested. Its release follows soon after Google’s announcement of its own native image-generation feature for Gemini 2.0 Flash, to positive reviews. (Google’s Gemini 2.5 Pro is the model currently topping the leaderboards.) Google’s feature, like OpenAI’s, allows for story/illustration generation, conversational image-editing, incorporation of world understanding, and better text rendering. Google’s model will reportedly even remove watermarks from images.
  • Last week, Elon Musk’s xAI introduced image generation through its API, using its “grok-2-image-1212” model (which is more limited than its Grok 3). The feature can generate up to 10 images per request, at $0.07 per image, although it’s not capable yet of editing the image quality, size, or style. (Musk’s X social platform added a photorealistic image generator called Aurora from xAI in Dec 2024.) There are also image-generation offerings from players like Runway, Adobe, Playground AI, and others.
  • Perhaps most notably, Altman has hinted that OpenAI may open up some of its models to compete with open-source players like DeepSeek. Just this week, it added support for rival Anthropic’s open-source Model Context Protocol (MCP), which is used to connect a user’s data sources to AI applications.
  • We seem to be getting closer to the point where AI does most of the work and pure creativity can be of value on its own. In such a world, the only limits – at least in the digital realm – are the limits of the imagination. On the other hand, with fewer barriers to producing creative work, more people will become creators with less effort required of them to create. This could mean diminishing returns to average creativity, and perhaps even diminishing returns to above-average creativity if IP protections become eroded. Tools like OpenAI’s image generation are likely to end up having the greatest impact on the middle 50% of the creator economy, touching platforms like Etsy as well as professions like graphic design and photography.
Related Content:
  • Feb 21 2025 (3 Shifts): xAI’s Grok 3 is highly performant
  • Feb 7 2025 (3 Shifts): Distillation and AI economics
Become an All-Access Member to read the full brief here
All-Access Members get unlimited access to the full 6Pages Repository of751 market shifts.
Become a Member
Become a Member
Already a Member?
Disclosure: Contributors have financial interests in Meta, Alphabet, and OpenAI. Google and OpenAI are vendors of 6Pages.
Have a comment about this brief or a topic you'd like to see us cover? Send us a note at tips@6pages.com.
All Briefs
See more briefs

Get unlimited access to all our briefs.
Make better and faster decisions with context on far-reaching shifts.
Become a Member
Become a Member
Get unlimited access to all our briefs.
Make better and faster decisions with context on what’s changing now.
Become a Member
Become a Member