Muse, a new Text-To-Image Generation that can produce photos of a high quality
February 6, 2023
Meanwhile Google AI released a research paper about Muse, a new Text-To-Image Generation that can produce photos of a high quality comparable to those produced by models like the DALL-E 2 and Imagen at a rate that is far faster.
Muse uses a 900 million parameter model called a masked generative transformer to create visuals instead of pixel-space diffusion or autoregressive models. Google AI has trained a series of Muse models with varying sizes, ranging from 632 million to 3 billion parameters, finding that conditioning on a pre-trained large language model is crucial for generating photorealistic, high-quality images.