Quiz: which of these uses a Transformer (DiT) backbone for image generation?
All four are popular image models. Only one ditched UNet for a Transformer.
540 voters · Single choice · Quiz · 1 right answer

