this post was submitted on 07 Oct 2023
995 points (97.7% liked)
Technology
59217 readers
2764 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I still don’t believe the avocado comic is one-shot AI-generated. Composited from multiple outputs, sure. But I have not once seen generative AI produce an image that includes properly rendered text like this.
Bing image creator uses the new DALL-E model which does hands and text pretty good.
generated this first try with the prompt a cartoon avocado holding a sign that says 'help me'
People forget just how fast this tech is evolving
Absolutely SDXL with loras already can do a lot of what it was thought impossible.
Yeah Everytime iv seen anyone say "iv never seen it" makes it really obvious how little people actually know about the tech or follow it.
They basically saw it once a year ago and think it's still the same.
Image generation tech has gone crazy over the past year and a half or so. At the speed it's improving I wouldn't rule out the possibility.
Here's a paper from this year discussing text generation within images (it's very possible these methods aren't SOTA anymore -- that's how fast this field is moving): https://openaccess.thecvf.com/content/WACV2023/html/Rodriguez_OCR-VQGAN_Taming_Text-Within-Image_Generation_WACV_2023_paper.html
Yeah I'm sceptical too, what tool and prompt was used to produce this?
Its Dalle 3 its not that difficult to generate something like that using dalle 3 here's some shreks I generated as a showcase Shrek 1 inage
Shrek 2 Image
Shrek 3 Image
All of these are just generated nothing else
Huh interesting it handles text relatively well
I found the avocado comic the easiest to tell, since the missing eyebrow was so insanely out of place.
Its not that difficult to generate something like that using dalle 3 here's some shreks I generated as a showcase Shrek 1 inage
Shrek 2 Image
Shrek 3 Image
All of these are just generated nothing else
Prompt and tool links? I know there are tools that try to pick out label text in the prompt and composite it after the fact, but I don’t consider this one-shot AI generated, even if it’s a single tool from the user’s perspective.
Its Dalle 3 like I said. As far as in aware Dalle 3 doesn't do that since the text isn't always perfect still. Can't really provide prompts since its been a bit, and the history on it isn't great, but I was just mostly shrek in x style and saying "x" do mind you Dalle is very heavily censored now, so you're now unlikely to be able to recreate that.
It's on - https://bing.com/create