
That's exactly right: he specified a style with each prompt and cherry-picked from 40-60 pictures: https://twitter.com/nickcammarata/status/1512119623315075081

>Btw transparency for this now-viral thread: I didn’t just paste prompts into dall-e, I played with style (eg. cyberpunk, oil, etc) to keep it interesting and diverse

>If I had to quantify, I’d say I’d generate 2 or 3 batches (tweaking prompt) before choosing my fav two pics, each batch outputs 20 images (two tabs 10 per), so prob technically cherry picked 2 out of 60. That said usually other 58 weren’t really broken, just boring / bit less fun
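
In script form, that batch-and-cherry-pick workflow looks roughly like this. A minimal sketch assuming the openai Python client; the prompt, style list, and batch size are illustrative, not his actual settings:

    # Sketch of the "one batch per style, then cherry-pick" workflow.
    # Assumes the openai Python package; all values here are illustrative.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    base_prompt = "a fox reading a newspaper in a cafe"
    styles = ["cyberpunk", "oil painting", "watercolor"]

    for style in styles:
        # DALL-E 2 allows up to n=10 images per request, matching the
        # "two tabs, 10 per" batches described in the tweet.
        batch = client.images.generate(
            model="dall-e-2",
            prompt=f"{base_prompt}, {style}",
            n=10,
            size="1024x1024",
        )
        for i, img in enumerate(batch.data):
            print(style, i, img.url)  # a human then picks two favorites by eye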




> you have to think of the underlying labeled text-to-image sets as paint colors to mix, and prepare a palette accordingly.

Very insightful tip on how to harness the "creativity" of Dall-E and the like.

I see how the phrase "king of Belgium" was too vague for DALL-E, so it didn't produce anything recognizable - but swapping in well-known details like "banker" and "salt and pepper hair" worked effectively to generate concrete imagery.

Hilarious results. :)
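
To make the "palette" idea concrete: the trick is to replace a vague phrase with descriptors the training data is dense in. A toy sketch (the descriptor and style lists are hypothetical):

    # Sketch: compose prompts from concrete "paint colors" instead of
    # vague phrases. All descriptor/style lists here are hypothetical.
    subject = "a banker with salt and pepper hair"  # instead of "king of belgium"
    details = ["wearing a sash", "standing in a palace"]
    styles = ["oil painting", "press photo"]

    prompts = [", ".join([subject, *details, style]) for style in styles]
    for p in prompts:
        print(p)
    # -> a banker with salt and pepper hair, wearing a sash,
    #    standing in a palace, oil painting
    # -> ... press photo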


Interesting, but my guess is it's using a big library of generated image and prompt pairs? So all its suggested prompts are straight out of someone's 'stable diffusion prompt cheatsheet.pdf'. That is to say, it over-represents commonly known artists and phrases like 'trending on deviant art'.
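
If that guess is right, the suggestion feature could be little more than nearest-neighbor retrieval over a stored prompt corpus. A sketch under that assumption, using sentence-transformers; the model choice and corpus are placeholders:

    # Sketch of prompt suggestion as nearest-neighbor retrieval over a
    # stored corpus of prompts -- just one guess at how such a tool works.
    from sentence_transformers import SentenceTransformer, util

    model = SentenceTransformer("all-MiniLM-L6-v2")

    # Placeholder corpus; a real tool would index thousands of scraped prompts.
    corpus = [
        "portrait of a wizard, trending on artstation, 8k",
        "cyberpunk city at night, by greg rutkowski",
        "oil painting of a cat, deviantart",
    ]
    corpus_emb = model.encode(corpus, convert_to_tensor=True)

    query_emb = model.encode("a wizard portrait", convert_to_tensor=True)
    hits = util.semantic_search(query_emb, corpus_emb, top_k=2)[0]
    for hit in hits:
        print(corpus[hit["corpus_id"]], hit["score"])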

It's an interesting set-up. Viewing others' images and seeing their exact prompts is just as entertaining as generating your own.

None, and as an experienced user you should know that it's not one-shot and most of the time not even few-shot... You can't compare cherry-picked press images with a few shots from a 5-second prompt. I don't know why you want to hype something up if you can't really compare it. It seems extremely attention-grifting.

Just look at their cherry-picks in this Discord: https://discord.com/invite/pxewcvSvNx . It's overfitted on copyrighted images (the Afghan Girl) and doesn't show more "compositional power" at all, most of the time ignoring half of the prompt.


Aside from the examples in the thread, he was taking requests, and I thought these were impressive:

https://twitter.com/j_stonemountain/status/16987819625744466... (describing each panel of multiple comic book pages accurately, including text)

recognizing English cursive

https://twitter.com/j_stonemountain/status/16990947849404212...


Interesting. I assume the pictures are all pre-made? Or is he typing away like crazy?

I think I had around 12 human-created images in my batch.

Did you read the article? The author says he used Photoshop on all of them.

Note that the guy who submitted the images to the contest created hundreds of images and spent weeks fine-tuning the prompts and curating the results.

Midjourney/Stable Diffusion/DALL-E are doing to Photoshop what it did to traditional drawing methods. But there's still a human in the loop.


And it's extremely funny since most of these text-to-image models were trained on Shutterstock-watermarked images, to the point that they think humans see the world with a big Shutterstock watermark across it for certain prompts.

From the videos I have seen of it in use, there is no way the prompts were matched to pre-existing material. One prompt was “painting of a goat in the style of the Mona Lisa taking photos with an iPad”, and it spat out 10 images showing exactly that.

I think I read that he actually mis-scales the images on purpose because it gets a higher click-through rate.

It seems like the model was trained on images that were "trending on artstation", see here: https://www.reddit.com/r/DiscoDiffusion/comments/u01cnw/how_...

So it might be that all images have a distinctive look, or traces of it, and this phrase is becoming kind of an inside joke/meme.


Wanted to give it a try just for fun, using the same prompts, base model, and parameters (as far as I can tell), and the first 5 images it created... will probably haunt me in my dreams tonight.

I don't know if it was me misconfiguring it, or if the images in the post were really cherry-picked.
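
For what it's worth, Stable Diffusion results only reproduce if every knob matches, including the random seed, so "same prompts and parameters" is hard to verify. A minimal sketch with diffusers, assuming the v1.5 checkpoint and illustrative settings:

    # Sketch: reproducing a Stable Diffusion image requires pinning the
    # seed along with the prompt, checkpoint, steps, and guidance scale.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    generator = torch.Generator("cuda").manual_seed(1234)  # illustrative seed
    image = pipe(
        "an astronaut riding a horse",  # illustrative prompt
        num_inference_steps=50,
        guidance_scale=7.5,
        generator=generator,
    ).images[0]
    image.save("out.png")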


Yes! So now you can guess why he was loading a different parameter file for each picture! Translation: someone tried many different parameters for each image in this demo and then manually selected the ones that produced a better result. This might still be useful if there is a good UI for users to do the same.
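
The per-picture parameter file pattern is easy to replicate. A hedged sketch assuming each JSON file carries a prompt, seed, and guidance scale; the field names and directory layout are made up:

    # Sketch: replay per-image parameter files, then sweep seeds around
    # each configuration so a human can pick the best output by hand.
    # The JSON field names and paths are hypothetical.
    import glob
    import json

    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    for path in glob.glob("params/*.json"):
        with open(path) as f:
            cfg = json.load(f)
        for seed in range(cfg["seed"], cfg["seed"] + 5):  # small sweep per image
            gen = torch.Generator("cuda").manual_seed(seed)
            img = pipe(
                cfg["prompt"], guidance_scale=cfg["cfg_scale"], generator=gen
            ).images[0]
            img.save(f"candidates/{cfg['name']}_{seed}.png")  # curate by hand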

But then a human comes and selects one from a hundred images. Not to mention the human had to write the prompt, sometimes a very long and explicit one. I'd say that's enough human involvement to be able to use the image as his own.

Using several iconic photos as starting points, we asked ChatGPT for a detailed description of each image and then fed it to DALL·E 3 to create new images. The process was repeated two more times.
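
That describe-then-regenerate loop maps directly onto the OpenAI API. A sketch assuming a vision-capable chat model and DALL·E 3; the model names, starting URL, and iteration count are illustrative:

    # Sketch of the describe -> regenerate loop: a vision model describes
    # the current image, DALL-E 3 redraws from that description, repeated.
    from openai import OpenAI

    client = OpenAI()
    image_url = "https://example.com/iconic-photo.jpg"  # placeholder start

    for generation in range(3):
        desc = client.chat.completions.create(
            model="gpt-4o",  # assumed vision-capable model
            messages=[{"role": "user", "content": [
                {"type": "text", "text": "Describe this image in detail."},
                {"type": "image_url", "image_url": {"url": image_url}},
            ]}],
        ).choices[0].message.content

        image_url = client.images.generate(
            model="dall-e-3", prompt=desc, n=1, size="1024x1024"
        ).data[0].url
        print(f"generation {generation + 1}: {image_url}")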

That made my day. I'm kind of worried, though, that the examples are hand-picked and that by default it doesn't look so nice. I would love to know the algorithm behind this.

From his generative collection these ones really stand out:

https://img.inconvergent.net/img/gen/20170523-193637-305712-...

https://img.inconvergent.net/img/gen/20170520-230701-136920-...

It's almost hard to believe they are computer generated. They just seem so organic.

