tal

joined 1 year ago
[–] tal@lemmy.today 3 points 1 day ago (4 children)

Thanks for trying it out! Both the inpainting and outpainting -- the expansion -- worked better than I'd expected, though I dunno if that's exactly what M0oP0o's after.

[–] tal@lemmy.today 1 points 1 day ago* (last edited 1 day ago) (3 children)

Lots of odd artifacting, slow creation time and yes it had some issues with sailormoon.

It probably isn't worth the effort for most things, but one option might also be -- and I'm not saying that this will work well, but a thought -- using both. That is, if Bing Image Creator can generate images with content that you want but gets some details wrong and can't do inpainting, but Midjourney can do inpainting, it might be possible to take a Bing-generated image that's 90% of what you want and then inpaint the particular detail at issue using Midjourney. The inpainting will use the surrounding image as an input, so it should tend to try to generate similar image.

I'd guess that the problem is that an image generated with one model probably isn't going to be terribly stable in another model -- like, it probably won't converge on exactly the same thing -- but it might be that surrounding content is enough to hint it to do the right thing, if there's enough of that context.

I mean, that's basically -- for a limited case -- how AI upscaling works. It gets an image that the model didn't generate, and then it tries to generate a new image, albeit with only slight "pressure" to modify rather than retain the existing image.

It might produce total garbage, too, but might be worth an experiment.

What I'd probably try to do if I were doing this locally is to feed my starting image into the thing to generate prompt terms that my local model can use to generate a similar-looking image, and include those when doing inpainting, since those prompt terms will be adapted to trying to create a reasonably-similar image using the different model. On Automatic1111, there's an extension called Clip Interrogator that can do this ("image to text").

Searching online, it looks like Midjourney has similar functionality, the /describe command.

https://docs.midjourney.com/docs/describe

It's not magic -- I mean, end of the day, the model can only do what it's been trained on -- but I've found that to be helpful locally, since I'd bet that Bing and Midjourney expect different prompt terms for a given image.

Oh I also tried local generation (forgot the name) and wooooow is my local PC bad at pictures (clearly can’t be my lack of ability it setting it up).

Hmm. Well, that I've done. Like, was the problem that it was slow? I can believe it, but just as a sanity check, if you run on a CPU, pretty much everything is mind-bogglingly slow. Do you know if you were running it on a GPU, and if so, how much VRAM it has? And what you were using (like, Stable Diffusion 1.5, Stable Diffusion XL, Flux, etc?)

[–] tal@lemmy.today 1 points 1 day ago* (last edited 1 day ago) (11 children)

@M0oP0o@mander.xyz, as far as I can tell, you always use Bing Image Creator.

And as far as I can tell, @Thelsim@sh.itjust.works always uses Midjourney.

I don't use either. But as far as I know, neither service currently charges for generation of images. I don't know if there's some sort of different rate-limit that favors one over the other, or another reason to use Bing (perhaps Midjourney's model is intentionally not trained on Sailor Moon?), but I do believe that Midjourney can do a few things that Bing doesn't.

One of those is inpainting. Inpainting, for those who haven't used it, lets one start with an existing image, create a mask that specifies that only part of the image should be regenerated, and then regenerate that part of the image using a specified prompt (which might differ from the prompt used to generate the image as a whole). I know that Thelsim's used this feature before with Midjourney, because she once used it to update an image with some sort of poison witch image with hands over a green glowing pot, so I'm pretty sure that it's available to Midjourney general users.

I know that you recently expressed frustration with Bing's Image Creator's current functionality, wanted more.

Inpainting's time-consuming, but it can let a lot of images be rescued, rather than having to just re-reroll the whole image. Have you tried using Midjourney? Was there anything there that you found made it not acceptable?

[–] tal@lemmy.today 1 points 1 day ago* (last edited 1 day ago) (1 children)

it's just ComfyUI behaving this way. Nothing is loaded, no LLMs

Oh, okay, then I'm probably wrong on VRAM, then. On my system, it needs to actually run the nodes before the VRAM gets allocated. Sorry! I thought I had it...

[–] tal@lemmy.today 3 points 2 days ago* (last edited 2 days ago)

Just realized that I was dyslexic with the ordering directive, which worked well with Flux apparently not understanding it, since I wound up with the desired ordering; I'd intended to write "left to right".

[–] tal@lemmy.today 3 points 2 days ago* (last edited 2 days ago) (1 children)

Not yet! One thing that AI generated images right now are not so good at is maintaining a consistent portrayal of a character from image to image, which is something you want for illustrating a story.

You might be able to do something like that with a 3d modeler to pose characters, generate a wireframe, and then feed that wireframe into ControlNet. Or if you have a huge corpus of existing images of a particular character portrayed in a particular way, you could maybe create new images with them in new situations. But without that, it's hard to go from a text description to many images portrayed in a consistent way. For one image, it works, and for some things, that's fine. But you'd have a hard time doing, say, a graphic novel that way.

I suspect that doing something like that is going to require having models that are actually working with 3D internal representations of the world, rather than 2D, at a bare minimum.

[–] tal@lemmy.today 1 points 2 days ago* (last edited 2 days ago) (3 children)

it starts flipping frames between the nodes and a different set of nodes.

Yeah, I don't know what would cause that. I use it in Firefox.

Maybe try opening it in Chromium or a private window to disable addons (if you have your Firefox install set up not to run addons in private windows?)

I'm still suspicious of resource consumption, either RAM or VRAM. I don't see another reason that you'd suddenly smack into problems when running ComfyUI.

I'm currently running ComfyUI and Firefox and some relatively-light other stuff, and I'm at 23GB RAM used (by processes, not disk caching), so I wouldn't expect that you'd be running into trouble on memory unless you've got some other hefty stuff going on. I run it on a 128GB RAM, 128GB paging NVMe machine, so I've got headroom, but I don't think that you'd need more than what you're running if you're generating stuff on the order of what I am.

goes investigating

Hmm. Currently all of my video memory (24GB) is being used, but I'm assuming that that's because Wayland is caching data or something there. I'm pretty sure that I remember having a lot of free VRAM at some point, though maybe that was in X.

considers

Let me kill off ComfyUI and see how much that frees up. Operating on the assumption that nothing immediately re-grabs the memory, that'd presumably give a ballpark for VRAM consumption.

tries

Hmm. That went down to 1GB for non-ComfyUI stuff like Wayland, so ComfyUI was eating all of that.

I don't know. Maybe it caches something.

experiments further

About 17GB (this number and others not excluding the 1GB for other stuff) while running, down to 15GB after the pass is complete. That was for a 1280x720 image, and I was loading the SwinIR upscaler; while not used, it might be resident in VRAM.

goes to set up a workflow without the upscaler to generate a 512x512 image

Hmm. 21GB while running. I'd guess that ComfyUI might be doing something to try to make use of all free VRAM, like, do more parallel processing.

Lemme try with a Stable Diffusion-based model (Realmixxl) instead of the Flux-based Newreality.

tries

About 10GB. Hmm.

kagis

https://old.reddit.com/r/comfyui/comments/1adhqgy/how_to_run_comfyui_with_mid_vram/

It sounds like ComfyUI also supports the --midvram and --lowvram flags, but that it's supposed to automatically select something reasonable based on your system. I dunno, haven't played with that myself.

tries --lowvram

I peak at about 14GB for ComfyUI at 512x512, was 13GB for most of generation.

tries 1280x720

Up to 15.7GB, down to 13.9GB after generation. No upscaling, just Newreality.

Hmm. So, based on that testing, I wouldn't be incredibly surprised if you might be exhausting your VRAM if you're running Flux on a GPU with 12GB. I'm guessing that it might be running dry on cards below 16GB (keeping in mind that it looks like other stuff is consuming about 1GB for me). I don't think I have a way to simulate running the card with less VRAM than it physically has to see what happens.

Keep in mind that I have no idea what kind of memory management is going on here. It could be that pytorch purges stuff if it's running low and doesn't actually need that much, so these numbers are too conservative. Or it could be that you really do need that much.

Here's a workflow (it generates a landscape painting, something I did a while back) using a Stable Diffusion XL-based model, Realmixxl (note: model and webpage includes NSFW content), which ran with what looked like maximum VRAM usage of about 10GB on my system using the attached workflow prompt/settings. You don't have to use Realmixxl, if you have another model, should be able to just choose that other one. But maybe try running it, see if those problems go away? Because if that works without issues, that'd make me suspicious that you're running dry on VRAM.

realmixxx.json.xz.base64/Td6WFoAAATm1rRGBMDODKdlIQEWAAAAAAAAAJBbwA/gMqYGRl0APYKAFxwti8poPaQKsgzf7gNj HOV2cLGoVLRUIxJu+Mk99kmS9PZ9/aKzcWYFHurbdYORPrA+/NX4nRVi/aTnEFuG5YjSEp0pkaGI CQDQpU2cQJVvbLOVQkE/8eb+nYPjBdD/2F6iieDiqxnd414rxO4yDpswrkVGaHmXOJheZAle8f6d 3MWIGkQGaLsQHSly8COMYvcV4OF1aqOwr9aNIBr8MjflhnuwrpPIP0jdlp+CJEoFM9a21B9XUedf VMUQNT9ILtmejaAHkkHu3IAhRShlONNqrea4yYBfdSLRYELtrB8Gp9MXN63qLW582DjC9zsG5s65 tRHRfW+q7lbZxkOVt4B21lYlrctxReIqyenZ9xKs9RA9BXCV6imysPX4W2J3g1XwpdMBWxRan3Pq yX5bD9e4wehtqz0XzM38BLL3+oDneO83P7mHO6Cf6LcLWNzZlLcmpvaznZR1weft1wsCN1nbuAt5 PJVweTW+s1NEhJe+awSofX+fFMG8IfNGt5tGWU5W3PuthZlBsYD4l5hkRilB8Jf7lTL60kMKv9uh pXv5Xuoo9PPj2Ot2YTHJHpsf0jjT/N5Z65v9CDjsdh+gJ5ZzH8vFYtWlD1K6/rIH3kjPas23ERFU xoCcYk7R6uxzjZMfdxSy1Mgo6/VqC0ZX+uSzfLok1LLYA7RcppeyY4c/fEpcbOLfYCEr9V+bwI4F VDwzBENC412i8JTF8KzzqA0fPF8Q4MwAeBFuJjVq1glsFgYqTpihnw5jVc5UfALRSXS2vjQR78v5 XDmiK7EvUIinqDJjmCzV+dpnTbjBAURsZNgCU+IJEQeggVybB+DkjOGnr/iIjvaSylO3vu9kq3Kn Dhzd/kzXutPecPtterHkiPjJI+5a9nxJPMLMuAqmnsh2sk7LX6OWHssHhxd/b2O2Y4/Ej0WoIZlf GD2gOt57hHvTrQ/HaG1AA8wjbHsZXWW9MXbJtDxZbECLIYGfPW2tQCfBaqYlxGXurrhOfZlKPUVx K9etDItoDHdwUxeM4HbCdptOjcSWAYgjwcQ4gI3Ook/5YLRbL+4rIgOIwz643v/bMh2jiLa4MEYm 9a4O8GL4xED3kvRQSgt0SkkIRIHO1AJ042TQC450OKwEtfFgBpmFQ+utgUOObjA409jIEhMoOIeP kDqv62f0Uu4qojiX7+o8rrWp/knAkDoFWam1E3ZKEiCINRfmRAMqTsPr8Wq/TQZM5OKtMGBLK9LY GxLOOUBCahU5iNPm01X3STNRmQKtATPgqPcszNeLONnZqcWusbZKvwkCoX4Z75T+s+852oo65Li6 7WQ3SaDBrs47qXeUobVIOjlXO2ln2oRRtTRfbk7/gD6K6s5kBjPexHEEIGseJysilyHyT2+VMtSy cyET83Exi5KEnrtl7XgMI4GM1tDeaq6anNdW1VgXdS4ypk8xqHTpQgszuJCgh3ID5pfvbHzzX0A7 zC5A+4oGk98ihe0dJc+KLzigJuZLk7jX5A7sGkBtht7oKjH8qrYM//DbQXkZbI06h/FP+2aBz5/t U3zTsSHyoU3LwihFOj0TA+DKnZUnm4TJtX6ABQaJPTYwHgQJ/B77VI9A+RYf7qd9o4cGaGrLoOES QdGPFUOqO0vL9EkpTsxgCEjxApBIw1gTCiaBF8Dofp6vBrd1zY1mXP9p1UunaaFZtdmx/vrWkLXQ iO09P6waY+6daKtZ7i+3j0WGvBFHx32toDgd94wGWXBa+hfEEK3d6kq8eGRWJC+OEL9KgUrrN4ki vwPjGe/1DXxzPIvZrMP2BtoxO34E9VuvsnmofW3kZvtLZXC+97RznZ5nIpG4Vk+uOPs1ne/s1UD3 x0vyTviwiK+lFIu5T3NdxFFssClZSDyFUkzCZUpbsLjcH3dzbwEDX4Vjq6rAz2IbXUGU6aTQ7RO1 Q1iUHaCqHZzNJEKKFcBaL/IGbmeOPUZJ7G3TbEYcMhBPtsmYSwNJkQ0cGj/KKqPF6fxpvNEt+QNh isgyNP+AuP0xxQgwXbxW2kO/3Y70L5+eNs2L8u0gJBHevYTAebv/mORBcNcs8hpFVZLOAahajv0h zj++ssD9BcgBTVMEC+knn0HjVaRjIW3UPsDApNjIsF7h06hWAGG79VGJb3mQ6PcwQAAAALZxoY8E al4jAAHqDKdlAABPKdovscRn+wIAAAAABFla

EDIT: Keep in mind that I'm not an expert on resource consumption on this, haven't read material about what requirements are, and there may be good material out there covering it. This is my ad-hoc, five-minutes-or-so-of testing; my own solution was mostly to just throw hardware at the problem, so I haven't spent a lot of time optimizing workflows for VRAM consumption.

EDIT2: Some of the systems (Automatic1111 I know, dunno about ComfyUI) are also capable, IIRC, of running at reduced precision, which can reduce VRAM usage on some GPUs (though it will affect the output slightly, won't perfectly reproduce a workflow), so I'm not saying that the numbers I give are hard lower limits; might be possible to configure a system to operate with less VRAM in some other ways. Like I said, I haven't spent a lot of time trying to drive down ComfyUI VRAM usage.

[–] tal@lemmy.today 2 points 2 days ago (4 children)

Full Size

UI: ComfyUI

Model: STOIQNewrealityFLUXSD_F1DAlpha

A cute, adorable, loveable, happy cave spider.

The image is an illustration.

cute-cave-spider-workflow.json.xz.base64/Td6WFoAAATm1rRGBMC0DMpYIQEWAAAAAAAAALNZm2LgLEkGLF0APYKAFxwti8poPaQKsgzf7gNj HOV2cM7dN97GoIvoTF/iSMYEQoJZBJzI3UDxBXrl3MvcMb5e6h3lATBYjawf1klb5xXG0y783VSu rsL7JTqCcpc7ml6EBcQysA/lczXi4/z4NEU2MzJ6gMxLs5qhHCWRzV2fkgD1H3KHbl2of4CNliwR JNef6hoZP4HFMLybTP66HNwa0oszmGtepDwWHXSrlYVAso8imkQU11LM4qXGRXQKKo6EvjyTeYPG eZvQb4FxhvQIkQHLQWfLOyENpBMbl2mOgW0siOFXT72aS9mBHSeYP8AzU1giF+yk4IQh8W8UCh+k wncXTpG2M2mdiF0t0fMlAoOmMD0dDDSlpTIhgkEUgqjuFzi7YRnIgI9xJ4RbMQqrVgcEj4C8LWsP q8Pt9DnxVUmUQXwS04QFJpghbh6a73UsV1kjFiQZ9yo4RhR5ouTehVLXwyXn9E0/MW3nIZGdW6cW BqD/FwjDedfDigE0VsQAWD9OfH1I4pjkNePpmoHErOG2KQ9vec7Rd5iPWUIgxnBiNdg8Hczv3Ggy e02Chw1yzKXcTaIVW6tMxm5H62cyRRwy5a0Rn3KVLVQTyaZt2f+GplozjiLe3T1X8E7sKPoVWNsX cVGRPGouZdNO00VQlpmaKnpf9BXG+ujVGn0WCzsTBZZLqJNfGE5CwOMGqqib2L3pNFesoaku2U4n andtH2bHkiNNf1DpDmkAuNuGvmKRHfBXHVrU6+jcUbAjBZxe4kYsPP2+f5vJqNIWRPankSGF3+GF xjD4ntouwO3ruBHQlRMDf0Lcd6qy4ICW3OakgceBbk2vT42s9thrPuF779tKQ63RSN+nL/R9GyOb Tr7qEL71NSRqsK/hDhb2+lrc6p6LxPOPbHZJLDcLWunTtG8EayOjlE7K4iTB0AKloQg7xShjlDP8 ZQSTf3PwGLUBdJuv5aaqE+2payzFqCXxbp9849RL9f7mnarBlDj9E30qLRZZd9INv06Jk1OddXYY VGS2pDNDNMoFDmgywFxqd1ZWT4JZpPxUW1xUc/8P2FWs0W/o/EiCL99xDIa+LZn2UA8x5pDhZCbt +i3JcwbL8+XTDQHIdIp9RdN1eouuKtuhaklJlHcREEql6EQqldpf9htC6YtIZDdiJJViVSguOt7r iAcrVELFRGbvyrXkWzY7Jr7dt1nOGQrj9GDY3X2habyMXBD/3IIP3uwg05ZDXUVgYqV+fEYqAtE3 IuTiSDFjoLVr7IlByn99cUvfzMLsHud1K7EVi1E0DhNsf68mMyV177Fm3P4y2E4qc7XwrFAJ6GyP N5AVGcYSoozQ34UofUrpmmkU1ZDIQlLBZHICC7wGHkH+mINdXHB9/nOBpcJ/WwfZ5ruyjAh94GG7 mpvrskmGuWP3r2LgV8dPqf03Tnp7Jbe/NNUZKQ4cwh92/BP79cPY/T6iwPEZZS9ofebllBwwuM77 1kQ63ShrnI9yydhDKORd9qFNJSWd2/X/7VZv6gdczosU8YVaxq14UnVawJnkB4S4cN6Fvlwg9HBg IXDP9ITIH7lxG4iUY3gsEzUi8lgs6HYz5FMIYc82jfFnsnK3olAYB9cUPdE/K2nZKAHGRNoBmyvA pQC5dwZhurNZ5G32YRD8MVn4dRuGIllRRhukNVp1TUTebKqbgnsJZvZO1s2GNgtHlt/Dnn9l+jHy d8Ar4Y2gcxE0UobKgrY+1ONGwCWUxD4RZdGqZHdND88qsWd716JfxAwFcZ5r5WtzSkj6O4T+RC6E sTpxtctOPbh0pV+HiBIoaqkmRXmAzbwZeA9QXvJu6uPNXSUAXn6c9HfLlNabpdii7Gr31mNyNOtw BSxHDpWGyEvlXWpr3udsC4eypHZtRGl/iJsAAD0CzTvAZNWvDfbmkrAY8Ck7TEdvJ35uujJzDZVV X15wJ74T7ADWsjD/ZCTSjoq3ylt1nO1woSAhwpxdQS8S6rm8owXfQkWc4334/RIAGB6ZcrdNjRjg ORZw05NY5zdg9hE2fdy50SNVgvN2eIwTjp4bsfNt04z5te/h+InrVpvzFFjBL5SWLlOckqy7yoeB AHL0Slu0XJQhQb0aMfMGN8G46K1P5gAAvY1DGN46jOAAAdAMylgAAJlUxG+xxGf7AgAAAAAEWVo=

[–] tal@lemmy.today 1 points 2 days ago* (last edited 2 days ago) (5 children)

When a model is initially being loaded, I see slowdown, but once that has happened, I don't. I see that with Automatic1111 as well. Once it's been loaded, though, I don't get that. I regularly do (non-GPU-using) stuff on another workspace when rendering, can't detect any slowdown.

So I don't know what might be the cause. Maybe memory exhaustion? A system that's paging like mad might do that, I guess.

As to an alternative, it depends on what you want to do.

If you've never done local GPU-based image generation, then Automatic1111 is probably the most-widely-used UI (albeit the oldest).

If you want to run Flux and Flux-derived models -- which I'm using to generate my above image -- I believe I recall reading that while Automatic1111 cannot run them -- and maybe that's changed, have not been monitoring the situation -- the Forge UI can do so as well. But I've never used it, so I can't provide any real guidance as to setup.

kagis

Yeah, looks like Automatic1111 can't do Flux:

https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/16311

And it looks like Forge can indeed run Flux:

https://sandner.art/flux1-in-forge-ui-setup-guide-with-sdsdxl-tips/

If you're short of VRAM or RAM or something, though, I don't know if Forge will do better than ComfyUI. I think that I might at least try to diagnose what is causing the issue first, as there are some things that can be done to reduce resource usage, like generating images at lower resolution and relying more-heavily on tile-based upscaling. With at least some of the systems, haven't played around with ComfyUI here, there are also some command-line options to reduce VRAM usage in exchange for longer compute time, like --medvram or --lowvram in Automatic1111.

I don't think that there's a platform-agnostic way to see VRAM usage. I use a Radeon card on Linux, and there, the radeontop command will show VRAM usage. But I don't know what tools one would use in, say, Windows to look up the same numbers.

On Linux, top will show regular memory usage, can hit "M" to sort by memory usage. I'm pretty out of date in Windows or MacOS -- probably Task Manager or mmc on Windows and maybe top on MacOS as well? You may know better then me if you're accustomed to that platform.

I can maybe try to give some better suggestions if you can list any of the OS being used, what GPU you're running it on, and if you can, how much VRAM and RAM is on the system and if you can determine how much is being used.

[–] tal@lemmy.today 1 points 3 days ago* (last edited 2 days ago)

despite editing the .sh file to point to the older tarballed Python version as advised on Github, it still tells me it uses the most up to date one that's installed system wide and thus can't install pytorch.

Can you paste your commands and output?

If you want, maybe on !imageai@sh.itjust.works, since I think that people seeing how to get Automatic1111 set up might help others.

I've set it up myself, and I don't mind taking a stab at getting it working, especially if it might help get others over the hump to a local Automatic1111 installation.

[–] tal@lemmy.today 8 points 3 days ago* (last edited 2 days ago) (1 children)

venv nonsense

I mean, the fact that it isn't more end-user invisible to me is annoying, and I wish that it could also include a version of Python, but I think that venv is pretty reasonable. It handles non-systemwide library versioning in what I'd call a reasonably straightforward way. Once you know how to do it, works the same way for each Python program.

Honestly, if there were just a frontend on venv that set up any missing environment and activated the venv, I'd be fine with it.

And I don't do much Python development, so this isn't from a "Python awesome" standpoint.

[–] tal@lemmy.today 2 points 3 days ago* (last edited 3 days ago)

I called Hillary "Hillary", but that's to distinguish her from Bill Clinton, who I called "Clinton".

Honestly, you have to be a very commonly-used name before I'm going to use a single name for general purposes at all rather than a full name, so the set of people who have the chance to get into the "one name club" is very small.

49
Cats (lemmy.today)
 

UI: ComfyUI

Model: STOIQNewrealityFLUXSD_F1DAlpha

A huge, dangerous black jaguar lying next to a relaxed, white long-haired Angora cat. In the background is a blazing fireplace. The floor is black cherry.

cats-workflow.json.xz.base64/Td6WFoAAATm1rRGBMCCCpk6IQEWAAAAAAAAAGkuN9LgHRgE+l0APYKAFxwti8poPaQKsgzf7gNj HOV2cLF9DeDmoIvoTF/cVPw0agdxNfru2YvPSKLvRDPlN2F1d8yba+YwJMN/u3et9crpZQe0xf0X e4TlGW5cYDH+Fmnw78HMia8eHxjr11hvN1CjKJECZj6cxWQr5ky0YKVzeOErkrOUDMifScNF8ar7 0lZixjiV/oEgLa3gDczZdHFJzhbm4xDBx5gI5tJIx8m8EkEDHNaoSmTjv4TSsua4bQicgw6/mTPy NkufHUtCNV1ywEVaVQHw5oeUatKneTLYzgAlPF3bDx4oG8TMyis8lSQ2elf9IezyP88Yfsmk5bw+ 46cSJz2rH2m7hR8gsPEoIB1foZnaUp25ES9dAc60QtWD3o89wmLVDW1sdHDu+FwFfL5mSg15TjPN EowKss1+cvVmWJQGxEl9Mcy2SZ5/mkns2GnEQaa9fV2bGcs9H4YOoA2YusbSoGDFnFsWqbbjTTXe 8b+iXBG93rklvtPwq0bb3ARsYFvU/ir0eLaPu+EHrENW8BwYwf9VZU/DpvT1dbkaT1Xj0HYo7Yf6 go4T5+8gMD5BLYyXHL/ZMXJc1a2UdXca/xjZsexOWPO93SHnGkfdrWIF3lrISg31htvST2b/Rwpf BxlZWUoDsirV3KcPqRQNAZqfIEaiqr7tdbklorUVD809JClc7BF4GzePHeus7gir5WvvDFOVuPpr fJ1pn4U+FFtlml5gm2WpNycSWitGL48CgEx35jF3MFhYSwC3OE6/qMiSCXzQ6M851M8dU6ANsK9g 32uD6AktHnFfJxazyKoRqL8jeE9c7VBF4tI4j2MChFApXR/b7Xr6OBwOrbBa4K58WlzzC8XiUThd d56jx+23gzKwk6FuVXkoawOH0w1L0aLZXoidTJl8SVQz79gPwebLWWtPxm3Ut+fMTfI30l+qiSId emzkF4mZYQVVURSw4EWDhN1NW+Eucp+RezvOA3Jc+5LjrZp32ltjAOZ/IaRAKQa9C+uVxeFCkG/a rFi2A1SaWF63bVpkbYRmruDwBo3EZzMST9X5UInTwt1RX7Uxv2GEZYPWBJHhzV5FIDfWgqU1bfob QriecynkcRIEq4UyqxxRBEsRpmm5MfZ9U292RCGZ8SJBouSsUqGMo79IC6JLd+y51nYtXLQ7MhJs L+ADGtR+gzkILaOACaIIFXN7dsOEFHvi88ArliJDQZs095OQFgnlvWoPlwxpGCZZoRebN5BU8V7t Kb1auQ7/qrsH4Emf9hp4RYKgYMkdIu2j5NoGGx6P6FxtG2rj4/jGo+ys6vhfOzYN9NyZ1x2pOHZy c3E5S41jhP18eZL+lfIcG4ecuGAlLhJUYSDBNDN3OFjXEmFPtHcz+EerdlMHvMiY/2FC42kBJbct iBloMDRZKX8vC8mX03lNOjg6eoJ8SWJL/+5cIoPCBV1QweYP1Hcdt9M+zJ4UBtN694nvWzPbfwIV evxhrlwbEIncyrLXymwieq0Od25+mjdauruOd3Jia6OJrLwbkmUf/avzFGkAmHe9wQ/zvFPO7pyU Nz+fcJ1hGLZ61V3yVahGuvMnfvNU8/Oy6GCq1h1VF+SpZi/veyaj8GbsJdmTTWN8WlDncjbr0CoH TJ3UAzut+LaPd6BOdX7UQaUiir6veM/jPkeF7Za1KNPcOyd8cGaAxao7e0jRytShVqxlsOyu4T44 aEAAAAAu+a5LxBbsiAABngqZOgAAJ3U9R7HEZ/sCAAAAAARZWg==

 

UI: ComfyUI

Model: STOIQNewrealityFLUXSD_F1DAlpha

A delicious slice of pumpkin pie. The pie is topped with whipped cream that is sprinkled with cinnamon and nutmeg. There are delicious chocolate chip cookies and a delicious scoop of vanilla ice cream placed next to the slice of pie. The ice cream is topped with crushed candy cane sprinkles.

dessert-workflow.json.xz.base64/Td6WFoAAATm1rRGBMCtCqU7IQEWAAAAAAAAAAJGapngHaQFJV0APYKAFxwti8poPaQKsgzf7gNj HOV2cLF9DeDmoIvoTF/cVPw0agdxNfru2YvPSKLvRDPlN2F1d8yba+YwJMN/u3et9crpZQe0xf0X e4TlGW5cYDH+Fmnw78HMia8eHxjr11hvN1CjKJECZj6cxWQr5ky0YKVzeOErkrOUDMifScNF8ar7 0lZixjiV/oEgLa3gDczZdHFJzhbm4xDBx5gI5tJIx8m8EkEDHNaoSmTjv4TSsua4bQicgw6/mTPy NkufHUtCNV1ywEVaVQHw5oeUatKneTLYzgAlPF3bDx4oG8TMyis8lSQ2elf9IezyP88Yfsmk5bw+ 46cSJz2rH2m7hR8gsPEoIB1foZnaUp25ES9dAc60QtWD3o89wmLVDW1sdHDu+FwFfL5mSg15TjPN EowKss1+cvVmWJQGxEl9Mcy2SZ5/mkns2GnEQaa9fV2bGcs9H4YOoA2YusbSoGDFnFsWqbbjTTXe 8b+iXBG93rklvtPwq0bb3ARsYFvU/ir0eLaPu+EHrENW8BwYwf9VZU/DpvT1dbkaT1Xj0HYo7Yf6 go4T5+8gMD5BLYyXHL/ZMXJc1a2UdXca/xjZsexOWPO93SHnGkfdrWIF3lrISg31htvST2b/Rwpf BxlZWUoDsirV3KcPqRQNAZqfIEaiqr7tdbklorUVD809JClc7BF4GzePHeus7gir5WvvDFOVuPpr fJ1pn4U+FFtlml5gm2WpNycSWitGL48CgEx35jF3MFhYSwC3OE6/qMiSCXzQ6M851M8dU6ANsK9g 32uD6AktHnFfJxazyKoRqL8jeE9c7VBF4tI4j2MChFApXR/b7Xr6OBwOrbBa4K58WlzzC8XiUThd d56jx+23gzKwk6FuVXkoawOH0w1L0aLZXoidTJl8SVQz79gPwebLWWtPxm3Ut+fMTfI30l+qiSId emzkF4mZYQVVURSw4EWDhN1NW+Eucp+RezvOA3Jc+5LjrZp32ltjAOZ/IaRAKQa9C+uVxeFCkG/a rFi2A1SaWF63FwEhRCRG3jdZ3F1j1IQDYcHDwCxGADQuHBCMqUWxBbIMXsvnjd33bq3SuK91bxrk 4NenMfw0rwfNiZnxEam16zjwCdqumnJfnm2hQJyZD6k6tdyjGJTpvYEyYx36prbttuQDAcfLc/6Y OdahSHMtwl0dmFOZMXAaR6mUaPLRM/BcZ41s0Ay51OfkcSLP+u0mlKZTfblIesQSvpXI1PpZNudr b7JOrOJqb3WA6ruV0jd9/o0QtlhGBtsOjauvDu8mfdpqEx1+7/QM4bQeFoCiaP6sM0rwahuF6xE/ 9KNO9QNS47LTI/VBfpA5aZQDK8ngcQjq+3pp8yKIMLLjctn9i2XnmkJUKKknOK1D+7e9/v+7y9oU rw8khxkeuFCXcwwapg5CjlshYvXwYv8fB96n0W8CQNsK6qzXVfyMiT61Gm0lO2xwnofRo3sYEqrw TJc4Mzp9rEM5oXlnVDwhOBgU5smdAU//1dknB9dO6/sggHVwhtCDdWkvxRzSkr/ItyAoEEQZQGUH APwtXvOC5CyUcLToXVCy0JvarJY1Zkub8p2ZaNXCrCrGJQwFwGGeY0j5HJ2nzlbQ/DFeKnzCfgb9 H1oF4HdNxi/9pPyRGjISMQA07+E7S7Kb+hlfFmb7jTVf/lFjLvnI8J/e3mHkytRLl6YnHMIbTsBv 0z/ykcoAv0mw44LBlfH0W9wB79yaJ6/RJYAOR+escJzTytKN5NRa1AwGA4bAAAAAACTNqdh29uQ7 AAHJCqU7AABGYIntscRn+wIAAAAABFla

11
Ozymandias (lemmy.today)
submitted 1 week ago* (last edited 1 week ago) by tal@lemmy.today to c/imageai@sh.itjust.works
 

Inspired by this post trying to illustrate a poem, I wanted to see if Flux could understand poetic language in Ozymandias. I plugged the whole poem in. I think it did an okay job!

Ozymandias, by Percy Bysshe Shelley:

I met a traveller from an antique land,
Who said—“Two vast and trunkless legs of stone
Stand in the desert. . . . Near them, on the sand,
Half sunk a shattered visage lies, whose frown,
And wrinkled lip, and sneer of cold command,
Tell that its sculptor well those passions read
Which yet survive, stamped on these lifeless things,
The hand that mocked them, and the heart that fed;
And on the pedestal, these words appear:
My name is Ozymandias, King of Kings;
Look on my Works, ye Mighty, and despair!
Nothing beside remains. Round the decay
Of that colossal Wreck, boundless and bare
The lone and level sands stretch far away.”

workflow.json.xz.base64/Td6WFoAAATm1rRGBMCTDII+IQEWAAAAAAAAAN4WPETgHwEGC10APYKAFxwti8poPaQKsgzf7gNj HOV2cLF9DeDmoIvoTF/cVPw0agdxNfru2YvPSKLvRDPlN2F1d8yba+YwJMN/u3et9crpZQe0xf0X e4TlGW5cYDH+Fmnw78HMia8eHxjr11hvN1CjKJECZj6cxWQr5ky0YKVzeOErkrOUDMifScNF8ar7 0lZixjiV/oEgLa3gDczZdHFJzhbm4xDBx5gI5tJIx8m8EkEDHNaoSmTjv4TSsua4bQicgw6/mTPy NkufHUtCNV1ywEVaVQHw5oeUatKneTLYzgAlPF3bDx4oG8TMyis8lSQ2elf9IezyP88Yfsmk5bw+ 46cSJz2rH2m7hR8gsPEoIB1foZnaUp25ES9dAc60QtWD3o89wmLVDW1sdHDu+FwFfL5mSg15TjPN EowKss1+cvVmWJQGxEl9Mcy2SZ5/mkns2GnEQaa9fV2bGcs9H4YOoA2YusbSoGDFnFsWqbbjTTXe 8b+iXBG93rklvtPwq0bb3ARsYFvU/ir0eLaPu+EHrENW8BwYwf9VZU/DpvT1dbkaT1Xj0HYo7Yf6 go4T5+8gMD5BLYyXHL/ZMXJc1a2UdXca/xjZsexOWPO93SHnGkfdrWIF3lrISg31htvST2b/Rwpf BxlZWUoDsirV3KcPqRQNAZqfIEaiqr7tdbklorUVD809JClc7BF4GzePHeus7gir5WvvDFOVuPp7 7sUhDfu2AfSYgdXMhSC1cvLCBFqnsfozVlAhsje/Zb5RF/UcismjrjGlvTvg8bgD5YjQqjtib0IQ vZ4xn87olLgXfTtiKyvE8SW8tPzNd3uF2OBdp81bTKDVWmOYf5D3ZXfTK4N43NN3STYKzJnlmd5R cO7rh22x5QS1J/J81kKlLIsiwDuwTIuEzq/wxgTwCdYYz8P2QhSjdmuFwcRkQdlOtMoiBO84Im9y kXEkNEm/L//Nd22YSo3fJRVYDcjSVIIfZN4JtDgJyN1QdYW9+ks38vLy8MJG0JPRaWw+yRmBmmjm 0s/PerspQ9Awz+kxo0JCgNd3TgOAjS+hjBqmgP632drz15SgCuVxS3VnAx85xu3S4fE4YbuLQa8m SvSeBnsInkhzhcSPWMcVD+Rv9lNA2NUSbmUv9ilTnAhVcr5vcS+F2n23Zr8+33v6FY4lhgRTeXCQ L62pjy61Ter3gHvV3z5Fz3SPz4aCz4U3aWivl777kANSUlZ2UYfz5R6dgwUb7Wf4LV3t6Eo8RmAO Kf23NmfkLD2Tf7k6yjl/44ZG5ITQsCno3AeBF19deyZyWWvtNRI1j0WQJD26YLH5UAR+f9pt/ZaC s0BUkYsB1DI8DUVfLWS9ITJiLZtE/xDGqdr7hGy4RkZ5J0zyrzsCal+WQICnDZ96gaoRYjaEbT5P IiANMwmuuBuPuxcMn2BZImCEgZsIk899i+Cb0FVOxGzBfhetGf8PDb/lqs7SteT7Dez7C1L2jlYL oGa5iUVaHNBNZx+Vq8I9jceDHPj5eIr4rpsaK2CS8aWzRHzl8I7YYVmuRAAEOVgupEYmJXDJfKUM fFbldJLR5jjbqmi6q8se3xiW8GYCwxotuZG9h4E+eyrX9l4pM88KwDo7UEIkkSLqWCmVTEI6hNJl o320oKWdQYMm7nf1x+Qgmyhp3f/otgzRIUTZk6/9kH/lviF+Dw2Ps7i6CGXIxH9hO47njqObwhR5 LB4ZiMIVTAyHMFKG0nHp2hj8ceaauSvFlUfrM7HibRyV/rIQD4z2gmzwDErbg7upikvZNVBhz6Tv 7kTd2f+I6cmlTkfMbiTjnpjZFegLmWWPWUk68FowLtLTf5gZZRIHJeps3neiAD6V74qV6M5X9d1X QXcAMmdRDJEsg9QuPMAI3fECbCroY4cepg9s2BvrKdiU9Va8X/VKRmkCaTroLLKvaF3qrzwWRwPC +NeXyKPA+aGtOzueEgl2nj92spldUsLpzbLWzAeXAe/YxcjoqWtOOZbSkxSSvHfNAYGh6EnL/582 RYgMGf1P62rc+GZCOlv7uMpR6Wen1ZNUB5JpkCys10Fy1NcoLlzgkZUVuhFTCwAAAOgmLuRZv2SB AAGvDII+AADNLEqHscRn+wIAAAAABFla

 

Thelsim did a couple of Tarot-style cards a while back. I also just finally got Flux set up in ComfyUI -- had started a long time back, and dropped it.

Flux is a ComfyUI model that's pretty popular over on Reddit, both for the quality and because it uses English-style prompts rather than just a list of comma-separated prompt terms. I remembered Thelsim's project, wanted to see if I could turn out a full set of photographic-style Major Arcana in the first day using it. Turns out...yes! Usually when running Stable Diffusion, I'll generate maybe 20 images and pick the best, but this typically had something reasonable on the first try. It's certainly not flawless -- there are quirks in the image, but for anyone else thinking about playing with Flux, I wanted to put this out there, because I was unexpectedly happy with it, especially given that I've no experience at all with it. I would totally try and get it set up if you have a local generation setup!

Text was added with a script and ImageMagick, not in ComfyUI.

To get some kind of consistent appearance, I appended to each prompt "The theme is magical fantasy horror. The colors are blue, white, red, orange, and black. The photograph was taken with a Nikon D850." I also used "Photograph...at night" on each.

https://lemmy.today/pictrs/image/67cd880f-ed36-4074-b12b-3e509b0dafa2.png

Photograph of the Grim Reaper at night in a dark, gloomy field. The Grim Reaper is riding a white horse. The Grim Reaper is holding a simple black scythe. The Grim Reaper's hood only contains blackness. The sky is full of stars. The Grim Reaper is
wearing black gloves. The Grim Reaper is facing the camera.

https://lemmy.today/pictrs/image/2f0c296e-da19-4f31-8f35-cac07ce372dc.png

An photograph of a huge angel in the clouds playing a medieval trumpet at night. The angel is blowing into the trumpet. The angel is in profile. The zombies are climbing out of their graves in a graveyard. The dead are rising. There are snowy
mountains in the background.

https://lemmy.today/pictrs/image/b99ac2e5-a285-45f9-abc8-4267ae10398a.png

Photograph of a stern-looking young woman wearing a white blindfold and a toga sitting on a throne at night. The woman's right hand is holding a set of scales aloft. There is a longsword lying by the woman's feet. The woman is facing the scales.

Should really have a sword in one hand, scales in the other, but I wasn't able to quickly get that working; probably need more experience with Flux.

https://lemmy.today/pictrs/image/3ed8cbe7-e97b-4f67-849e-19bd798caaae.png

Photograph of an angel at night. The angel is pouring glowing liquid from one large goblet in their left hand into a goblet in their right hand. The angel has a halo.

https://lemmy.today/pictrs/image/18a9c09b-dcc1-46da-87d8-a0c09c851170.png

Photograph of a man wearing armor riding a Roman war chariot at night. The chariot is pulled by two galloping horses wearing barding. The horse on the left is white, and the horse on the right is black. The chariot is charging the camera. The
photograph is an action shot. The man is holding reins.

https://lemmy.today/pictrs/image/4558e24b-0b45-433f-8548-6bb1873bb2d1.png

Photograph of the Devil at night. The Devil is crouching on a pedestal. There are two nude demons sitting at the base of the pedestal. The demon in the lower-right quadrant of the photograph is male. The demon in the lower-left quadrant of the
photograph is female. The Devil is holding a flaming torch in his hand.

It did look like Flux understands directives relative to the portion of the image here ("quadrant"). I wasn't able to get the same technique going with Justice, though.

https://lemmy.today/pictrs/image/80bd8ca8-047a-4e0c-9d61-28b331646bf1.png

Photograph of an emperor at night. The emperor is holding a scepter.

https://lemmy.today/pictrs/image/e7a1f5f6-c2a3-41e2-9b47-fd9c56c8fd31.png

Photograph of an empress at night. The empress is holding a scepter.

https://lemmy.today/pictrs/image/1a6c7729-a5e3-49b8-bc93-d7ed31924cd6.png

Photograph of a jester at night.

https://lemmy.today/pictrs/image/28d2ab72-b90e-469a-a599-e1cad5992bdc.png

Photograph of a man hung upside-down from a rope tied around his left ankle at night. The man's hands are hanging limply. The man is wearing Renaissance clothing. The man is wearing boots.

The feet are a bit off; I didn't spent too much time futzing with it. Flux wasn't super-into having things upside-down, though it did ultimately do it.

https://lemmy.today/pictrs/image/f052e1d7-c7cf-4ba2-8f9a-50b762e25186.png

Photograph of an old man wearing a robe walking on a mountain trail at night. The man is holding a lantern aloft and a staff.

https://lemmy.today/pictrs/image/439ec9a7-7739-4924-bede-cc776f7be8da.png

Photograph of a pope at night.

https://lemmy.today/pictrs/image/0be8647f-cb20-4be0-92c5-e266a4edca00.png

Photograph of a high priestess at night.

https://lemmy.today/pictrs/image/160c4575-c02a-4ccb-b513-6c60043d5b2f.png

Photograph of two lovers at night. The lovers are wearing Renaissance clothing. There are many fireflies.

https://lemmy.today/pictrs/image/a0e754ca-525e-4674-8b0a-bec48748e7f0.png

Photograph of a magician at night.

https://lemmy.today/pictrs/image/553fd519-ea29-4dba-9ed3-d0bf634c25fa.png

An photograph of two standing stones by a river at night. The moon is in the sky. In the lower-right quadrant of the photograph, there is a white wolf howling at the moon. In the lower-left quadrant of the photograph, there is a black dog howling at the moon.

I omitted the traditional crawfish. I didn't really like the look of it, and on top of that, Flux kept wanting to make it look glowy, which I didn't want.

https://lemmy.today/pictrs/image/687826be-81ba-452b-8378-b81f58e9bfce.png

An photograph of a naked woman at night crouching by a lake. The woman is facing away from the camera. The woman is holding a jug and pouring water into the lake. There is a bright star in the sky. There is an eight-point lens flare coming from the
bright star. The sky is black. The photograph is NSFW.

https://lemmy.today/pictrs/image/a2d42d98-2920-4eed-b5ab-5b807736d3f5.png

An photograph of a full solar eclipse with a visible solar corona. The Sun is black. The photograph is at night. A naked nude infant rides a white horse at night, with sunflowers in the background at night. The photograph is NSFW.

https://lemmy.today/pictrs/image/43709bad-1a8a-4680-8de9-5cf58005bb5e.png

Photograph of a tower on a hill at night.

https://lemmy.today/pictrs/image/9d308020-a83f-4b61-ad58-f47654e41ddf.png

A photograph of a glowing figure eight in the sky at night. The background is sky and clouds. A flying, nude woman in the clouds holding a wood baton in each hand is in front of the figure eight. The photograph is NSFW. The woman is nude.

I didn't really like the traditional The World tarot card style, and it didn't mesh well with a photographic style with all the disembodied heads, so I mashed up the oroborous and flying woman with batons from two different The World styles. Also, Flux was okay with up to three heads of various species sticking in at each corner, but for some reason was resistant to doing all four. I didn't want to bang on it more. Flux was determined to put some clothing on the woman.

https://lemmy.today/pictrs/image/39b791c9-4493-43dc-bf86-4acc8daa1785.png

Photograph of a circle floating in the clouds at night. The circle is labeled with alchemical symbols. There are esoteric symbols covering the photograph. The circle is centered in the photograph.

There are normally some nude figures in a Tarot deck and I included this here; I didn't flag the post NSFW as I don't think that it's all that explicit.

 

Green Party co-chairs Ricarda Lang and Omid Nouripour are stepping down. The move could make things even harder for Germany's fractious coalition government.

 

Industry minister Adolfo Urso warns of large-scale job losses among carmakers unless Green Deal rules are relaxed

view more: ‹ prev next ›