/sdg/ - Stable Diffusion General

>Starter guides:
rentry.org/GUItard - current guide
rentry.org/tqizb - AMD guide

>Current scripts:
github.com/hlky/stable-diffusion-webui

>Colab with new model
colab.research.google.com/drive/1AfAmwLMd_Vx33O9IwY2TmO9wKZ8ABRRa

>Prompt helper:
lexica.art/

>SAAS sites:
beta.dreamstudio.ai - Free, New accounts get 200 generations
pornpen.ai - NSFW Allowed, Free
dezgo.com - Contains Ads, Free

>Related:
/h/ /sdhb/: /vg/ /aids/: /mlp/ /ppp/
Official discord: discord.gg/stablediffusion
NSFW Discord: discord.gg/Crkrd76ueK
Emad's Twitter: twitter.com/EMostaque

>Recent News:
Emad announces plans for custom model training: twitter.com/EMostaque/status/1561780596107612161
Emad announces future anime oriented weights: twitter.com/EMostaque/status/1562192103823708162
Emad announces animation Soon: twitter.com/EMostaque/status/1561778925906395140

Attached: RMS.png (1024x1024, 2.14M)

Attached: batch_size: 1 cfg_scale: 7.5 ddim_eta: 0 ddim_steps: 64 height: 512 n_iter: 9 prompt: neuromancer, attractive, sexy, dark, gloomy, desaturated, award-winning, by Greg Rutkowski sampler_name: k_euler_a seed: 2613338574 target: txt2img toggles: - 1 - 2 - .png (1536x2048, 3.42M)

Ok, very funny, SD. Now give me a 3rd worlder flag.

Attached: Screenshot_20220827_230221.png (1151x722, 481.95K)

>mfw when i think of the possibilities

Attached: seed_120597_00028.png (576x768, 618.47K)

>Nekopara
Kind of looks like the characters I guess.

Attached: Nekopara.jpg (1536x1536, 885.27K)

Attached: output.png (1024x1024, 1.36M)

>Error: Maximum file size allowed is 4 MB

Attached: 00035 resized.png (1024x1024, 1.37M)

that is a 3rd worlder flag

I've been doing ML computer vision stuff, mostly as a hobby, for the past two years: object detection, bounding boxes, segmentation, and other things that help me organize my pictures. What should I be learning to go from being a dilettante to someone proficient in the field? Some people have told me to learn the math behind it. I'm a little bit apprehensive about that, since I did an EE program and nearly all the theoretical math I went over there never came up again. Is ML different in this respect?

What the hell? What is the script using to decode the image?

What do you mean?

Oh yeah, meant here. Got confused.

What are the memory requirements for ESRGAN? I keep getting OOM when I try it.

What did I fuck up and how do I fix it?

Attached: Opera 8_27_2022 11_07_18 AM.png (960x480, 38.35K)

Interesting news: a paper from Google shows a technique that works better than textual inversion for keeping characters and objects consistent across different contexts: dreambooth.github.io/
You fine-tune the model on your own pictures of a subject and it keeps strong fidelity to that subject in new contexts. This would address one of the main weaknesses of the current models.
> "How does this differ technically from Textual Inversion?"
> "One key component of our method is to finetune using the images of a subject, different from Textual Inversion. This allows for strong identity preservation which is super important for this problem. You can also check the Textual Inversion first author's response to this question here. twitter.com/RinonGal/status/1563092813310410752?s=20&t=N1nlP3JVrnLyVwN2osvOoA We are working on code and next steps. Our method is general and should work on other text-to-image models that are not Imagen, and we would like to implement it on some of the latest open-source models!"

There is also a Photoshop plugin coming soon. Register for the beta here: getalpaca.io/
Here is a demo: reddit.com/r/StableDiffusion/comments/wyduk1/show_rstablediffusion_integrating_sd_in_photoshop/

Attached: download-min.png (1024x1024, 561.01K)

I still don't know what you mean.
I tried to upload a 2048x2048 pic but it was too big so I resized it.

>did you think it was just equivalent to MS Paint resize?

No, and I wouldn't dare to be that arrogant, but it's obvious that I lack the basics of how it really works, so thank you for the informative post and the Wikipedia link. I was comparing its results to Gigapixel's, hence my confusion about image manipulation levels.

What CSS is that and how did you add it to the webui?
Also, sadly, in the eyes of AI the US of A is still the first world.

It already did, retard.

Attached: 1637463572754.png (1024x1024, 1.52M)

My small-ish company does a shitload of ML, although I myself am not involved with that side of it. From what I understand, it's most important to be smart, comfortable dipping into C++, and at least capable of understanding the maths if a task requires that you really dive into it. I'm guessing that if you were to spend 200 hours across 3 months training in something, though, more of what you've already done would be more valuable for finding a job and being useful in it than 200 hours of the underlying maths. OTOH, reading your way through one respected textbook on the theory in that time would probably be good, just to prove you're capable of understanding it so that you can apply it when needed. Maybe someone who actually works in the field can give you better advice than my guesses though.

>try using fix faces with gfpgan
>error pops up saying it can't load C++ ops
>both versions of pytorch and torchvision are compatible
wat do?

Attached: index.png (512x512, 423.94K)

Attached: a_highly_detailed_masterful_painting_of_a_comfy_decorated_christmas_tree_in_winter_outside_a_cabin_in_countryside_of_Finland__concept_art__highly_detailed__atmospheric_lighting__rembrandt_8ohwsoxavz1k.png (512x512, 476.17K)

Arch chads with AMD GPUs:

Don't waste your precious spare time and CPU cycles building ROCm from scratch.

Here is a fixed guide for AMD without the stupid build steps:

rentry.org/sdamd-arch4edu

Attached: 000012.2938777587.02.png (512x640, 477.82K)

what system and gpu?

kek

found a bug: when using batch, seed for the first output gets written in the yml for all outputs
so all the filenames I've been giving to the images have been bs

Attached: batch_size: 1 cfg_scale: 7.5 ddim_eta: 0 ddim_steps: 64 height: 512 n_iter: 9 prompt: 4chan meetup, attractive, sexy, dark, gloomy, desaturated, award-winning, by Greg Rutkowski sampler_name: k_euler_a seed: 2574011149 target: txt2img toggles: - 1 - 2 -.png (1536x2048, 2.47M)

Just a guess, but do you have two virtual envs with the same name?

only joshing with you mate. Happy friday

Looks sick, although Google didn't release Imagen even as an API, and I can't see any concrete claim that they'll actually release this, rather than just "get it working with" SD/DALL-E and then release it to nobody or to academics only. Maybe I'm blind.

>. Maybe someone who actually works in the field can give you better advice than my guesses though.
I applied for CV jobs knowing I couldn't actually get them, just so I could get a phone interview and ask them directly.
They only gave me very general, vague, and unhelpful answers. Maybe it's a dick move to waste their time like that, but I don't know who else to ask. I want to learn, I just don't know what direction to go.

is installing the SDKs actually required/recommended? I do not have them installed and have been running without issue

Hey can anyone help me generate giantess content

Ai sucks at making giga-giantess stuff

Arch and 6700xt

reinstall visual studio and cuda

from last thread im currently generating more avatar/navi images

Attached: download (40).png (512x512, 386.18K)

navi user here

Attached: download (24).png (512x512, 372.81K)

Attached: grid-0855.png (1024x1024, 1.27M)

I don't know for sure what GUI that is, but that's not the behaviour in webui. That's because webui manually increments the seed by 1 for each pic, up to the "batch size". I think the original SD script instead takes your batch size and generates the pics one after another from the single starting seed. So seed 42069 generates a certain pic, and the 2nd pic from it is unreachable except by starting from 42069 again and generating at least 2 pics. That's why webui changed it: if you make a dozen pics and the 11th one is great, you'd have to generate at least 11 again to work with it.
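
A toy sketch of the two seeding schemes, in case it helps. It uses Python's stdlib random as a stand-in for the latent noise sampler, and one float stands in for one image's noise. The function names are made up for illustration, and this is not the actual code from either script.

```python
import random

def sequential_batch(seed, n_images):
    """Seed one RNG once; image i depends on every draw made before it."""
    rng = random.Random(seed)
    return [rng.random() for _ in range(n_images)]

def incremented_batch(seed, n_images):
    """Use a fresh RNG per image, seeded with seed + i."""
    return [random.Random(seed + i).random() for i in range(n_images)]

seq = sequential_batch(42069, 3)
inc = incremented_batch(42069, 3)

# Incrementing scheme: the 3rd image can be regenerated from its own seed alone.
assert random.Random(42069 + 2).random() == inc[2]

# Sequential scheme: getting the 3rd image back means replaying the batch
# from the starting seed and generating at least 3 images.
assert sequential_batch(42069, 3)[2] == seq[2]
```

With the incrementing scheme, the seed written next to each image is enough to reproduce it on its own, which is the whole point of the change.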

seed is bs

Attached: batch_size: 1 cfg_scale: 7.5 ddim_eta: 0 ddim_steps: 64 height: 512 n_iter: 9 prompt: python, attractive, sexy, dark, gloomy, desaturated, award-winning, by Greg Rutkowski sampler_name: k_euler_a seed: 2731028331 target: txt2img toggles: - 1 - 2 - 3 - 4.png (1536x2048, 2.78M)

I fixed that error on my system by installing the PyTorch ROCm nightlies rather than the stable release.

Attached: download (26).png (512x512, 385.5K)

There's no point in telling other people text seeds, text seeds just give you random numbers.

Attached: download (31).png (512x512, 452.37K)

lol why are grids in webp now

I'm using hlky's webui from the guide
it gets the seed right in the file name but wrong in the yaml

bs=bullshit

>パンツ
Garbage.

Disclaimer:
Though I do have (very limited) academic experience with computer vision, this is not my main field of study.
I will talk about neural networks and statistics more broadly.

Practical side:
Stanford University has published freely available courses for neural networks.
The exercises have you implement standard feed-forward neural networks using NumPy routines.
The exercises are honestly pretty hard but they do a good job of teaching you how to implement efficient numerical computations with NumPy/TensorFlow (NumPy routines are literally 100x faster than Python loops).
Requires a lot of linear algebra.
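
A rough way to see the loop vs. NumPy gap for yourself: the same dot product computed once with a plain Python loop and once with np.dot, which runs in compiled code. The array size and the helper name dot_loop are arbitrary choices for this sketch, and the actual speedup depends on your machine and on the operation.

```python
import time
import numpy as np

def dot_loop(a, b):
    """Dot product with an explicit Python loop."""
    total = 0.0
    for x, y in zip(a, b):
        total += x * y
    return total

rng = np.random.default_rng(0)
a = rng.random(200_000)
b = rng.random(200_000)

t0 = time.perf_counter()
slow = dot_loop(a, b)
t1 = time.perf_counter()
fast = float(np.dot(a, b))  # vectorized, no Python-level loop
t2 = time.perf_counter()

print(f"loop: {t1 - t0:.4f}s  numpy: {t2 - t1:.6f}s")
assert np.isclose(slow, fast)  # same result up to floating-point rounding
```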

Theoretical side:
Learn about the method of maximum likelihood.
This is the foundation for cost functions such as cross entropy or mean squared error.
An important point is how bias and variance are defined in this context and how they relate to neural networks.
Statistics is a very deep rabbit hole though.
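
To make the link between maximum likelihood and mean squared error concrete, here is the standard short derivation. It assumes (my assumption for this sketch) that the targets are the model output plus independent Gaussian noise with fixed variance; f_theta, sigma, and n are just notation introduced here.

```latex
% Assumed model: y_i = f_\theta(x_i) + \varepsilon_i, with \varepsilon_i \sim \mathcal{N}(0, \sigma^2) i.i.d.
\begin{align}
\log L(\theta)
  &= \sum_{i=1}^{n} \log \left[ \frac{1}{\sqrt{2\pi\sigma^2}}
     \exp\!\left( -\frac{(y_i - f_\theta(x_i))^2}{2\sigma^2} \right) \right] \\
  &= -\frac{n}{2} \log(2\pi\sigma^2)
     - \frac{1}{2\sigma^2} \sum_{i=1}^{n} (y_i - f_\theta(x_i))^2
\end{align}
% The first term does not depend on \theta, so
\begin{equation}
\hat{\theta}_{\mathrm{ML}}
  = \arg\max_\theta \log L(\theta)
  = \arg\min_\theta \frac{1}{n} \sum_{i=1}^{n} (y_i - f_\theta(x_i))^2
\end{equation}
```

So under that noise assumption, minimizing MSE and maximizing the likelihood pick the same parameters. Cross entropy comes out the same way if you assume a categorical distribution over classes instead.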

Quite honestly though, the field of "data scientists" and ML practitioners is already full of dilettantes.
If you can state the correct definition of linear regression and explain how it applies to xy data you will already be in the top 10%.
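
For reference, a minimal sketch of what linear regression on xy data means: pick the slope and intercept that minimize the sum of squared vertical distances between the line and the points. The closed-form formulas below are the standard ordinary least squares solution for one input variable; the function name fit_line and the toy data are made up for this example.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Toy data lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```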

Attached: パンツ.jpg (1536x1536, 1.04M)

webp is a faster image file type

Attached: 1650541237775.png (512x512, 281.8K)

Nice, top left almost spelt spartan correctly.

Attached: out.jpg (2048x1536, 3.02M)

Here's hoping for some good navi images when the gen finishes.

Attached: download (27).png (512x512, 340.13K)

You are on Any Forums. Change it yourself.

because I ctrl+z'd one too many times before submitting a pull request by accident and hlky hasn't pulled my follow-up fix to it yet

>NumPy routines are literally 100x faster than Python loops
Yeah, I noticed. Does it use C bindings?

Why must I suffer?

Attached: 20220828032406_504996002.png (512x512, 317.57K)

>pantsu
Works.

Attached: pantsu.jpg (1536x1536, 509.99K)

>100x faster than Python loops
That's the one thing I noticed when I started learning Python:
loops are fucking slow in it. Why is all machine learning scripted in it?