Nerdy spielt mit Stable Cascade[GER/ENG]

View this thread on: d.buzz | hive.blog | peakd.com | ecency.com

hive-121566·@nerdtopiade·a year ago

0.000 HBD

Nerdy spielt mit Stable Cascade[GER/ENG]

<hr>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48Ka6D1uKhpgnTnzJmTzMBkhL7Ws3b9mk8rRJctrKqztKjV4hxUx4MU2QYhnSm7mp1.png
<p>
</hr>
<p>
<hr>
<div class=pull-left>
Guten Tag meine lieben Squadis.<p>
Es ist mal wieder soweit Nerdy kann über AI schreiben.<p>Stability AI hat etwas neues heraus gebracht ,nämlich Stable Cascade eine neues Model zur Generierung von Bildern.<p> Für Stable Cascade wurde auch eine andere Methode zum Trainieren benutzt die sogenannte <a href="https://openreview.net/forum?id=gU58d5QeGv">Würstchen Methode</a>. Wer mehr über die Würstchen Methode wissen will der klickt auf den vorrigen link :)<p>
 </div>
<div class=pull-right>
Good day my dear Squadis.<p>
It's time again for Nerdy to write about AI.<p>Stability AI has released something new, namely Stable Cascade, a new model for generating images.<p> 
For Stable Cascade, another method of training was also used, the so-called <a href="https://openreview.net/forum?id=gU58d5QeGv">sausage method</a>. If you want to know more about the sausage method, click on the previous link :)<p>
</div>
<p>
</hr>
<hr>
<div class=pull-left>
Stable Cascade generiert Bilder nach einem dreistufigen Prozess:<p> Zunächst wird mit dem Diffusionsmodell der Stufe C ein latentes Bild mit niedriger Auflösung erzeugt.<p> Dieser Latenzwert wird dann mit dem Diffusionsmodell der Stufe B hochskaliert.<p>Dieses hochskalierte latente Bild wird dann erneut hochskaliert und mit der Stufe A VAE in das fertige Bild umgewandelt.<p>
Stabilty AI bietet die Modele in 2 Varianten an "Normal" und Light und ich wollte einfach heraus finden ob große Unterschiede gibt bei den Modelen.<p> Aus diesem Grund habe ich 4 Bilder generiert einmal mit den "Normalen" Modelen und einmal mit den "Light" Modelen.<p>
Wie immer habe ich die gleichen Prompts, Seeds ,Sampler und die gleichen CFG(4 und 2.2) Werte benutzt bei den Bildern.<p> Die Prompts und Seeds findet ihr <a href="https://peakd.com/hive-121566/@nerdtopiade/ich-habe-mit-comfyui-rumgespielt">hier</a><p>Auf der linken Seite seht ihr immer das "Light" Bild und auf der rechten das "Normale" Bild.
</div>
<div class=pull-right>
Stable Cascade generates images according to a three-stage process:<p>
First, a latent image with low resolution is generated using the diffusion model of level C.<p> This latent value is then scaled up using the diffusion model of level B.<p>
This upscaled latent image is then upscaled again and converted into the final image using the A VAE stage.<p>
Stabilty AI offers the models in 2 variants "Normal" and "Light" and I just wanted to find out if there are big differences in the models.<p> For this reason I generated 3 pictures once with the "Normal" models and once with the "Light" models.<p>
As always, I used the same prompts, seeds, samplers and the same CFG(4 and 2.2) values for the pictures.<p> You can find the prompts and seeds <a href="https://peakd.com/hive-121566/@nerdtopiade/ich-habe-mit-comfyui-rumgespielt">here</a><p>You will always see the "Light" image on the left and the "Normal" image on the right.
</div>
</hr>
<p>
<hr>
<div class=pull-left>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48K688nYjjaMJ76w7d1wEQVQcaD3vetDnsJTCtMoWJo7SSiuxsbwNTY6oKQnuGydF6.png<p>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48RNtZcKsSVzrLowzBZjvDigs7DHZSjn9eYniw3uk3wcPqE8DUbdgq4PeXgEgdWLSc.png<p>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48VisGLkBVyfRE4pj2PByumdbcYhME9CakqQKtxsMfQm92UdjYJSPPCLE6Qcz6c1od.png
https://files.peakd.com/file/peakd-hive/nerdtopiade/48rveNoWxhKspGsxhwmJJ7wvsVCtUFHQvTgHsaGTax99ZasGcLX1RUCxe99q8zEEG8.png
<p>
</div>
<div class=pull-right>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48GCAMavgHRF9sMEob8mp2ALijHq3fKEohDBYCMSnUVKaT8aR6R7t32x63QrnTBm1d.png<p>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48RsrrZ3v538gY4688BybfZeNCvZNkUwmAtoc3a632ZMoZyF7Pf4275zBd9rm1GU2P.png<p>
https://files.peakd.com/file/peakd-hive/nerdtopiade/48QQzhF8GgqbQUW9wDYqQNfd3JZjKTANAD4aRTu7YyG4soqBBn9JfmZrkojbwnvhYw.png<p>
https://files.peakd.com/file/peakd-hive/nerdtopiade/488za9o36oSZAkPwKEvuc5xzvNmhDaRGswqg6ugDk47cJEtAfNTKb4T6Va1dPM2scU.png
</div>
</hr>
<p>
<hr>
<div class=pull-left>
Eins fällt sofort auf die Light Bilder sehen alle so aus als hätte jemand den Zoom betätigt um ganz nahe zu sein.<p>
Was mir auch auf gefallen ist das normale Model verweigert viel öfters die Ausgabe eines Bildes und spuckt nur Artefakte aus (wie beim Auto).<p>Unsere Red Sonja wollten beide Modele erst ausgeben als ich Anime an den Anfang des Prompts geschrieben habe.<p>
Das Light Model hat auch Probleme mit den Schriften,das Titelbild ist mit dem normalen Model entstanden.<p>
Runter laden könnt ihr euch die Modele bei <a href="https://huggingface.co/stabilityai/stable-cascade/tree/main">Hugginface</a>.<o>Stage A gehört in den VAE folder der Comfyui installation, Stage B und C in den Unet Folder.<p>Zusätzlich benötigt ihr noch den Text Encoder (auch bei Huggingface) welcher dann in den Clip(nicht clip_vision) Ordner unter Models kopiert wird.<p>
Wer den Workflow benötigt lädt ihn sich einfach <a href="https://comfyworkflows.com/workflows/15b50c1e-f6f7-447b-b46d-f233c4848cbc">hier</a> herunter.<p>
Für das Light Model benötigt ihr mindestens 6 GB Vram für das normale Model mindestens 10 GB Vram und damit alles funktioniert müsst ihr einmal comfyui updaten.<p>
Wieder sehr viel zu lesen geworden,aber ich hoffe euch hat es gefallen.
</div>
<div class=pull-right>
One thing is immediately noticeable: the light pictures all look as if someone has used the zoom to get very close.<p>
What I also noticed is that the normal model often refuses to output an image and only spits out artifacts ( just like the car).<p>Our Red Sonja only wanted to output both models when I wrote Anime at the beginning of the prompt.<p>
The light model also has problems with the fonts - the cover picture was created with the normal model.<p>
You can download the models at <a href="https://huggingface.co/stabilityai/stable-cascade/tree/main">Hugginface</a>.<o>Stage A belongs in the VAE folder of the Comfyui installation, Stage B and C in the Unet folder.<p>In addition, you need the Text Encoder (also from Huggingface) which is then copied into the Clip (not clip_vision) folder under Models.<p>
If you need the workflow, simply download it <a href="https://comfyworkflows.com/workflows/15b50c1e-f6f7-447b-b46d-f233c4848cbc">here</a>.<p>
For the light model you need at least 6 GB Vram for the normal model at least 10 GB Vram and for everything to work you have to update comfyui once.<p> Once again a lot to read, but i hope u liked it.
</div>
</hr>
<p>
<hr>
<center>
<a href="https://www.twitter.com/nerdtopia"><img src="https://cdn.steemitimages.com/DQmcPstDSW6gtzTG7JRSCgpziXzYs6YgP9uFX7QnfRpSdkA/twitter.png"></a> <a href="https://www.instagram.com/nerdtopiade/"><img src="https://cdn.steemitimages.com/DQmVWMogDChXsBMqJVtKgdB3i9XiP4cMuF8VK7BQwWUbE9t/insta.png"></a> 
</center>

👍 gerdtrudroepke, captain.blog, nerdopi, driveforkids, portalvotes, sbi4, kaldewei, sbi-tokens, meins0815, steemillu, matt-kirby, sneakyninja, mastergerund, thedailysneak, babysavage, ravensavage, helpie-caster, cryptoknight12, helpie, magdalena1b, ciderjunkie, devmarvel, niallon11, bennettitalia, doomsdaychassis, free-reign, bflanagin, melor9, lillywilton, derosnec, paulag, guchtere, meno, socent, soulturtle, zipporah, tinyhousecryptos, kggymlife, joeyarnoldvn, kgswallet, crimo, jlsplatts, markaustin, backinblackdevil, nerdtopiade, voxmortis, clau-de-sign, slacktmusic, celinavisaez, pundito, keepinitsteem, calimeatwagon, rcshad0w, gerusan, letsplaywhatelse, emitste, kanrat, bilderkiste, retard-gamer-de, jeronimorubio, camuel, elector, goldfoot, botito, tobor, hadaly, dotmatrix, chomps, freysa, lunapark, weebo, otomo, buffybot, psybot, chatbot, misery, freebot, cresus, filosof103, honeybot, dtake, quicktrades, droida, c0wtschpotato, sylmarill, dera123, rambutan.art, bluerobo, bluefinstudios, monsterbuster, reiseamateur, orionvk, fabio, dashforcenews-de, lifestylechill, freiheitsbote, dash-embassy, shopinbit, my-art-way, dirkzett, condeas, dotwin1981, alucian, hornet-on-tour, siphon.tribes, ai4fun, slothbuzzcurator, ravenmus1c, anandkj611, slothbuzz, tangmo, starthilfe, icedragonzes,

properties (23)vote details (113)