Stable Diffusion XL Turbo can generate AI images as fast as you can type - eviltoast
  • L3ft_F13ld!@links.hackliberty.org
    English · 56 up · 11 down · edited · 1 year ago

    XL Turbotastic Mega Ginormous, etc. Hate naming schemes like this. Why not just make it v2.0 or the Pro version instead? Why use multiple words that make it sound bigger and better? Marketing BS that just sounds dumb.

    • simple@lemm.ee
      English · 50 up · 2 down · edited · 1 year ago

      Not sure why you have a problem with it; the naming here makes a lot of sense if you know the context.

      Stable Diffusion --> The original SD with versions like 1.5, 2.0, 2.1 etc

      Stable Diffusion XL --> A version of SD with much bigger training data and support for much larger resolutions (hence, XL)

      Stable Diffusion XL Turbo --> A version of SDXL that is much faster (hence, Turbo)

      They have different names because they’re actually different things; it’s not exactly a v1.0 --> v2.0 scenario.

      • at_an_angle@lemmy.one
        English · 6 up · 3 down · 1 year ago

        Naming schemes that aren’t clear are absolute garbage.

        What if you’re new to it, and there are 6 different recent versions of something all named with a description instead of version number? Is Jumbo newer than Mega?

        Fuck it, I’m ranting about this because it still upsets me.

        I wanted to buy a 3DS to play Shovel Knight and Binding of Isaac. Reading up on them, BoI would only play on a New 3DS XL. Cool.

        Went to the store and bought a new 3DS XL, only to find out I got the wrong one. What I wanted was a NEW 3DS XL, and what I got was a 3DS XL that was new. There is a difference, and it took me 4 days to notice, and I was working out of town for the next month. So I couldn’t return it. FUN!

        So screw naming new versions of things with names instead of numbers. Though somehow, Microsoft screwed even that one up.

        KISS: Keep it simple, stupid.

        • simple@lemm.ee
          English · 5 up · edited · 1 year ago

          Sure, 3DS names are dumb, but this is definitely not the case here. Using version numbers instead of different names for different things causes insane confusion and having to over-explain what it is.

          See: DLSS

          DLSS 2 is just DLSS 1 but better. DLSS 3 is frame generation that isn’t compatible with most hardware. DLSS 3.5 is similar to DLSS 2 but includes enhanced raytracing denoising.

          It’s a nightmare. Making a version 2, 3, 4 etc. of something also makes it sound like there’s no reason to use the old version, whereas a lot of people are still using regular Stable Diffusion over Stable Diffusion XL.

          Imagine if the discussion was “Hey don’t use Stable Diffusion 3 since you need a lot of VRAM, you should be using Stable Diffusion 1.5 or Stable Diffusion 2.1, but also it’s worth getting a new GPU for Stable Diffusion 4 cuz it’s very fast but has lower quality than version 3”

      • Phoenixz@lemmy.ca
        English · 1 up · 1 year ago

        Yeah, but what if the next version has an even bigger training set? XXL? And what about the one after that? Turbo was already used, so now we call it Nitro? This is not the “New Kids” movies, you know…

    • grue@lemmy.world
      English · 35 up · 2 down · 1 year ago

      Why not just make it v2.0 or the Pro version instead?

      “Pro version” is equally cringe.

      • L3ft_F13ld!@links.hackliberty.org
        English · 2 up · 1 year ago

        Yeah, I get that. It just would have made more sense given that it’s widely used. Though I’ve been told why the name is so weird, and it makes some sense now.

      • tsonfeir@lemm.ee
        English · 1 up · 1 down · 1 year ago

        Here are my suggestions:

        Stable Diffusion Free

        Stable Diffusion Paid with Limitations

        Stable Diffusion Paid Unlimited

    • DABDA@lemmy.world
      English · 10 up · 1 year ago

      I agree with you in general, but for Stable Diffusion, “2.0/2.1” was not a direct incremental improvement on “1.5”; it was trained differently and behaves differently. XL is not a simple upgrade from 2.0 either. And since they say this Turbo model doesn’t produce images as detailed, it would be more confusing to have an SDXL 2.0 that is worse but faster than base SDXL, and then, when a more direct improvement to SDXL arrives, presumably call that SDXL 3.0 (when really it’s version 2), etc.

      It’s less like Windows 95->Windows 98 and more like DOS->Windows NT.

      That’s not to say it all couldn’t have been better named. Personally, instead of ‘XL’ I’d rather they start including the base resolution and something to reference whether it uses a refiner model etc.

      (Note: I use Stable Diffusion but am not involved with the AI/ML community and don’t fully understand the tech. I’m not trying to claim expert knowledge; this is just my interpretation.)

      • barsoap@lemm.ee
        English · 3 up · 1 year ago

        AFAIU SDXL is actually an, erm, genetic descendant of SD1.5, with its architecture expanded, weights transferred from 1.5, and then trained on bigger inputs (512x512 in the end is awfully small). SD2.0 is a completely new model, trained from scratch, and as far as I’m aware no one’s actually using it. Also, no one is using the SDXL refiner: if you go to civitai, it’s all models with detailer capabilities baked in. What you do see is workflows that generate an image, add some noise at the very end, and repeat the last couple of steps. Using the base SDXL refiner on the output of other SDXL models is sometimes outright comical, because it sometimes has no idea what it’s looking at and then produces exquisite surface-texture details of the wrong material. Say, a silk keyboard, because it doesn’t realise that it’s supposed to be ABS and, well, black silk exists.

    • foggy@lemmy.world
      English · 2 up · 1 year ago

      I’m just glad we’re moving away from purposely misspelled product SEO hacks.