Why Mark Zuckerberg wants to redefine open source so badly - eviltoast
  • will_a113@lemmy.ml
    link
    fedilink
    English
    arrow-up
    101
    ·
    11 hours ago

    Kinda funny how when mega corps can benefit from the millions upon millions of developer hours that they’re not paying for they’re all for open source. But when the mega corps have to ante up (with massive hardware purchases out of reach of any of said developers) they’re suddenly less excited about sharing their work.

  • fuzzy_feeling@programming.dev
    link
    fedilink
    English
    arrow-up
    41
    arrow-down
    5
    ·
    11 hours ago

    Meta’s Llama models also impose licensing restrictions on its users. For example, if you have an extremely successful AI program that uses Llama code, you’ll have to pay Meta to use it. That’s not open source. Period.

    open source != no license restrictions

    According to Meta, “Existing open source definitions for software do not encompass the complexities of today’s rapidly advancing AI models. We are committed to keep working with the industry on new definitions to serve everyone safely and responsibly within the AI community.”

    i think, he’s got a point, tho

    is ai open source, when the trainig data isn’t?
    as i understand, right now: yes, it’s enough, that the code is open source. and i think that’s a big problem

    i’m not deep into ai, so correct me if i’m wrong.

    • GoodEye8@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      14 minutes ago

      I understand the same way and I think there’s a lot of gray area which makes it hard to just say “the data also needs to be open source for the code to be open source”. What would that mean for postgreSQL? Does it magically turn closed source if I don’t share what’s in my db? What would it mean to every open source software that stores and uses that stored data?

      I’m not saying the AI models shouldn’t be open source, I’m saying reigning in the models needs to be done very carefully because it’s very easy to overreach and open up a whole other can of worms.

    • TimeSquirrel@kbin.melroy.org
      link
      fedilink
      arrow-up
      8
      arrow-down
      2
      ·
      7 hours ago

      I don’t think any of our classical open licenses from the 80s and 90s were ever created with AI in mind. They are inadequate. An update or new one is needed.

      Stallman, spit out the toe cheese and get to work.

  • Kompressor@lemmy.world
    link
    fedilink
    English
    arrow-up
    18
    ·
    11 hours ago

    Desperately trying tap in to the general trust/safety feel that open source software typically has. Trying to muddy the waters because they’ve proven they cannot be trusted whatsoever

    • kava@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      edit-2
      5 hours ago

      when the data used to train the AI is copyrighted, how do you make it open source? it’s a valid question.

      one thing is the model or the code that trains the AI. the other thing is the data that produces the weights which determines how the model predicts

      of course, the obligatory fuck meta and the zuck and all that but there is a legal conundrum here we need to address that don’t fit into our current IP legal framework

      my preferred solution is just to eliminate IP entirely

      • FooBarrington@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        59 minutes ago

        when the data used to train the AI is copyrighted, how do you make it open source?

        When part of my code base belongs to someone else, how do I make it open source? By open sourcing the parts that belong to me, while clarifying that it’s only partially open source.

      • jacksilver@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        5 hours ago

        I mean, you can have open source weights, training data, and code/model architecture. If you’ve done all three it’s an open model, otherwise you state open “component”. Seems pretty straightforward to me.

  • paraphrand@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    ·
    9 hours ago

    If people could stop redefining words, that would go a long way to fixing our current strife.

    Not a total solution, but it would clarify the discussion. I loathe people who redefine and weaponize words.

    • FooBarrington@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      58 minutes ago

      No, because that would no longer be open in the open source sense.

      It’s either open for everyone, or it isn’t open.