• ayaya@lemdro.id
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 months ago

      If the model isn’t overfitted it’s also not even copying. By their nature LLMs are transformative which is the whole point of fair use.

        • ayaya@lemdro.id
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 months ago

          Again, even an exact copy is not stealing. It’s copyright infringement. Theft is a different crime.

          But paraphrasing is not copyright infringement either. It’s no different than Wikipedia having a synopsis for every single episode of a TV series. Telling someone about what a work contains for informational purposes is perfectly fine.

        • A_Very_Big_Fan@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          4 months ago

          !Arthur Dent has his home demolished while humans simultaneously have Earth demolished by an alien race called Vogons, but him and Ford Prefect escape by hitchhiking onto the Vogon ship. They’re discovered and thrown into space, but miraculously saved by Ford’s relative (can’t remember how they’re related) and his ship The Heart of Gold, which is powerful but unpredictable. They wind up on a mythical planet due to that unpredictability, and learn that Earth was a designer planet created to calculate the ultimate answer to the ultimate question of life, the universe, and everything. (The famous “42” thing). The whole crew escapes the planet and decides to go to The Restaurant at the End of The Universe to eat and watch the universe end.!<

          Have I just stolen The Hitchhikers Guide to the Galaxy and given it to you?

    • cmhe@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      4 months ago

      “Copying is theft” is the argument of corporations for ages, but if they want our data and information, to integrate into their business, then, suddenly they have the rights to it.

      If copying is not theft, then we have the rights to copy their software and AI models, as well, since it is available on the open web.

      They got themselves into quite a contradiction.

      • masterspace@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        4 months ago

        You realize that half of Lemmy is tying themselves in inconsistent logical knots trying to escape the reverse conundrum?

        Copying isn’t stealing and never was. Our IP system that artificially restricts information has never made sense in the digital age, and yet now everyone is on here cheering copyright on.

      • Buffalox@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        4 months ago

        If copying is not theft, then we have the rights to copy their software

        No we don’t, copying copyrighted material is copyright infringement. Which is illegal. that does not make it theft though.
        Oversimplifying the issue makes for an uninformed debate.

    • GamingChairModel@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      4 months ago

      Yeah, I’m not a fan of AI but I’m generally of the view that anything posted on the internet, visible without a login, is fair game for indexing a search engine, snapshotting a backup (like the internet archive’s Wayback Machine), or running user extensions on (including ad blockers). Is training an AI model all that different?

      • sugar_in_your_tea@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Yes, it kind of is. A search engine just looks for keywords and links, and that’s all it retains after crawling a site. It’s not producing any derivative works, it’s merely looking up an index of keywords to find matches.

        An LLM can essentially reproduce a work, and the whole point is to generate derivative works. So by its very nature, it runs into copyright issues. Whether a particular generated result violates copyright depends on the license of the works it’s based on and how much of those works it uses. So it’s complicated, but there’s very much a copyright argument there.

        • TheRealKuni@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 months ago

          An LLM can essentially reproduce a work, and the whole point is to generate derivative works. So by its very nature, it runs into copyright issues.

          Derivative works are not copyright infringement. If LLMs are spitting out exact copies, or near-enough-to-exact copies, that’s one thing. But as you said, the whole point is to generate derivative works.

      • Snot Flickerman@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        4 months ago

        Is it that or is it that the laws are selectively applied on little guys and ignored once you make enough money? It certainly looks that way. Once you’ve achieved a level of “fuck you money” it doesn’t matter how unscrupulously you got there. I’m not sure letting the big guys get away with it while little guys still get fucked over is as big of a win as you think it is?


        Examples:

        The Pirate Bay: Only made enough money to run the site and keep the admins living a middle class lifestyle.

        VERDICT: Bad, wrong, and evil. Must be put in jail.

        OpenAI: Claims to be non-profit, then spins off for-profit wing. Makes a mint in a deal with Microsoft.

        VERDICT: Only the goodest of good people and we must allow them to continue doing so.


        The IP laws are stupid but letting fucking rich twats get away with it while regular people will still get fucked by the same rules is kind of a fucking stupid ass hill to die on.

        But sure, if we allow the giant companies to do it, SOMEHOW the same rules will “trickle down” to regular people. I think I’ve heard that story before… No, they only make exceptions for people who can basically print money. They’ll still fuck you and me six ways to Sunday for the same.

        I mean, the guys who ran Jetflicks, a pirate streaming site, are being hit with potentially 48 year sentences. Longer than a lot of way more serious fucking crimes. I’ve literally seen murderers get half that.

        But yeah, somehow, the same rules will end up being applied to us? My ass. They’re literally jailing people for it right now. If that wasn’t the case, maybe this argument would have legs.

        But AI companies? Totes okay, bro.

  • dinckel@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    Just yet another proof, that the more 0’s you have in your valuation, the less the laws apply to you

  • CriticalMiss@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    Sure bud, pirating some Microsoft Studio video games and windows ISOs right now. What? I found them on the open web!

  • kibiz0r@midwest.social
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    Pirating Windows for your own personal, private use, which will never directly make you a single dollar: HIGHLY ILLEGAL

    Scraping your creative works so they can make billions by selling automated processes that compete against your work: Perfectly fine and normal!

    • yesman@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      arrow-down
      1
      ·
      4 months ago

      Do people still pirate Windows? You can download the iso directly from Microsoft’s website and you don’t need a registration key anymore.

  • interdimensionalmeme@lemmy.ml
    link
    fedilink
    English
    arrow-up
    0
    arrow-down
    1
    ·
    4 months ago

    Copyright infrigment is not theft, training models is not copyright infringement either. We need a law equivalent to when an artist says “he’s inpired by someone else” . That it specifically is illegal to do that without permission if you use a machine. That will force big tech to pay a pittance for it and it will instakill all the small player.

    • Elias Griffin@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      4 months ago

      Copyright Infringment strawman argument. When considering AI, we are not talking legal copyright infringement in the relationship between humans vs AI. Humans are mostly concerned with being obsoleted by Big Tech so the real issue is Intellectual Property Theft.

      artificial INTELLIGENCE stole our Intellectual Property

      Do you see it now?

      • interdimensionalmeme@lemmy.ml
        link
        fedilink
        English
        arrow-up
        0
        arrow-down
        1
        ·
        4 months ago

        It’s only theft as long as you cling to the failed “copyright” model.

        Big tech couldn’t steal anything if we don’t respect their property rights in the first place.

        By reifying copyright under the AI paradigm, we maintain big tech’s power over us.

        The truth is chatgpt belong to us. ClosedAI is just the compiler of the data.

        If we finally end the failed experiment of copyright, we destroy their mote.