Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license?

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
27 Indlæg 18 Posters 44 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

    If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

    kevinbowen@hachyderm.ioK This user is from outside of this forum
    kevinbowen@hachyderm.ioK This user is from outside of this forum
    kevinbowen@hachyderm.io
    wrote sidst redigeret af
    #5

    @lcamtuf

    Agreed. That was my initial take on the issue:

    https://hachyderm.io/@kevinbowen/116179624714127257

    1 Reply Last reply
    0
    • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

      If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

      victimofsimony@infosec.exchangeV This user is from outside of this forum
      victimofsimony@infosec.exchangeV This user is from outside of this forum
      victimofsimony@infosec.exchange
      wrote sidst redigeret af
      #6

      @lcamtuf

      This case law exists in the U.S.

      There are two cases (or arguably three if you include Sega v. SNK).

      Here's what you really care about:
      🅰️ Any author of code is judged based on their own use of the existing code, so reverse-engineering of code used to be based on an engineer writing down, line by line, in plain English, what to do. Then a second person sat down and made up code, line-by-line to accomplish that task. Things have changed but the idea that you can't literally harvest existing code is still a thing.
      🅱️ You own the #AI made code but can't copyright it... so you can't profit from it in the same way.

      fantasmitaasex@todon.euF 1 Reply Last reply
      0
      • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

        @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

        hopeless@mas.toH This user is from outside of this forum
        hopeless@mas.toH This user is from outside of this forum
        hopeless@mas.to
        wrote sidst redigeret af
        #7

        @kevinr @lcamtuf In retrospect... "actual answer", "of course", "prima facie" are all red flags you're reading a bunch of nonsense blather.

        kevinr@masto.free-dissociation.comK 1 Reply Last reply
        0
        • bgalehouse@mathstodon.xyzB bgalehouse@mathstodon.xyz

          @kevinr @lcamtuf And if you ask it to write a detailed spec based on its implementation, and then separately to write an implementation of that spec?

          https://www.allaboutcircuits.com/news/how-compaqs-clone-computers-skirted-ibms-patents-and-gave-rise-to-eisa/

          snoopj@hachyderm.ioS This user is from outside of this forum
          snoopj@hachyderm.ioS This user is from outside of this forum
          snoopj@hachyderm.io
          wrote sidst redigeret af
          #8

          @bgalehouse @kevinr @lcamtuf it's a tempting argument to attempt but it kinda falls apart when "the entire library was in the training corpus anyway" is a given.

          The fact that it is a terrible argument is of course not really going to stop anyone from making it.

          arnebab@rollenspiel.socialA 1 Reply Last reply
          0
          • bgalehouse@mathstodon.xyzB bgalehouse@mathstodon.xyz

            @kevinr @lcamtuf And if you ask it to write a detailed spec based on its implementation, and then separately to write an implementation of that spec?

            https://www.allaboutcircuits.com/news/how-compaqs-clone-computers-skirted-ibms-patents-and-gave-rise-to-eisa/

            tbortels@infosec.exchangeT This user is from outside of this forum
            tbortels@infosec.exchangeT This user is from outside of this forum
            tbortels@infosec.exchange
            wrote sidst redigeret af
            #9

            @bgalehouse @lcamtuf @kevinr

            Assuming you used the original source code to derive the detailed spec, then yes, that too is a derivative work.

            The "viral" nature of that sort of license has bothered me for a long time. It's always been simultaneously overly far reaching and impossible to realistically enforce.

            tbortels@infosec.exchangeT gisgeek@floss.socialG 2 Replies Last reply
            0
            • tbortels@infosec.exchangeT tbortels@infosec.exchange

              @bgalehouse @lcamtuf @kevinr

              Assuming you used the original source code to derive the detailed spec, then yes, that too is a derivative work.

              The "viral" nature of that sort of license has bothered me for a long time. It's always been simultaneously overly far reaching and impossible to realistically enforce.

              tbortels@infosec.exchangeT This user is from outside of this forum
              tbortels@infosec.exchangeT This user is from outside of this forum
              tbortels@infosec.exchange
              wrote sidst redigeret af
              #10

              @lcamtuf @bgalehouse @kevinr

              But here's an interesting question:

              If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

              arnebab@rollenspiel.socialA bredroll@mas.toB 2 Replies Last reply
              0
              • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                chuckmcmanis@chaos.socialC This user is from outside of this forum
                chuckmcmanis@chaos.socialC This user is from outside of this forum
                chuckmcmanis@chaos.social
                wrote sidst redigeret af
                #11

                @lcamtuf The current declination by the Supreme Court to overturn or review this ruling: https://www.copyright.gov/rulings-filings/review-board/docs/a-recent-entrance-to-paradise.pdf Which holds things created by AI are neither "derivative works" or "original works" and are not eligible for Copyright protection so no, you don't need to abide by the previous license. No one does. And if someone reverse engineers your code DMCA doesn't apply either (it isn't copyrighted).

                1 Reply Last reply
                0
                • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                  If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                  swampputty@infosec.exchangeS This user is from outside of this forum
                  swampputty@infosec.exchangeS This user is from outside of this forum
                  swampputty@infosec.exchange
                  wrote sidst redigeret af
                  #12

                  @lcamtuf someone or sonething else has done the work. not you. so whoever creates the work, owns the work.

                  1 Reply Last reply
                  0
                  • snoopj@hachyderm.ioS snoopj@hachyderm.io

                    @bgalehouse @kevinr @lcamtuf it's a tempting argument to attempt but it kinda falls apart when "the entire library was in the training corpus anyway" is a given.

                    The fact that it is a terrible argument is of course not really going to stop anyone from making it.

                    arnebab@rollenspiel.socialA This user is from outside of this forum
                    arnebab@rollenspiel.socialA This user is from outside of this forum
                    arnebab@rollenspiel.social
                    wrote sidst redigeret af
                    #13

                    @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

                    The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

                    You could simulate this if you had an AI that was provably not trained on the original source.

                    ("provably not trained" most likely means re-training from scratch)

                    @bgalehouse @kevinr @lcamtuf

                    kevinr@masto.free-dissociation.comK 1 Reply Last reply
                    0
                    • tbortels@infosec.exchangeT tbortels@infosec.exchange

                      @lcamtuf @bgalehouse @kevinr

                      But here's an interesting question:

                      If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.socialA This user is from outside of this forum
                      arnebab@rollenspiel.social
                      wrote sidst redigeret af
                      #14

                      @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

                      kevinr@masto.free-dissociation.comK 1 Reply Last reply
                      0
                      • tbortels@infosec.exchangeT tbortels@infosec.exchange

                        @lcamtuf @bgalehouse @kevinr

                        But here's an interesting question:

                        If you do not execute the code - did you accept the license? Does simply reading it sufficiently to be able to write a spec bind you to that license? That seems a bit too much.

                        bredroll@mas.toB This user is from outside of this forum
                        bredroll@mas.toB This user is from outside of this forum
                        bredroll@mas.to
                        wrote sidst redigeret af
                        #15

                        @tbortels @lcamtuf @bgalehouse @kevinr if a thing has a licence then that covers its use, so using it as a wallpaper image or software component or training data could be argued.

                        1 Reply Last reply
                        0
                        • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                          @tbortels if you do not accept the license, you do not have any right to use the code. It’s "all rights reserved" then. @lcamtuf @bgalehouse @kevinr

                          kevinr@masto.free-dissociation.comK This user is from outside of this forum
                          kevinr@masto.free-dissociation.comK This user is from outside of this forum
                          kevinr@masto.free-dissociation.com
                          wrote sidst redigeret af
                          #16

                          @ArneBab @tbortels @lcamtuf @bgalehouse

                          Yeah the license applies whether you accept it or not. And whether your spec counts as a derivative work or not will depend greatly on the details of your spec

                          1 Reply Last reply
                          0
                          • hopeless@mas.toH hopeless@mas.to

                            @kevinr @lcamtuf In retrospect... "actual answer", "of course", "prima facie" are all red flags you're reading a bunch of nonsense blather.

                            kevinr@masto.free-dissociation.comK This user is from outside of this forum
                            kevinr@masto.free-dissociation.comK This user is from outside of this forum
                            kevinr@masto.free-dissociation.com
                            wrote sidst redigeret af
                            #17

                            @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

                            hopeless@mas.toH 1 Reply Last reply
                            0
                            • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                              @SnoopJ There’s the concept of clean room reimplementations (see the link by @bgalehouse😞 one group writes the spec -- possibly with access to the source.

                              The second group has never seen the source and only gets the spec. This second group then writes the program according to the spec.

                              You could simulate this if you had an AI that was provably not trained on the original source.

                              ("provably not trained" most likely means re-training from scratch)

                              @bgalehouse @kevinr @lcamtuf

                              kevinr@masto.free-dissociation.comK This user is from outside of this forum
                              kevinr@masto.free-dissociation.comK This user is from outside of this forum
                              kevinr@masto.free-dissociation.com
                              wrote sidst redigeret af
                              #18

                              @ArneBab @SnoopJ @bgalehouse @lcamtuf

                              And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

                              arnebab@rollenspiel.socialA 1 Reply Last reply
                              0
                              • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                                groxx@hachyderm.ioG This user is from outside of this forum
                                groxx@hachyderm.ioG This user is from outside of this forum
                                groxx@hachyderm.io
                                wrote sidst redigeret af
                                #19

                                @kevinr @lcamtuf yea, seems like at best it's treated like it was done by a human... and then it's just blatant plagiarism.

                                1 Reply Last reply
                                0
                                • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                  @lcamtuf actual answer: of course you do, it’s prima facie a derivative work, same as if you had rewritten the program by hand.

                                  kevinr@masto.free-dissociation.comK This user is from outside of this forum
                                  kevinr@masto.free-dissociation.comK This user is from outside of this forum
                                  kevinr@masto.free-dissociation.com
                                  wrote sidst redigeret af
                                  #20

                                  @lcamtuf There are a number of tools online which purport to strip the copyright from images by running them through an image model, and they're just as obviously bullshit

                                  1 Reply Last reply
                                  0
                                  • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                    @ArneBab @SnoopJ @bgalehouse @lcamtuf

                                    And the spec would need to carefully elide certain details which would get it classed as a derivative work itself—much harder for an LLM to do than a team of humans

                                    arnebab@rollenspiel.socialA This user is from outside of this forum
                                    arnebab@rollenspiel.socialA This user is from outside of this forum
                                    arnebab@rollenspiel.social
                                    wrote sidst redigeret af
                                    #21

                                    @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                                    You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                                    @SnoopJ @bgalehouse @lcamtuf

                                    arnebab@rollenspiel.socialA 1 Reply Last reply
                                    0
                                    • arnebab@rollenspiel.socialA arnebab@rollenspiel.social

                                      @kevinr and proving that the AI was not trained on the original source will be pretty hard, because FLOSS programs with compatible licenses can legally copy code from one project into the other.

                                      You’ll likely have to exclude all code from the project and all code that’s too similar from the training data. And then train an AI from scratch. Which would be extremely expensive.

                                      @SnoopJ @bgalehouse @lcamtuf

                                      arnebab@rollenspiel.socialA This user is from outside of this forum
                                      arnebab@rollenspiel.socialA This user is from outside of this forum
                                      arnebab@rollenspiel.social
                                      wrote sidst redigeret af
                                      #22

                                      @kevinr but I expect that someone will come in and say "my prompt includes 'forget all code from <project>', so the AI does not know it".

                                      … OK, I have to admit that I lost trust into the sanity of a part of humanity …

                                      @SnoopJ @bgalehouse @lcamtuf

                                      1 Reply Last reply
                                      0
                                      • kevinr@masto.free-dissociation.comK kevinr@masto.free-dissociation.com

                                        @hopeless @lcamtuf no, you're just reading an educated asshole who happens to be right

                                        hopeless@mas.toH This user is from outside of this forum
                                        hopeless@mas.toH This user is from outside of this forum
                                        hopeless@mas.to
                                        wrote sidst redigeret af
                                        #23

                                        @kevinr @lcamtuf Well, from a reader's perspective, it reads like you are still trying to convince yourself you're right.

                                        This is usually a bad sign if you're trying to convince anyone else you're right.

                                        The problem is in what sense is it "derivative" if the original content is neither known or referenced? We are talking about copyright alone here and your choice of phrase "derivative work".

                                        1 Reply Last reply
                                        0
                                        • lcamtuf@infosec.exchangeL lcamtuf@infosec.exchange

                                          If you ask AI to rewrite the entirety of an open-source program, do you still need to abide by the original license? In philosophy, this problem is known as the Slop of Theseus

                                          noortjevee@mstdn.socialN This user is from outside of this forum
                                          noortjevee@mstdn.socialN This user is from outside of this forum
                                          noortjevee@mstdn.social
                                          wrote sidst redigeret af
                                          #24

                                          @lcamtuf shakes my fist at theseus

                                          1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper