• 0 Posts
  • 3 Comments
Joined 1 year ago
cake
Cake day: June 26th, 2023

help-circle
  • Let’s remove the context of AI altogether.

    Say, for instance, you were to check out and read a book from a free public library. You then go on to use some of the book’s content as the basis of your opinions. More, you also absorb some of the common language structures used in that book and unwittingly use them on your own when you speak or write.

    Are you infringing on copyright by adopting the book’s views and using some of the sentence structures its author employed? At what point can we say that an author owns the language in their work? Who owns language, in general?

    Assuming that a GPT model cannot regurgitate verbatim the contents of its training dataset, how is copyright applicable to it?

    Edit: I also would imagine that if we were discussing an open source LLM instead of GPT-4 or GPT-3.5, sentiment here would be different. And more, I imagine that some of the ire here stems from a misunderstanding of how transformer models are trained and how they function.