If something is in the public domain, there is no copyright. That’s what public domain means. Now, someone could try to place something into the public domain incorrectly that still has someone else’s copyright claim on it, but LLMs don’t do that (usually): a work created via an LLM is in the public domain. Nobody reserves any rights.
Because there are no rights reserved, there’s no copyright issues.
BUT that doesn’t mean that infringement hasn’t already been committed by the person who created the training set IF you stand by the argument that a training set has no right to include a work unless it’s in the public domain or permission has been granted by any rights holders.
That last bit I covered earlier; it is a philosophical stance people take, but it’s not the only one, and as of now it has no legal backing. Others claim fair use, which pre-empts any copyright claims. And remember, this is about creating the training set and NOT about generative works, which are in the public domain.
If something is in the public domain, there is no copyright. That’s what public domain means. Now, someone could try to place something into the public domain incorrectly that still has someone else’s copyright claim on it, but LLMs don’t do that (usually): a work created via an LLM is in the public domain. Nobody reserves any rights.
Because there are no rights reserved, there’s no copyright issues.
BUT that doesn’t mean that infringement hasn’t already been committed by the person who created the training set IF you stand by the argument that a training set has no right to include a work unless it’s in the public domain or permission has been granted by any rights holders.
That last bit I covered earlier; it is a philosophical stance people take, but it’s not the only one, and as of now it has no legal backing. Others claim fair use, which pre-empts any copyright claims. And remember, this is about creating the training set and NOT about generative works, which are in the public domain.