The corporations already have all the data, users literally gave it to them by uploading it. Open source only has scrapped data. If you start regulating, you kill open source but the big players will literally just shrug it off.
Traditional artists already lost. It sucks but now we get to find out if the winner is all of society or only just Adobe and Shutterstock.
I’m not sure what you mean. FOSS generative image models are already better than the corpo paid for ones, and it isn’t even close. They’re more flexible and have way more features and tools than what you can get out of a discord bot or cloud computing subscription.
Thanks for sharing your thoughts. I think I can see why your side, but correct me if I misunderstood.
I don’t think we need primary ownership here. This site doesn’t have primary ownership in the social media market, yet it benefits users. Having our own spaces and tools is always something worth fighting for. Implying we need “primary ownership” is a straw man and emotional language like “massively”, “ridiculous”, and “left holding the bag” can harm this conversation. This is a false dilemma between two extremes: either individuals have primary ownership, or we have no control.
You also downplay the work of the vibrant community of researchers, developers, activists, and artists who are working on FOSS software and models for anyone to use. It isn’t individuals merely participating, it’s a worldwide network working for the public, often times leading research and development, for free.
One thing I’m certain of is that no that one can put a lid on this. What we can do is make it available, effective, and affordable to the public. Mega-corps will have their own models, no matter the cost. Just like the web, personal computers, and smartphones were made by big corporations or governments, we were the ones who turned them into something that enables social mobility, creativity, communication, and collaboration. It got to the point they tried jumping on ourtrends.
I feel two ways about it. Absolutely it is recorded in a retrieval system and doing some sort of complicated lookup. So derivative.
On the other hand, the whole idea of copyright or other so called IP except maybe trademarks and trade dress in the most limited way is perverse and should not exist. Not that I have a better idea.
Absolutely it is recorded in a retrieval system and doing some sort of complicated lookup.
It is not.
Stable Diffusion’s model was trained using the LAION-5B dataset, which describes five billion images. I have the resulting AI model on my hard drive right now, I use it with a local AI image generator. It’s about 5 GB in size. So unless StabilityAI has come up with a compression algorithm that’s able to fit an entire image into a single byte, there is no way that it’s possible for this process to be “doing some sort of complicated lookup” of the training data.
What’s actually happening is that the model is being taught high-level concepts through repeatedly showing it examples of those concepts.
I would disagree. It is just a big table lookup of sorts with some complicated interpolation/extrapolation algorithm. Training is recording the data into the net. Anything that comes out is derivative of the data that went in.
Yes, absolutely. They want AI to be people such that copyright applies and such that they can claim the AI was inspired just like a human artist is by the art they’re exposed to.
We need a license model such that AI is only allowed to be trained on content were the license explicitly permits it and that no mention is equal to it being disallowed.
We need a license model such that AI is only allowed to be trained on content were the license explicitly permits it and that no mention is equal to it being disallowed.
That is the default model behind copyright, which basically says that the only things people can use your copyrighted work for without a license are those which are determined to be “fair use”.
I don’t see any way in which today’s AI ought to be considered fair use of other people’s writings, artwork, etc.
deleted by creator
Actually in the age of basically permanent copyrigt, this brings at least some balance
deleted by creator
The corporations already have all the data, users literally gave it to them by uploading it. Open source only has scrapped data. If you start regulating, you kill open source but the big players will literally just shrug it off.
Traditional artists already lost. It sucks but now we get to find out if the winner is all of society or only just Adobe and Shutterstock.
Don’t ignore the plethora of FOSS models regular people can train and use. They want to trick you into thinking generative models are a game only for the big boys, while they form up to attempt regulatory capture to keep the small guy out. They know they’re not the only game in town, and they’re afraid we won’t need them anymore.
deleted by creator
I’m not sure what you mean. FOSS generative image models are already better than the corpo paid for ones, and it isn’t even close. They’re more flexible and have way more features and tools than what you can get out of a discord bot or cloud computing subscription.
deleted by creator
Thanks for sharing your thoughts. I think I can see why your side, but correct me if I misunderstood.
I don’t think we need primary ownership here. This site doesn’t have primary ownership in the social media market, yet it benefits users. Having our own spaces and tools is always something worth fighting for. Implying we need “primary ownership” is a straw man and emotional language like “massively”, “ridiculous”, and “left holding the bag” can harm this conversation. This is a false dilemma between two extremes: either individuals have primary ownership, or we have no control.
You also downplay the work of the vibrant community of researchers, developers, activists, and artists who are working on FOSS software and models for anyone to use. It isn’t individuals merely participating, it’s a worldwide network working for the public, often times leading research and development, for free.
One thing I’m certain of is that no that one can put a lid on this. What we can do is make it available, effective, and affordable to the public. Mega-corps will have their own models, no matter the cost. Just like the web, personal computers, and smartphones were made by big corporations or governments, we were the ones who turned them into something that enables social mobility, creativity, communication, and collaboration. It got to the point they tried jumping on our trends.
I feel two ways about it. Absolutely it is recorded in a retrieval system and doing some sort of complicated lookup. So derivative.
On the other hand, the whole idea of copyright or other so called IP except maybe trademarks and trade dress in the most limited way is perverse and should not exist. Not that I have a better idea.
It is not.
Stable Diffusion’s model was trained using the LAION-5B dataset, which describes five billion images. I have the resulting AI model on my hard drive right now, I use it with a local AI image generator. It’s about 5 GB in size. So unless StabilityAI has come up with a compression algorithm that’s able to fit an entire image into a single byte, there is no way that it’s possible for this process to be “doing some sort of complicated lookup” of the training data.
What’s actually happening is that the model is being taught high-level concepts through repeatedly showing it examples of those concepts.
I would disagree. It is just a big table lookup of sorts with some complicated interpolation/extrapolation algorithm. Training is recording the data into the net. Anything that comes out is derivative of the data that went in.
Yes, absolutely. They want AI to be people such that copyright applies and such that they can claim the AI was inspired just like a human artist is by the art they’re exposed to.
We need a license model such that AI is only allowed to be trained on content were the license explicitly permits it and that no mention is equal to it being disallowed.
That is the default model behind copyright, which basically says that the only things people can use your copyrighted work for without a license are those which are determined to be “fair use”.
I don’t see any way in which today’s AI ought to be considered fair use of other people’s writings, artwork, etc.