Generative artificial intelligence (GenAI) company Anthropic has claimed to a US court that using copyrighted content in large language model (LLM) training data counts as “fair use”.

Under US law, “fair use” permits the limited use of copyrighted material without permission, for purposes such as criticism, news reporting, teaching, and research.

In October 2023, a host of music publishers including Concord, Universal Music Group and ABKCO initiated legal action against the Amazon- and Google-backed generative AI firm Anthropic, demanding potentially millions in damages for the allegedly “systematic and widespread infringement of their copyrighted song lyrics”.

  • Lvxferre@mander.xyz
    11 months ago

    My point is that the current approach - statistical association - is so crude that it’ll probably get ditched in the near future anyway, with or without licensing matters. And those better models (which won’t be LLMs or diffusion-based) will probably skip this issue altogether.

    The comparison with 4-year-olds is there mostly to highlight how crude it is. I also don’t think it’s viable to “train” models in the same way as we’d train a human being.