• 0 Posts
  • 8 Comments
Joined 7 months ago
Cake day: June 4th, 2024



  • It’d be interesting to see how much this changes if you were to restrict the training dataset to books written in the last twenty years; I suspect the model would be a lot less negative. Older books tend to include material that does not fit with modern ideals, and it’d be a real struggle to avoid this if such texts are used for training.

    For example, I was recently reading a couple of the sequels to The Thirty-Nine Steps (written during WW1), and they include multiple instances that really date them to an earlier era, with the main character casually throwing out jarringly racist stuff about black South Africans, Germans, the Irish, and basically anyone else who wasn’t properly English. Train an AI on that and you’re introducing the chance for problematic output - and chances are most LLMs have been trained on this series, since the books are now public domain and easily available.


  • The watermark is noticeably more readable in the Facebook image I linked though, and it does say photography (even there it is somewhat blurred, so assuming it was actually clear in the original source, that copy is a few recompressions along the chain).

    The dates of the other sources, however, are what really convince me it’s not AI. After all, who was doing good quality photorealistic AI image generation in 2021?



  • Seems legit enough to me. The next rack of tomatoes would only be ~2m away after all, given the gaps between rows aren’t going to be massive. Pretty sure the sharpness issues are primarily data loss from repeated JPEG recompression - you can find a better quality version of the image by searching ‘carmine spina tomatoes’, which both looks less compressed in the background and dates from at least 2022 (so before mass popularity of AI generation).