image description

An infographic titled “How To Write Alt Text” featuring a photo of a capybara. Parts of alt text are divided by color, including “identify who”, “expression”, “description”, “colour”, and “interesting features”. The finished description reads “A capybara looking relaxed in a hot spa. Yellow yuzu fruits are floating in the water, and one is balanced on the top of the capybara’s head.”

via https://www.perkins.org/resource/how-write-alt-text-and-image-descriptions-visually-impaired/

    • Daxtron2@startrek.website
      link
      fedilink
      arrow-up
      1
      ·
      2 months ago

      It’s essentially by-hand CLIP, that’s how the training data for CLIP came into being, it was descriptive text for images.

    • Pennomi@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 months ago

      It’s only useful if the AI was trained on similar prompts. A lot of the anime style ones work best with lists of tags, while the realistic ones work best with descriptions like above.

    • 9488fcea02a9@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      arrow-down
      1
      ·
      edit-2
      2 months ago

      Prompts are just the reverse of image recognition AI tagging stuff.

      Alt text is exactly the kind of tedious work that AI would be good at doing, but everyone in the fediverse seems to have a huge hate boner for ANYTHING AI…

      Fediverse: write a fucking essay every time you post an image… But make sure you waste time doing it manually, instead of using AI tools!!!

  • flashgnash@lemm.ee
    link
    fedilink
    arrow-up
    1
    ·
    2 months ago

    Is this not the kind of thing machine vision/language models would be really good at?

  • bjornsno@lemm.ee
    link
    fedilink
    arrow-up
    1
    ·
    2 months ago

    Ignorant question: isn’t alt text primarily for visually impaired people? If so, what is the point of including info about color?

    • Arthur Besse@lemmy.mlOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 months ago

      Color can provide useful context. For example, in the case of this image, imagine if in a thread about it there was some discussion of the ripeness of the yuzu fruit.

    • h3rm17@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      ·
      2 months ago

      You can also become visually impaired at points other than birth in life, and know colours and stuff