Sperm whales communicate with each other using rhythmic sequences of clicks, called codas. It was previously thought that sperm whales had just 21 coda types. However, after studying almost 9,000 recordings, the Ceti researchers identified 156 distinct codas. They also noticed the basic building blocks of these codas which they describe as a “sperm whale phonetic alphabet” – much like phonemes, the units of sound in human language which combine to form words.

Pratyusha Sharma, a PhD student at MIT and lead author of the study, describes the “fine-grain changes” in vocalisations the AI identified. Each coda consists of between three and 40 rapid-fire clicks. The sperm whales were found to vary the overall speed, or the “tempo”, of the codas, as well as to speed up and slow down during the delivery of a coda, in other words, making it “rubato”. Sometimes they added an extra click at the end of a coda, akin, says Sharma, to “ornamentation” in music. These subtle variations, she says, suggest sperm whale vocalisations could carry a much richer amount of information than previously thought.

  • Dkarma@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    5 months ago

    That’s the thing, dude, when you write software…it doesn’t have a name. They probably started with a framework like tensor flow and then stacked audio analysis ml modules on top figuratively, but this was probably mostly written in house it’s not a commercial product.

    • Zaktor@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      5 months ago

      I’m in “artificial intelligence” and write software. I’m not looking for a product name. If they developed an entirely new technique that’d also be cool to mention, but they probably built on existing techniques and were at least working in some broad form of AI.

      Edit: After looking at the paper, it’s the “misused” category. They were doing regular scientific data analysis.

      • Match!!@pawb.social
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        5 months ago

        The github shows them doing some k nearest neighbors and kernel estimation as part of their understanding of the coda (I think? I don’t know anything about whales)

        • Zaktor@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          2
          ·
          5 months ago

          The KMeans is actually an unused import, but yeah, I see the kernel estimation. They also use a gaussian mixture in one of the lower cells. So a little AI.