The sperm whale 'phonetic alphabet' revealed by AI - eviltoast

Sperm whales communicate with each other using rhythmic sequences of clicks, called codas. It was previously thought that sperm whales had just 21 coda types. However, after studying almost 9,000 recordings, the Ceti researchers identified 156 distinct codas. They also noticed the basic building blocks of these codas which they describe as a “sperm whale phonetic alphabet” – much like phonemes, the units of sound in human language which combine to form words.

Pratyusha Sharma, a PhD student at MIT and lead author of the study, describes the “fine-grain changes” in vocalisations the AI identified. Each coda consists of between three and 40 rapid-fire clicks. The sperm whales were found to vary the overall speed, or the “tempo”, of the codas, as well as to speed up and slow down during the delivery of a coda, in other words, making it “rubato”. Sometimes they added an extra click at the end of a coda, akin, says Sharma, to “ornamentation” in music. These subtle variations, she says, suggest sperm whale vocalisations could carry a much richer amount of information than previously thought.

    • 👍Maximum Derek👍@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      37
      ·
      4 months ago

      I had a marine mammal prof in college who was fiercely dull unless you got him talking about whale hunting techniques. He would get particularly animated when describing how a Sperm Whale click carries enough force at depth that it make luminescent squids “light up like ‘eat me’ signs.”

      He also did a fun impression of a Grey Whale scooping mud off the sea floor. Which was mainly memorable because it was like watching Niles Crane pretend to be a backhoe.

      • Flying Squid@lemmy.worldM
        link
        fedilink
        English
        arrow-up
        14
        ·
        4 months ago

        He also did a fun impression of a Grey Whale scooping mud off the sea floor.

        Was he voiced by Ellen DeGeneres?

      • casmael@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        4 months ago

        Staring the rock as jacked squid cop who’s also a professional wrestler

  • Todd Bonzalez@lemm.ee
    link
    fedilink
    English
    arrow-up
    77
    arrow-down
    1
    ·
    4 months ago

    This is the kind of shit I want to see “AI” (machine learning) used for.

    We know that whales have a language. We know they can communicate complex ideas. We’ve never been able to figure it out.

    This could be the beginning of “first contact” with another intelligent species. We’re about to learn how much they fucking hate us. We’ll deserve every bit of what they have to say about us.

  • Arkouda@lemmy.ca
    link
    fedilink
    English
    arrow-up
    65
    arrow-down
    2
    ·
    4 months ago

    I have a feeling if we ever get to speaking with whales they are going to have a lot of words for us.

  • Zaktor@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    56
    arrow-down
    1
    ·
    4 months ago

    I know it’s a general consumption news story, but I wish they’d have at least included a name or basic description of what the “AI” used was. “AI” is about as blandly unspecific as you can get even if it’s only being applied to algorithms that fall into that broad category, let alone how commonly it’s misused nowadays.

    • Kbobabob@lemmy.world
      link
      fedilink
      English
      arrow-up
      22
      arrow-down
      1
      ·
      4 months ago

      It bothered me too so I clicked on the highlighted link in the article and in that there are these tidbits.

      The data collected has been processed using machine-learning algorithms to detect and classify clicks, with results due to be published in 2024.

      And

      More than 5,000 miles (8,000km) away, a group of artificial intelligence and natural language processing experts, cryptographers, linguists, marine biologists, robotic experts and underwater acousticians are also hoping to use AI – this time to decipher sperm whale conversation.

      Emphasis mine

    • PM_Your_Nudes_Please@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      ·
      4 months ago

      Yeah, it’s so commonly misused that I tend to just equate it to some sort of machine learning algorithm. Which is really just some complicated statistics and a database in a trenchcoat.

      • Zaktor@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        7
        ·
        4 months ago

        ML is in the category of AI, IMO. This was just scientific data analysis though, dressed up with the buzzword. It’s the right tool for the job and looks like good work, but not AI. Which is fine. I don’t get why people feel the need to call “doing something with data on a computer” AI.

      • Zaktor@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        10
        ·
        4 months ago

        It does! This isn’t anything that I’d call AI. It’s cool work, but it was just regular old scientific analysis and data visualization. Buzzwords strike again.

        • girlfreddy@lemmy.caOP
          link
          fedilink
          English
          arrow-up
          5
          ·
          4 months ago

          I’d be blaming the BBC and journalists/editors for that mess then.

          Glad you got the info you wanted tho. 👍

          • Zaktor@sopuli.xyz
            link
            fedilink
            English
            arrow-up
            3
            ·
            4 months ago

            Someone found a little AI in one of the code cells. Pretty low-level stuff, but it was used a little bit.

    • Todd Bonzalez@lemm.ee
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      1
      ·
      4 months ago

      This almost certainly means using traditional machine learning algorithms against a massive corpus of whale vocalizations and descriptions.

    • Dkarma@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      4 months ago

      That’s the thing, dude, when you write software…it doesn’t have a name. They probably started with a framework like tensor flow and then stacked audio analysis ml modules on top figuratively, but this was probably mostly written in house it’s not a commercial product.

      • Zaktor@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        4
        ·
        edit-2
        4 months ago

        I’m in “artificial intelligence” and write software. I’m not looking for a product name. If they developed an entirely new technique that’d also be cool to mention, but they probably built on existing techniques and were at least working in some broad form of AI.

        Edit: After looking at the paper, it’s the “misused” category. They were doing regular scientific data analysis.

        • Match!!@pawb.social
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          4 months ago

          The github shows them doing some k nearest neighbors and kernel estimation as part of their understanding of the coda (I think? I don’t know anything about whales)

          • Zaktor@sopuli.xyz
            link
            fedilink
            English
            arrow-up
            2
            ·
            4 months ago

            The KMeans is actually an unused import, but yeah, I see the kernel estimation. They also use a gaussian mixture in one of the lower cells. So a little AI.

  • collapse_already@lemmy.ml
    link
    fedilink
    English
    arrow-up
    15
    ·
    4 months ago

    When we get to the point of communicating with them, will I get in trouble for telling them to sink yachts?

  • AmidFuror@fedia.io
    link
    fedilink
    arrow-up
    13
    arrow-down
    2
    ·
    4 months ago

    There’s a later report where they’ve deciphered one of the oft repeated sperm whale sentences: “So long, but why no fish?”

    • Sabin10@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      ·
      4 months ago

      When I was a child in the late 80s, I saw the occasional news story about researchers trying to crack the code in whale communication. Crazy that we might get there.

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    3
    ·
    4 months ago

    This is the best summary I could come up with:


    Now, from elephants to dogs, modern technology is helping researchers to sift through enormous datasets, and uncover previously unknown diversity and complexity in animal communication.

    They also noticed the basic building blocks of these codas which they describe as a “sperm whale phonetic alphabet” – much like phonemes, the units of sound in human language which combine to form words.

    Pratyusha Sharma, a PhD student at MIT and lead author of the study, describes the “fine-grain changes” in vocalisations the AI identified.

    The existence of a combinatorial coding system, write the report authors, is a prerequisite for “duality of patterning” – a linguistic phenomenon thought to be unique to human language – in which meaningless elements combine to form meaningful words.

    Categorised as “vulnerable” by the International Union for Conservation of Nature (IUCN), sperm whales are still recovering from commercial hunting by humans in the 19th and 20th Centuries.

    – If you liked this story, sign up for The Essential List newsletter – a handpicked selection of features, videos and can’t-miss news, delivered to your inbox twice a week.


    The original article contains 1,360 words, the summary contains 179 words. Saved 87%. I’m a bot and I’m open source!