• AceFuzzLord@lemmy.zip
    link
    fedilink
    arrow-up
    4
    ·
    8 hours ago

    Based on what I read/saw and got out of it, I am real disappointed it looks like it’s gonna be using genAI instead of another form of AI we’ve been using for transcription. Otherwise, sounds like trying to be a modern genAI version of that speech to text software I’d see ads for on TV. Possibly good for accessibility, but I’ll wait and see after it comes out.

    At least they claim the whole thing to be done offline after model installation and it’s allegedly sandboxed with the audio data being stored in a memory buffer that allegedly will be erased after the session. So I’ll have to wait and see how this all plays out before making more judgments on it.

  • chronicledmonocle@lemmy.world
    link
    fedilink
    arrow-up
    12
    arrow-down
    3
    ·
    16 hours ago

    Thanks Canonical…I’ll just throw it in the pile with all the other “wonderful” things you’ve made. It can go on the shelf next to Mir.

  • chgxvjh [he/him, comrade/them]@hexbear.net
    link
    fedilink
    English
    arrow-up
    19
    ·
    22 hours ago

    shrug-outta-hecks maybe it will lead to better accessibility tooling. This is obviously rather silly as a default mode of operations, especially if you imagine people in a crowded office all yelling at their computers.

  • southsamurai@sh.itjust.works
    link
    fedilink
    arrow-up
    55
    arrow-down
    2
    ·
    1 day ago

    Being real, this is why I fucking hate the bullshit, corporate greed hype of LLMs and generative software. All the “bubble” shit? It tars all versions of the technology with the same brush.

    This? This is exactly what it should be used for. And, ffs, earlier speech to text was really the same fucking thing in essence. Software that took input in the form of voice, compared it to a set of data, and made a best guess at what you meant. Yeah, the details are different, but it’s the same concept.

    This? This is fucking awesome. Locally run, and doing a job that’s vital in accessibility, with the side benefit of being useful to others. Assuming canonical is being honest anyway.

    But this kind of thing should be the way things are done.

      • P03 Locke@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        edit-2
        7 hours ago

        Even if we called them LLMs, which we should, people will keep having these negative connotations to the technology because of overmarketing. This feature is still using LLMs, and that’s not supposed to be a bad thing.

        Stop blaming the technology, and blame the corpos pushing it to the moon. This is “BitTorrent is only used for piracy” bullshit all over again.

      • mfed1122@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        2
        ·
        15 hours ago

        But it is AI, so it should be called that. People should adjust their simplistic notions about the term instead

        • southsamurai@sh.itjust.works
          link
          fedilink
          arrow-up
          1
          arrow-down
          1
          ·
          9 hours ago

          Nah, it isn’t. Intelligence implies independence. What it is is a fancy algorithm with a big data set.

          It doesn’t have to be general ai to be called ai, but so far none of the models I’m aware of have reached a standard to be called intelligence in the colloquial sense for sure

          • mfed1122@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            2
            ·
            8 hours ago

            The term used in academic literature and the field itself for that kind of technology is and has been AI for at least ten years. Intelligence doesn’t imply independence anyways? And besides, even if it did, thats why an entire 50% of the term consists of the modifier “artificial”. So like, you’re right that it isn’t intelligence in the colloquial sense. It’s artificial intelligence in the technical and standardized sense, though. The use of term is pretty much totally undebatable. Just because people don’t like the term now doesn’t change that.

      • Voytrekk@sopuli.xyz
        link
        fedilink
        arrow-up
        2
        arrow-down
        1
        ·
        15 hours ago

        This would have been better received if they just didn’t use AI in the name. Sure, it’s just using an LLM under the hood, but it’s running purely locally. It also betters Linux since it helps address an accessibility issue.

        • atzanteol@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          5
          ·
          16 hours ago

          The “price” of a free offline speech to text AI model? Three of them, actually, to work with varying levels of compute resources available?

          You anti-AI folks are friggin’ ridiculous.

          • Cherry@piefed.social
            link
            fedilink
            English
            arrow-up
            4
            ·
            12 hours ago

            I don’t think it’s anti-AI more a lack of trust of services saying here’s a product…and the concern it will be used for ulterior motives. I know I don’t like my voice being captured.

            • atzanteol@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              3
              arrow-down
              2
              ·
              11 hours ago

              It’s captured when you press a button and only handled locally. This is exactly the sort of thing you want for accessibility. Not everybody can type or type well.

              • Cherry@piefed.social
                link
                fedilink
                English
                arrow-up
                2
                ·
                10 hours ago

                I agree. It’s a great use. I just can’t trust any tech company now. Even if they mean well abuses happen.

          • marxismtomorrow@lemmy.today
            link
            fedilink
            arrow-up
            3
            ·
            edit-2
            12 hours ago

            Nothing AI is free. Unless there’s a chain of custody for all of the training data, it’s still unethical even if it’s used for a good thing. If I build a wheelchair ramp out of the flesh and bones of orphans I’m still not a very good person. And there are non-AI ways to accomplish this that are just as good that would require almost comically less resources.

            This attitude is why Ubuntu, and only Ubuntu, recommends a minimum of 6 GB of Ram btw. You can run a full KDE system with onboard graphics and all the bells and whistles for less than 2 GB on other distros.

            • P03 Locke@lemmy.dbzer0.com
              link
              fedilink
              English
              arrow-up
              2
              arrow-down
              1
              ·
              7 hours ago

              If I build a wheelchair ramp out of the flesh and bones of orphans I’m still not a very good person.

              Comparing copyright, a bullshit concept that was designed only to protect the rich, to building with “the flesh and bones of orphans” is so extreme that it’s not even worth arguing about.

              • marxismtomorrow@lemmy.today
                link
                fedilink
                arrow-up
                1
                ·
                edit-2
                7 hours ago

                Being pro-AI is the same as being pro-capitalist, and thus pro-orphan crushing machine.

                Having your work stolen so that a soulless faceless corporate entity can charge teenagers to recreate it at will and pretend they have any talent or capability whatsoever is pretty shitty, and defending it is anti-proletariat.

            • atzanteol@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              2
              ·
              edit-2
              11 hours ago

              Nothing AI is free. Unless there’s a chain of custody for all of the training data, it’s still unethical even if it’s used for a good thing.

              This is the weirdest sort of AI bullshit I keep coming across.

              And there are non-AI ways to accomplish this that are just as good that would require almost comically less resources.

              And… Where are they?

              • marxismtomorrow@lemmy.today
                link
                fedilink
                arrow-up
                1
                ·
                10 hours ago

                This is the weirdest sort of AI bullshit I keep coming across.

                Hi this must be your first time on Earth in the last decade, every single AI company has been in or is currently in no less than ten dozen lawsuits over copyright infringement. It’s so bad there’s at least one website purpose built to track copyright infringement from AI companies..

                Without a specific chain of custody for every piece of training data going into the models, there is a default that the model cannot be trusted and is likely infringing on someone’s copyright.

                To specify the 'nothing AI is free" part, LLMs are grossly computationally inefficient. Whether it’s local or not.

                And… Where are they?

                Already installed on most distros.

                • atzanteol@sh.itjust.works
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  arrow-down
                  1
                  ·
                  8 hours ago

                  So where are the winning lawsuits of all that copyright infringement then?

                  Already installed on most distros.

                  And they are called… ?

              • Somecall_metim@lemmy.dbzer0.com
                link
                fedilink
                English
                arrow-up
                2
                ·
                9 hours ago

                You want “reason”? AI is a jumped up chatbot with delusions of grandeur that jacks up energy prices, consumes a stupid amount of scare water and throttles the supply of computer parts just to tell people there are 5 Rs in strawberry and that they should kill themselves. Not to mention the outright theft of the works of authors, designers, and artists.

                FUCK. AI.

                And fuck you for comparing me to those shit dribbling twats in MAGA.

          • middlemanSI@lemmy.world
            link
            fedilink
            arrow-up
            1
            arrow-down
            1
            ·
            12 hours ago

            I admit I don’t know the details, but the title makes it seem like there is a “product” there, by a “company”, probably in it for the profit. And since there is a huge problem with datacenters as it is, why would we encourage more? Most of you AI enthousiasts are blindly walking us into a pit of regret.

            • atzanteol@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              2
              ·
              11 hours ago

              I admit I don’t know the details, but the title makes it seem like there is a “product” there, by a “company”, probably in it for the profit.

              You don’t know more than the details - you don’t know anything about it.

              And since there is a huge problem with datacenters as it is, why would we encourage more? Most of you AI enthousiasts are blindly walking us into a pit of regret.

              Guess you’ll want to research what “offline” means. I doubt you have any idea what any problems with datacenters are either given your… we’ll call it “knowledge” of the situation.

  • Avid Amoeba@lemmy.ca
    link
    fedilink
    arrow-up
    77
    ·
    1 day ago

    Offline-only speech-to-text, integrated with the desktop for push-to-talk voice typing? That’s the kind of AI that I’d like to see. Actually add features that can help people without harming their rights. I’m still moving new machines to Debian but this is nice.

    • Taasz/Woof@piefed.social
      link
      fedilink
      English
      arrow-up
      9
      ·
      17 hours ago

      Also how is speech to text AI? It has existed for decades, obviously a lot better now but I don’t think I’d consider it “AI”

      • Avid Amoeba@lemmy.ca
        link
        fedilink
        arrow-up
        15
        ·
        edit-2
        17 hours ago

        There’s been ML and non-ML ways of doing STT over the years. as far as I recall. The current best implementations are ML-based. In coloquial terms ML algorithms are AI. We used to call them AI in the 2010s, before AI was (un)cool.

      • ComradePenguin@lemmy.ml
        link
        fedilink
        arrow-up
        2
        arrow-down
        2
        ·
        15 hours ago

        I have been using a speech-to-text AI system the last day, and I’m using a whisper large 3 turbo and a rewording model that fixes the sentence but doesn’t rewrite it, and it’s almost perfect. I’m using European-hosted AI through cortecs.ai, and it’s really cheap.

  • thingsiplay@lemmy.ml
    link
    fedilink
    arrow-up
    20
    ·
    1 day ago

    The framework split things into two groups, implicit AI that quietly improves what you already use and explicit AI that are features you’d actually summon on purpose.

    The very first paragraph already upsets me. Have in mind, I would criticize this on every other operating system too. I believe no one should use Ai tools that act autonomously in the background, to improve or change what you already use. It should always be a “summon on purpose”.

  • aim_at_me@lemmy.nz
    link
    fedilink
    English
    arrow-up
    25
    arrow-down
    1
    ·
    edit-2
    12 hours ago

    Bit of a click baity title lol. If there’s one good use of AI, its probably accessibility.

    • Handles@leminal.space
      link
      fedilink
      English
      arrow-up
      3
      ·
      22 hours ago

      Yeah, “Ubuntu wants you to use their new feature” is… unsurprising. Explaining the benefits and purpose of that feature? Now you’re talking.

  • Mordikan@kbin.earth
    link
    fedilink
    arrow-up
    33
    ·
    1 day ago

    Implicit optional features to use local LLMs for STT is something that I think most reasonable people could get behind. Too many accessibility tools for the disabled sit behind paywalls and subscription models.

  • Fizz@lemmy.nz
    link
    fedilink
    arrow-up
    10
    ·
    1 day ago

    My grandma used to hate tech until she learned to use the voice assistance on her mobile phone. It unlocked the phone for her because she doesnt have the dexterity to type. I hope one day this tool could get to that same point.

  • Cris_Citrus@piefed.zip
    link
    fedilink
    English
    arrow-up
    4
    ·
    24 hours ago

    Out of curiosity would it be accurate to call this sort of technology generative ai, or just machine learning? Or it depends on the implementation?

    I feel like most of the anger around ai is because gen ai has a bunch of harmful baggage, and I’m curious if this is an example of gen ai having a productive use case, or an example of ai being more useful outside of gen ai specifically

    • unwarlikeExtortion@lemmy.ml
      link
      fedilink
      arrow-up
      17
      ·
      1 day ago

      Come on.

      Offline-only is privacy-respecting. Accessibility is a noble goal.

      All in all, if there’s an AI usecase that’s as morally acceptable as it gets, it’s this one.

      I get that it’s Ubuntu of all people, but even Big Tech produces some ideas every now and then that FOSS lovers can get behind and democratize!