I’m kind of new to local AI and wondering what the move is here. Are they trying to pull off a Chrome/Android situation? Obviously I don’t trust any of these GAFAM giants, but I would be really interested in running a local LLM on my M1 Max (I briefly used DeepSeek last year). My use case would be mostly chat to help with academic and text-analysis tasks (don’t worry, I don’t blindly trust LLMs, I know what I’m doing), so recommendations are welcome.

  • hendrik@palaver.p3x.de · 10 points · edited · 19 days ago

    Probably because they’re playing the same game as Mark Zuckerberg, the Chinese labs, and to some degree OpenAI… They all release open-weights models.

    They generate some hype for the company that way, so it’s advertising. They build goodwill. They undercut the competition, or make it clear where they outperform it. Maybe they attract more investor money if they expand into the local-models market. I bet there are a million reasons why it makes sense from a business perspective.

  • all tomorrow's regrets@sh.itjust.works · 5 points · 19 days ago

    The same reason they made Android open source. To look good (“Hey, we aren’t like the other guys. We support open source!”) and to bring in outside ideas, input, and work, until they close off every avenue and it’s open in name only.

    • Yerbouti@sh.itjust.works (OP) · 3 points · 19 days ago

      Yeah, I’m thinking they want phone devs to start building their models into Android and create a situation where it becomes so integrated that it’s almost impossible to get rid of. Like they did with Chromium, where most browsers are now stuck with it and Google can change the rules whenever it wants.

  • PetteriPano@lemmy.world · 4 points · 18 days ago

    These Gemma models are small enough to run on your laptop or phone (quick sketch at the end of this comment). They’ll be bundling them with phones and Chromebooks anyway.

    Might as well get some goodwill and let the horses run free before someone extracts them from the edge device anyway.

    It also gives them some plausible deniability if they just ship them as tech previews. I’m thinking of when that talking kitty app starts feeding into your delusions and telling you to hurt people.
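
    For anyone who wants to actually try one: a minimal sketch of running a Gemma model locally, assuming you use Ollama (the model tag and prompt below are placeholder assumptions, and llama.cpp or LM Studio would work just as well):

    ```python
    # Minimal local chat with a Gemma model via the Ollama Python client.
    # Assumes: `pip install ollama`, the Ollama daemon running, and a model
    # already pulled with `ollama pull gemma3` (the "gemma3" tag is an
    # assumption; substitute whatever model you actually pulled).
    import ollama

    response = ollama.chat(
        model="gemma3",
        messages=[{"role": "user", "content": "Summarize the argument of this paragraph: ..."}],
    )
    print(response["message"]["content"])  # the model's reply text
    ```

    On an M1 Max the smaller Gemma variants should run comfortably; which size fits depends on how much RAM you have.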

    • altphoto@lemmy.today · 2 points · 10 days ago

      Prophetic, since a week later people caught Google quietly pushing AI into Google Chrome. Hexavalent chrome is banned, so why can’t Chrome be banned?

      • PetteriPano@lemmy.world · 2 points · 10 days ago

        I’ll further prophesy that we’ll start seeing mixed on-device and cloud calling. Cloud for the heavy thinking is probably in the works right now.

        Next week your local tiny Gemma 4 will be feeding the cloud models with predicted tokens (speculative decoding) to speed things up and reduce work for Gemini. It only has to get it right about 66% of the time for a 2x speed-up.
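
        Back-of-envelope on that 66% figure, purely as a toy model (independent per-token acceptance, a 2-token draft block, and a free draft model are all assumptions I’m making here):

        ```python
        # Toy simulation of speculative-decoding throughput.
        # A tiny draft model proposes `draft_block` tokens; the big model
        # verifies them in one forward pass. The first rejection truncates
        # the block, and the verification pass always yields one token of
        # its own. Draft-model cost is treated as free.
        import random

        def tokens_per_big_pass(accept_prob: float, draft_block: int = 2,
                                trials: int = 100_000) -> float:
            total = 0
            for _ in range(trials):
                accepted = 0
                for _ in range(draft_block):
                    if random.random() < accept_prob:
                        accepted += 1
                    else:
                        break
                total += accepted + 1  # +1: the big model emits a token either way
            return total / trials

        print(f"{tokens_per_big_pass(0.66):.2f}x")  # ~2.1x at 66% acceptance
        ```

        So under these idealized assumptions, roughly two-thirds acceptance does land around a 2x speed-up.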

  • robber@lemmy.ml · 3 points · 19 days ago

    A lot has been said, but to add to the list I’d say it gives them access to quite a large pool of free testers.

    LLM architectures and optimization techniques change rapidly, and by releasing open-weight models they get lots of enthusiasts who will evaluate new models for free, help implement support in inference engines, catch bugs, etc. (and in turn, ofc, get a new model to run for free, so it’s at least somewhat symbiotic).

    We saw this quite clearly when Alibaba released Qwen3-Next: a somewhat undertrained but still useful model that introduced the architecture their latest models (including their paid “Max” models) now use “in production”.

    • Yerbouti@sh.itjust.works (OP) · 1 point · 19 days ago

      “co-opt em, set the de-facto standards and own the ‘attention’”

      That’s what I’m suspecting too. They’re trying to pull a Chrome-like move and become some sort of standard, so devs eventually get stuck with their tech and whatever bullshit “Manifest” update they release.

      • Jakeroxs@sh.itjust.works · 1 point · 19 days ago

        While it’s not impossible, there are a lot of variables when it comes to LLMs, and I don’t really see how they’d pull that off, especially since their models aren’t particularly better than the competition.

        • Yerbouti@sh.itjust.works (OP) · 1 point · 19 days ago

          I agree. I’m thinking they’re trying anything and hoping it sticks, because clearly they’re losing the AI race. So offering models that can run locally was their best bet. And DeepSeek might have just fucked their shit up a little more with V4.

  • Sims@lemmy.ml · 3 points · 19 days ago

    Random suspicion: it’s correlated with how much compute the US has. That’s the only reason I can think of. I also wondered why they would do that when US tech is grabbing the global hardware supply. They’d like to keep control of inference in the cloud, but they’re short on power and therefore short on compute. But they also benefit from a Western AI ecosystem that has much more compute available. Pure guess, but maybe something like that?

  • slacktoid@lemmy.ml · 2 points · 19 days ago

    ’Cause no one really cares about them, and Google doesn’t want to fade into obscurity in this AI race.

  • Shimitar@downonthestreet.eu · +4/−12 · 19 days ago

    “I know what I am doing” has also been heard from the overseers of that nuclear power plant; what was it called? Ah, Chernobyl…