ETH Zurich and EPFL will release a large language model (LLM) developed on public infrastructure. Trained on the “Alps” supercomputer at the Swiss National Supercomputing Centre (CSCS), the new LLM marks a milestone in open-source AI and multilingual excellence.

  • In late summer 2025, a publicly developed large language model (LLM) will be released — co-created by researchers at EPFL, ETH Zurich, and the Swiss National Supercomputing Centre (CSCS).
  • This LLM will be fully open: This openness is designed to support broad adoption and foster innovation across science, society, and industry.
  • A defining feature of the model is its multilingual fluency in over 1,000 languages.
  • cabbage@piefed.social
    link
    fedilink
    English
    arrow-up
    3
    ·
    10 hours ago

    Usually when I see this it’s using other machine learning approaches than LLM, and the researchers behind it are usually very careful not to use the term AI, as they are fully aware that this is not what they are doing.

    There’s huge potential in machine learning, but LLMs are very little more than bullshit generators, and generative AI is theft producing soulless garbage. LLMs are widely employed because they look impressive, but for anything that requires substance machine learning methods that have been around for years tend to perform better.

    If you can identify cancer in x-rays using machine learning that’s awesome, but that’s very seperate from the AI hype machine that is currently running wild.

    • ☂️-@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      6 hours ago

      to be fair, the LLMs they use for chatbots and stolen pics generator are not AI either.