ETH Zurich and EPFL will release a large language model (LLM) developed on public infrastructure. Trained on the “Alps” supercomputer at the Swiss National Supercomputing Centre (CSCS), the new LLM marks a milestone in open-source AI and multilingual excellence.

  • In late summer 2025, a publicly developed large language model (LLM) will be released — co-created by researchers at EPFL, ETH Zurich, and the Swiss National Supercomputing Centre (CSCS).
  • This LLM will be fully open: This openness is designed to support broad adoption and foster innovation across science, society, and industry.
  • A defining feature of the model is its multilingual fluency in over 1,000 languages.
  • Plebcouncilman@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    2
    ·
    5 hours ago

    Honestly they are pretty good for research too. You can’t imagine the amount of obscure shit that my ChatGPT has surfaced when I bounce ideas on it. But yea it’s terrible in finished products, I think everyone knows that and in a year or two if they don’t improve I expect we will be back to shoving it behind the scenes as had been done before ChatGPT. It’s for the best.

    • thedruid@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      5 hours ago

      That’s not research. That’s simply surfacing tidbits it found on the net the happen to be true

      .I’ve asked many questions of many llms in my chosen areas of interest and modest expertise , seeking more than basic knowledge( which it often surprisingly lacks ) it always has at least one error. Often so subtle it goes on noticed until it’s too late.