LOOK MAA I AM ON FRONT PAGE

  • 0ops@lemm.ee
    5 hours ago

    Is “model” not defined as architecture + weights? Those models certainly don’t share the same architecture. I might just be confused about your point, though.

    • Communist@lemmy.frozeninferno.xyz
      5 hours ago

      It is, but this did not prove all architectures cannot reason, nor did it prove that all sets of weights cannot reason.

      Essentially, they did not prove the issue is fundamental. And the models have pretty similar architectures: they’re all transformers trained in a similar way. I would not say they have different architectures.