Testing Frontier LLMs (GPT4) on ARC-AGI

Sdílet
Vložit
  • čas přidán 26. 06. 2024
  • Template: www.kaggle.com/code/gregkamra...
    arcprize.org/leaderboard
    arcprize.org/arc-agi-pub
    ARC Prize is a $1,000,000+ public competition to beat and open source a solution to the ARC-AGI benchmark.
    Hosted by Mike Knoop (Co-founder, Zapier) and François Chollet (Creator of ARC-AGI, Keras).
    --
    Website: arcprize.org/
    Twitter/X: / arcprize
    Newsletter: Signup @ arcprize.org/
    Discord: / discord
    Try your first ARC-AGI tasks: arcprize.org/play

Komentáře • 13

  • @jackq2331
    @jackq2331 Před 14 hodinami

    Excellent.

  • @MarkoTManninen
    @MarkoTManninen Před 8 dny +1

    I understand retries, but I am confuced with the two attempts. Do you always need to provide two? In which case they would have different data and both would be required for 100% correct prediction? I also missed the part in which the prediction and correct answers are matched and prounounced.

    • @ARCprize
      @ARCprize  Před 7 dny +3

      Sorry this isn't more clear on the video!
      You get two tried at each task. Old competitions had 3 tries. So you can basically give two attempts. If either are correct you pass the task.
      Under scoring methodology there is more information: arcprize.org/guide#submissions

  • @LimeTubeH
    @LimeTubeH Před 7 dny

    I'm confused...what are we supposed to attach with our API add-on secret?

    • @ARCprize
      @ARCprize  Před 6 dny

      What do you mean attach? That’s where you put your API key and then reference it in your code

  • @conformist
    @conformist Před 8 dny +6

    first.

    • @cyb3rvoid
      @cyb3rvoid Před 8 dny +2

      That was unreal!

    • @conformist
      @conformist Před 8 dny +2

      @@cyb3rvoid for my next magic trick, i will solve the agi price first

    • @wwkk4964
      @wwkk4964 Před 8 dny +4

      ​@@conformistsolve it backwards!

    • @filipgara3444
      @filipgara3444 Před 8 dny +2

      Ensure diversity in your model

  • @johnkintner
    @johnkintner Před dnem

    third since no one called it :kappa:

  • @aluphshahim5808
    @aluphshahim5808 Před 8 dny

    Second 😂