Modular Learning and Reasoning on ARC

  • Published 8 Sep 2024
  • Speakers: Dr. Andrzej Banburski and Simon Alford (Poggio Lab)
    Abstract: Current machine learning algorithms are highly specialized to whatever they are meant to do - e.g. playing chess, picking up objects, or recognizing objects. How can we extend this to a system that could solve a wide range of problems? We argue that this can be achieved by a modular system - one that can adapt to solving different problems by changing only the modules chosen and the order in which those modules are applied to the problem. The recently introduced ARC (Abstraction and Reasoning Corpus) dataset serves as an excellent test of abstract reasoning. Well suited to the modular approach, its tasks depend on a set of inbuilt human Core Knowledge priors. We implement these priors as the modules of a reasoning system and combine them using neural-guided program synthesis. We then discuss our ongoing efforts to extend execution-guided program synthesis to a bidirectional search algorithm via function inverse semantics.
  • Science & Technology
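The modular approach described in the abstract can be sketched in miniature. This is not the speakers' actual system (which builds on DreamCoder with neural guidance); it is a hedged illustration in which a few hypothetical grid primitives stand in for Core Knowledge priors, and a brute-force enumeration stands in for the learned search:

```python
# Minimal sketch of modular program search over grid primitives.
# The primitives (flip_h, flip_v, rot90) and the enumerative loop are
# illustrative stand-ins for the priors and neural-guided synthesis
# described in the talk.
from itertools import product

def flip_h(g):
    """Mirror a grid (a tuple of row-tuples) left-right."""
    return tuple(row[::-1] for row in g)

def flip_v(g):
    """Mirror a grid top-bottom."""
    return tuple(reversed(g))

def rot90(g):
    """Rotate a grid 90 degrees clockwise."""
    return tuple(zip(*reversed(g)))

MODULES = {"flip_h": flip_h, "flip_v": flip_v, "rot90": rot90}

def apply_seq(names, g):
    """Run a sequence of module names on a grid."""
    for n in names:
        g = MODULES[n](g)
    return g

def synthesize(examples, max_depth=3):
    """Return the first module sequence consistent with all (input, output) pairs."""
    for depth in range(1, max_depth + 1):
        for names in product(MODULES, repeat=depth):
            if all(apply_seq(names, i) == o for i, o in examples):
                return list(names)
    return None

# One training pair whose output is the left-right mirror of the input.
task = [(((1, 2), (3, 4)), ((2, 1), (4, 3)))]
print(synthesize(task))  # → ['flip_h']
```

The real system replaces the exhaustive `product` loop with a neural network that ranks which modules to try, which is what makes deeper programs tractable.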

Comments • 6

  • @sdmarlow3926
    @sdmarlow3926 2 months ago +2

    From the Q&A around the 22-minute mark: The tasks are designed to avoid brute-force methods, and don't require "world knowledge" or language as a prior. But more than testing for simple cognitive skills, the point is to have someone build a system that can "see" some new pattern and store it as a new ability. Definitions and benchmarks are not enough if your only goal is to meet that definition or score high on those benchmarks. There is no honor system when it comes to building "AGI", because everyone just takes shortcuts. A system that is actually dynamic, and can go from ARC to Atari 2600 games to playing Doom in the span of a week, would use much the same definitions and benchmarks as everyone else... but would be ACTUALLY different. Of course, saying it's an architecture problem implies all of ML/DL is on the wrong path, which many will take issue with. ;p

  • @brandomiranda6703
    @brandomiranda6703 2 years ago +1

    46:20 Current (or future) work after their initial DreamCoder baselining on ARC (Abstraction and Reasoning Corpus): execution-guided, bidirectional search for program synthesis - i.e., how to search for programs the way humans do?
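The bidirectional idea mentioned at 46:20 can be sketched as a meet-in-the-middle search: expand forward from the input grid, expand backward from the output grid by applying each primitive's inverse, and stop when the two frontiers meet. The primitives and depth limit here are illustrative assumptions, not the talk's actual implementation:

```python
# Hedged sketch of bidirectional program search via function inverses
# (in the spirit of the "inverse semantics" discussed in the talk).
# Requires Python 3.9+ for the dict merge operator `|`.

def flip_h(g):
    """Mirror a grid (tuple of row-tuples) left-right; its own inverse."""
    return tuple(row[::-1] for row in g)

def rot90(g):
    """Rotate 90 degrees clockwise."""
    return tuple(zip(*reversed(g)))

def rot270(g):
    """Rotate 90 degrees counter-clockwise (the inverse of rot90)."""
    return rot90(rot90(rot90(g)))

# Each primitive paired with its exact inverse.
INVERSES = {flip_h: flip_h, rot90: rot270, rot270: rot90}
PRIMS = {f.__name__: f for f in INVERSES}

def run(prog, g):
    """Apply a list of primitive names to a grid."""
    for name in prog:
        g = PRIMS[name](g)
    return g

def bidirectional(inp, out, half_depth=2):
    """Expand forward from `inp` and backward from `out` (via inverses);
    a program exists once the two frontiers share a grid."""
    fwd = {inp: []}   # grid -> program producing it from inp
    bwd = {out: []}   # grid -> program mapping it to out
    for _ in range(half_depth):
        fwd = {f(g): p + [f.__name__] for g, p in fwd.items() for f in INVERSES} | fwd
        bwd = {INVERSES[f](g): [f.__name__] + p for g, p in bwd.items() for f in INVERSES} | bwd
        for g in fwd.keys() & bwd.keys():
            return fwd[g] + bwd[g]
    return None

inp = ((1, 2), (3, 4))
out = ((1, 3), (2, 4))          # rot90 then flip_h of inp
print(bidirectional(inp, out))  # a valid 2-step program
```

Searching both directions means a depth-2k program only needs two depth-k frontiers, which is the exponential saving the execution-guided approach is after.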

  • @brandomiranda6703
    @brandomiranda6703 2 years ago +1

    Doesn't Francois C. have a definition of AGI (informed by cognitive priors) and construct his ARC benchmark based on it? Question based on the discussion at around 24:20.

  • @DavenH
    @DavenH 1 year ago

    26:26 "you can do about 80% of the tasks solved" in the hand-picked subsample of the training set, not the test set... It's not generalizing. Its performance on the test set is undisclosed and not state of the art.

  • @googleyoutubechannel8554
    @googleyoutubechannel8554 9 months ago +1

    ARC seems like a bunch of random tasks that are heavily biased toward humans interacting with a human-scale 3D environment. I can imagine a near-infinite array of other patterns that could form ARC tasks, but don't... because the researchers didn't include them, for no other reason than that the researchers are humans with eyeballs that take in information basically as a 2D array, and so are biased towards certain types of patterns. There doesn't seem to be any framework, even a rudimentary one, underpinning ARC tasks other than 'this particular researcher thought they were a good idea'? This is the first LLM benchmark I've looked into, and I have a sinking feeling the whole field is like this....
    *Example of one of a huge set of patterns these human researchers didn't pick, but which could be just as 'valid' as an ARC task (if you have no framework for validity, which they don't): a sequence of increasing numbers in binary that are Huffman-encoded.

    • @sdmarlow3926
      @sdmarlow3926 2 months ago

      The tasks are built around "simple" cognitive priors, such as counting, flipping or mirroring, and directionality (that lines and shapes extend in different directions). Across the hundreds of tasks there are only a handful of these priors (the point of ARC 2 is to have no single "operation" appear more than once across all the samples).