Spring 2024 GRASP on Robotics: GRASP Faculty Panel, “AI Embodied in Robotics”

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Vision Transformer Quick Guide - Theory and Code in (almost) 15 min

Chocolate Banana! 🍌 This would make one epic banana split!! #amauryguichon #chocolate #banana

🍟Best French Fries Homemade #cooking #shorts

VYHRAJE IKON NEBO SÉGRA ? 🤔 #shorts

Spring 2024 GRASP Seminar Yutong Bai, Johns Hopkins University

GRASP Lab

zhlédnutí 743

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 11. 03. 2024
“Listening to the Data: Visual Learning from the Bottom Up”
ABSTRACT
We introduce a novel sequential modeling approach which enables learning a Large Vision Model (LVM) without making use of any linguistic data. To do this, we define a common format, “visual sentences”, in which we can represent raw images and videos as well as annotated data sources such as semantic segmentations and depth reconstructions without needing any meta-knowledge beyond the pixels. Once this wide variety of visual data (comprising 420 billion tokens) is represented as sequences, the model can be trained to minimize a cross-entropy loss for next token prediction. By training across various scales of model architecture and data diversity, we provide empirical evidence that our models scale effectively. Many different vision tasks can be solved by designing suitable visual prompts at test time.
PRESENTER
Yutong Bai is a 5th-year CS PhD student at Johns Hopkins University advised by Prof. Alan Yuille, and currently a visiting student at UC Berkeley advised by Prof. Alyosha Efros. She has interned at Meta AI (FAIR Labs) and Google Brain, and she is selected as a 2023 Apple Scholar and EECS Rising Star.
Věda a technologie

Komentáře •

Další v pořadí

Automatické přehrávání

Spring 2024 GRASP on Robotics: GRASP Faculty Panel, “AI Embodied in Robotics”

Spring 2024 GRASP on Robotics: GRASP Faculty Panel, “AI Embodied in Robotics”

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Vision Transformer Quick Guide - Theory and Code in (almost) 15 min

Vision Transformer Quick Guide - Theory and Code in (almost) 15 min

Chocolate Banana! 🍌 This would make one epic banana split!! #amauryguichon #chocolate #banana

Chocolate Banana! 🍌 This would make one epic banana split!! #amauryguichon #chocolate #banana

🍟Best French Fries Homemade #cooking #shorts

🍟Best French Fries Homemade #cooking #shorts

VYHRAJE IKON NEBO SÉGRA ? 🤔 #shorts

VYHRAJE IKON NEBO SÉGRA ? 🤔 #shorts

어른의 힘으로만 할 수 있는 버블티 마시는법

어른의 힘으로만 할 수 있는 버블티 마시는법

CSE 373 Finals Part 1

CSE 373 Finals Part 1

RAG with a Neo4j Knowledge Graph: How it Works and How to Set It Up

RAG with a Neo4j Knowledge Graph: How it Works and How to Set It Up

RAG from the Ground Up with Python and Ollama

RAG from the Ground Up with Python and Ollama

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

Life Lessons From 100-Year-Olds

Life Lessons From 100-Year-Olds

My journey to yo-yo mastery | BLACK

My journey to yo-yo mastery | BLACK

Steve Jobs Insult Response - Highest Quality

Steve Jobs Insult Response - Highest Quality

Boston Dynamics' amazing robots Atlas and Handle

Boston Dynamics' amazing robots Atlas and Handle

Supervised vs Unsupervised vs Reinforcement Learning | Machine Learning Tutorial | Simplilearn

Supervised vs Unsupervised vs Reinforcement Learning | Machine Learning Tutorial | Simplilearn

How To Unlock Your iphone With Your Voice

How To Unlock Your iphone With Your Voice

Power up all cell phones.

Power up all cell phones.

iPhone 15 Pro vs Samsung s24🤣 #shorts

iPhone 15 Pro vs Samsung s24🤣 #shorts

Разоблачение ручное зарядное устройство

Разоблачение ручное зарядное устройство

Building the ENDGAME invisible PC

Building the ENDGAME invisible PC

First repair of the day 📱

First repair of the day 📱

Google's secret algorithm exposed via leak to GitHub…

Google's secret algorithm exposed via leak to GitHub…

Mac | Found | Apple

Mac | Found | Apple