Using the {arrow} and {duckdb} packages to wrangle medical datasets that are Larger than RAM

R Consortium

6,692 views


Added 16 July 2024
From R/Medicine Conference 2022
Peter D.R. Higgins, MD, Ph.D., MSc, Director of the Inflammatory Bowel Disease (IBD) Program at the University of Michigan.
Deck: speakerdeck.com/higgi13425/bi...
Sections
0:00 Introduction
0:40 Starting point
1:09 The motivating problem
2:10 The data
3:08 Options
4:25 Lots to like about {data.table}
5:23 Data on disk vs data in RAM
6:37 How to wrangle bigger-than-RAM data in R?
8:15 Speed-wrangling
9:42 What about the bigger-than-RAM problem?
10:19 Let’s try it out
11:35 What if data are still bigger-than-RAM?
15:42 Back to the question…
16:19 There’s always that (more than) one guy
16:43 Take home points - speed
17:15 Take home points - bigger-than-RAM data
18:12 Closing
More Resources
Main Site: www.r-consortium.org/
News: www.r-consortium.org/news
Blog: www.r-consortium.org/news/blog
Join: www.r-consortium.org/about/join
Twitter: / rconsortium
LinkedIn: / r-consortium
Science & Technology

Comments • 14

@tmuffly1 3 months ago ⁺¹
This talk blew my mind. Thank you very much!
@tomfenn4 1 year ago ⁺⁶
Really useful presentation, and timely for me. Personally I find data.table statements are greatly improved with just a little whitespace.
@tdawry 2 months ago
A neat question to answer.
I'm using the duckplyr library and it's nice to not have to think about anything. It does make a strong argument for having a fast hard drive (an SSD is an order of magnitude faster than a traditional HDD, an M.2 drive is an order of magnitude faster than that, and modern NVMe drives are even faster).
@multitaskprueba1 2 months ago
You are a genius! Fantastic video! Thanks!
@musicspinner 1 year ago ⁺¹
Masterful deployment of the "Kobayashi Maru" reference. 🖖
@VictorOrdu 1 year ago ⁺²
Wow, thank you for this illuminating presentation.
@gueyenono 1 year ago ⁺²
Great presentation.
@higgi13425 1 year ago ⁺³
For further learning, here are the links from the next-to-last slide:
Arrow
cheatsheet: raw.githubusercontent.com/rstudio/cheatsheets/master/arrow.pdf
video intro: czcams.com/video/O42LUmJZPx0/video.html
full workshop from useR!: arrow-user2022.netlify.app
DuckDB
website: duckdb.org
R package: cran.r-project.org/web/packages/duckdb/index.html
data.table
website: rdatatable.gitlab.io/data.table
dtplyr (a data.table translator): dtplyr.tidyverse.org
@matthewson8917 1 year ago
Perfectly summarizes my big data journey. Really good!
@JohnoScott 1 year ago
Great talk. Concise and to the point.
@porlando12 1 year ago
Excellent presentation!
@torbjornstorli2880 6 months ago
Loved your presentation. Well done Sir!😊
@ZachRenwickData 1 year ago
great video and interesting analysis use case!
@arunabhbarua1924 6 days ago
How about just using duckdb and SQL?
