Holden Karau
Holden Karau
  • 619
  • 141 167

Video

Open Source Satellite Communication w/ Space Beaver - Now Live on Kickstarter
zhlédnutí 673Před rokem
Open Source Satellite Communication w/ Space Beaver - Now Live on Kickstarter
Opening the batteries for Space Beaver
zhlédnutí 796Před rokem
Opening the batteries for Space Beaver
Finishing my Kube install:installing MinIO & poking at rook/ceph
zhlédnutí 600Před 3 lety
Finishing my Kube install:installing MinIO & poking at rook/ceph
Impromptu: Exploring updating multi-cloud Kubeflow Workshop from 0.4.0 to 0.5.0 - Part 1
zhlédnutí 145Před 5 lety
Impromptu: Exploring updating multi-cloud Kubeflow Workshop from 0.4.0 to 0.5.0 - Part 1
Kubeflow End to End Cross Cloud ML Workshop Solution: IBM Partial
zhlédnutí 210Před 5 lety
Kubeflow End to End Cross Cloud ML Workshop Solution: IBM Partial
Kubeflow End to End Cross Cloud ML Workshop Solution Part 1 of 2
zhlédnutí 383Před 5 lety
Kubeflow End to End Cross Cloud ML Workshop Solution Part 1 of 2
Group @ApacheSpark code review with new reviewers :)
zhlédnutí 187Před 5 lety
Group @ApacheSpark code review with new reviewers :)
Jupyter Notebook with Apache Spark 2.4 on Kubernetes example with GKE - client mode
zhlédnutí 2,4KPřed 5 lety
Jupyter Notebook with Apache Spark 2.4 on Kubernetes example with GKE - client mode
Apache Spark 2.4 on Kubernetes example with GKE - client mode
zhlédnutí 2,5KPřed 5 lety
Apache Spark 2.4 on Kubernetes example with GKE - client mode
Apache Spark 2.4 on Kubernetes example with GKE - cluster mode
zhlédnutí 2,4KPřed 5 lety
Apache Spark 2.4 on Kubernetes example with GKE - cluster mode
Getting Started switching Apache Spark distinct() to External Append Only Map
zhlédnutí 176Před 5 lety
Getting Started switching Apache Spark distinct() to External Append Only Map
Improving distinct() to finish directly use the external append only map
zhlédnutí 69Před 5 lety
Improving distinct() to finish directly use the external append only map
Even more distinct() improvements - verifying shuffle removed
zhlédnutí 56Před 5 lety
Even more distinct() improvements - verifying shuffle removed
Data Cleaning in Apache Spark on the Apache Spark Mailing List Data
zhlédnutí 1KPřed 5 lety
Data Cleaning in Apache Spark on the Apache Spark Mailing List Data
Continued Apache Spark distinct() improvements PR update
zhlédnutí 101Před 5 lety
Continued Apache Spark distinct() improvements PR update
Understanding Spark tuning with autotuning or magical spells to stop your pager going off at 2am
zhlédnutí 611Před 5 lety
Understanding Spark tuning with autotuning or magical spells to stop your pager going off at 2am
A quick merge for Apache Spark rlimit k8s support
zhlédnutí 124Před 5 lety
A quick merge for Apache Spark rlimit k8s support
Powering Tensorflow with Big Data using Apache Beam, Flink and Spark
zhlédnutí 862Před 5 lety
Powering Tensorflow with Big Data using Apache Beam, Flink and Spark
Apache Spark Weekly Code Review - Mostly looked through the PySpark rLimit PR again
zhlédnutí 122Před 5 lety
Apache Spark Weekly Code Review - Mostly looked through the PySpark rLimit PR again
Playing well together: Big data beyond the JVM with Spark and friends - Strata SJ 2018
zhlédnutí 206Před 5 lety
Playing well together: Big data beyond the JVM with Spark and friends - Strata SJ 2018
2018 SF Pride Sparkling Pink Pandas riding behind/with the Dykes on Bikes on a 360 Camera
zhlédnutí 108Před 6 lety
2018 SF Pride Sparkling Pink Pandas riding behind/with the Dykes on Bikes on a 360 Camera
2018 SF Trans March Motorcycle Group 360 Video
zhlédnutí 120Před 6 lety
2018 SF Trans March Motorcycle Group 360 Video
360 Start of SF Dyke March 2018 Motocycles/Scooters
zhlédnutí 98Před 6 lety
360 Start of SF Dyke March 2018 Motocycles/Scooters
2018 SF DykeMarch WheelCam - First 40 minutes
zhlédnutí 47Před 6 lety
2018 SF DykeMarch WheelCam - First 40 minutes
360 VR Scooter Ride (with correct encoding) shot on insta360one
zhlédnutí 151Před 6 lety
360 VR Scooter Ride (with correct encoding) shot on insta360one
sort-of-3D scooter ride test with insta360 one in SF from SOMA to Mission
zhlédnutí 88Před 6 lety
sort-of-3D scooter ride test with insta360 one in SF from SOMA to Mission
Sparkling Pink Pandas Sunday Fun Day 360 Ride
zhlédnutí 112Před 6 lety
Sparkling Pink Pandas Sunday Fun Day 360 Ride
Debugging Apache Spark with Holden Karau (Google) & Joey Echeverria (Rocana) - Strata Singapore 2017
zhlédnutí 3,5KPřed 6 lety
Debugging Apache Spark with Holden Karau (Google) & Joey Echeverria (Rocana) - Strata Singapore 2017

Komentáře

  • @CraftDownloads
    @CraftDownloads Před 20 dny

    You look like a Europe east programmer that supports the library that runs behind scenes of everythink, very smart, I like the video and, the way you just get into the code!

  • @ahmedkotb3089
    @ahmedkotb3089 Před 2 měsíci

    Hey Good exercise but i have some questions about bentoml workers i implemented service with bentoml wit multi api-workers the model load once and this is magic and good benefit but don’t effect in performance when open multi worker in fastapi for example 2 workers the model load 2 times but the performance for handling requests was increasing but in bentoml the model load once because of sharing memory but I didn’t see that the performance was improving Did you know anything about that ?

  • @theanigos
    @theanigos Před 2 měsíci

    Solid. By d way Holden it would be great if you can also add the PR urls in the youtube description so that it will be easy to navigate. I always search and get it though ;) But getting the PR urls below the video will be best. It is always great to see you code and explain it.

  • @SebastianDangg
    @SebastianDangg Před 2 měsíci

    Yesterday, When searching some docs about Spark Debugging. I saw your presentation On Spark Summit, that was incredible! Now, Im reading your book about High Perf Spark - 15% on it! Very good book. Probably one of the best Spark Developer on the Earth! ❤ Keep it up!

    • @HoldenKarau
      @HoldenKarau Před 2 měsíci

      Oh thank you that's so awesome :)

  • @samukapsilvas
    @samukapsilvas Před 2 měsíci

    very helpfull videos

  • @w3w3w3
    @w3w3w3 Před 2 měsíci

    interesting video

  • @abhimadav
    @abhimadav Před 2 měsíci

    What keyboard do you use? Sounds really good. Apologies if you have already shared your desk setup before on this channel.

    • @HoldenKarau
      @HoldenKarau Před 2 měsíci

      It's the razer <3 hello kitty collab keyboard :p :)

  • @nosh3019
    @nosh3019 Před 2 měsíci

    Thanks for this kind of workflow and deep dive content. I find it very useful!

  • @nosh3019
    @nosh3019 Před 3 měsíci

    this is awesome! Thanks for uploading this advanced content :)

  • @kamerayakonus
    @kamerayakonus Před 4 měsíci

    Hello :) I want to apply to this project on my lab. Can you share with me yaml files? :)

  • @_UtkarshUmang
    @_UtkarshUmang Před 5 měsíci

    Did you figure this out, Can you share github link for it?

  • @CrashLaker
    @CrashLaker Před 5 měsíci

    seeing you debuging your code gives me motivation to keep debugging mine awesome :) keep posting

  • @rickt1866
    @rickt1866 Před 5 měsíci

    thx for sharing / uploading

  • @RandomVideos-im4ue
    @RandomVideos-im4ue Před 5 měsíci

    Are you training a language model like chatgpt?

  • @hemanthkumar-tj4hs
    @hemanthkumar-tj4hs Před 5 měsíci

    👌

  • @VARSTAR_Tutorials
    @VARSTAR_Tutorials Před 6 měsíci

    Hi Holden, are you on some type of small treadmill?

  • @hemanthkumar-tj4hs
    @hemanthkumar-tj4hs Před 7 měsíci

    thank you dude

  • @Someonner
    @Someonner Před 7 měsíci

    Hey.

  • @ZubairMuhammad09
    @ZubairMuhammad09 Před 10 měsíci

    That's Awesome, you plan to do the book sessions on a schedule or random ?

    • @HoldenKarau
      @HoldenKarau Před 9 měsíci

      For now random, but probably I'll try and get a schedule going in October so I have an hour of writing each week at least.

  • @Rafael-oq9vu
    @Rafael-oq9vu Před 10 měsíci

    NA trans are so disgustingly ugly, here in SA they do a little better

  • @Liu_Cao
    @Liu_Cao Před 10 měsíci

    Hey Holden - Randomly looking for data quality libraries online for spark and landed on this video! I recall from the Spark summit this year that Nike is a Databricks customer. As an engineer using databricks in day job myself, one hunch I immediately have after looking at your PR is that to run spark/pyspark on databricks your in-house libraries cannot have direct dependency on spark/pyspark. It will conflict with the proprietary spark install that databricks provide. The other two missing libraries also happen to be available in databricks environment (and the delta lake lib also might have a proprietary version on that environment). That also kind of explains the delta lake dependency (and no easy flexibility for iceberg support etc. yet).

    • @HoldenKarau
      @HoldenKarau Před 9 měsíci

      Interesting, thanks for the context and saying hey :) I would have hoped Databricks would have marked the package as provided on their internal version so we could have libraries which work inside/external just as easily but such is life.

  • @tariquea09
    @tariquea09 Před 10 měsíci

    High Performance Spark 2e is comging up? 🎉🎉

  • @darnelltate9488
    @darnelltate9488 Před 11 měsíci

    Promo-SM 😒

  • @Andromeda26_
    @Andromeda26_ Před 11 měsíci

    Good Job Holden! keep trying!

    • @HoldenKarau
      @HoldenKarau Před 11 měsíci

      Thanks ❤️. I think the Facebook notebook looks like it’s more designed for one GPU fine tuning so I think we’ll have success with that one :) (or so I hope).

  • @ramongonzales7489
    @ramongonzales7489 Před 11 měsíci

    Inspirational! Your dedication

  • @tariquea09
    @tariquea09 Před rokem

    Is this document available publicly somewhere? Didn't watch the video yet so apologies if you've mentioned it already.

  • @SanjeevKumar-nq8td
    @SanjeevKumar-nq8td Před rokem

    I am using Minio & Rookceph & see the following error in Minio pod API: SYSTEM() Time: 12:26:16 UTC 07/10/2023 DeploymentID: e0986c7c-b40f-4f0f-ac6a-57034c20d078 Error: Storage resources are insufficient for the write operation .minio.sys/tmp/9286dad1-86b6-40aa-8f12-165c5d1371f2/25d31237-c15f-4830-ba5a-61a2da2e1320/part.1 (cmd.InsufficientWriteQuorum) 2: internal/logger/logger.go:258:logger.LogIf() 1: cmd/erasure.go:427:cmd.erasureObjects.nsScanner.func1() API: SYSTEM() Time: 12:26:16 UTC 07/10/2023 DeploymentID: e0986c7c-b40f-4f0f-ac6a-57034c20d078 Error: Write failed. Insufficient number of drives online (*errors.errorString) 10: internal/logger/logger.go:258:logger.LogIf() 9: cmd/erasure-encode.go:112:cmd.(*Erasure).Encode() 8: cmd/erasure-object.go:1190:cmd.erasureObjects.putObject() 7: cmd/erasure-object.go:954:cmd.erasureObjects.PutObject() 6: cmd/erasure-sets.go:767:cmd.(*erasureSets).PutObject() 5: cmd/erasure-server-pool.go:953:cmd.(*erasureServerPools).PutObject() 4: cmd/config-common.go:83:cmd.saveConfigWithOpts() 3: cmd/config-common.go:88:cmd.saveConfig() 2: cmd/data-scanner.go:227:cmd.runDataScanner() 1: cmd/data-scanner.go:80:cmd.initDataScanner.func1()

  • @claytonmurray4328
    @claytonmurray4328 Před rokem

    Promo-SM 💕

  • @Alex-xf8pl
    @Alex-xf8pl Před rokem

    i keep wondering why is bash still used..maybe it's my fault that i fell short of learning it

    • @HoldenKarau
      @HoldenKarau Před rokem

      I mean I think Python can be a good substitute for a lot of bash use cases, but for those used to shell scripting it’s (sometimes) faster to just shell out

  • @raghavguptavlogs
    @raghavguptavlogs Před rokem

    Thanks holden. Have a good day

  • @Someonner
    @Someonner Před rokem

    All the best on your book. 👍

  • @a_k__
    @a_k__ Před rokem

    looking forward to the second part

  • @a_k__
    @a_k__ Před rokem

    missing these sessions... just had a look at Distributed Computing 4 Kids. interesting stuff 👍

    • @HoldenKarau
      @HoldenKarau Před rokem

      Thanks :) I've got a bit of travel coming up but I'll try and do some streams this week/early next week before I leave and then more when I get back. I'm really excited for progress on the DistributedComputing4Kids stuff too :)

    • @a_k__
      @a_k__ Před rokem

      Excellent👌

  • @amruthasaiprathipati2674

    Big fan ❤of you Holden one day would love to meet u

  • @a_k__
    @a_k__ Před rokem

    Interesting stuff

  • @lucc8703
    @lucc8703 Před rokem

    😡 ᵖʳᵒᵐᵒˢᵐ

  • @mmgtechm
    @mmgtechm Před rokem

    Can you please put good resolution in future. Its hard to see what you type. Thanks in advance.

  • @andrewm4894
    @andrewm4894 Před rokem

    Cool, can you make slides public if possible?

    • @HoldenKarau
      @HoldenKarau Před rokem

      Oh great suggestion - docs.google.com/presentation/d/1BX2gd4Am87Z6kdkENoCgKIEIWBKPxXVAx8kgOW3lth8/edit?usp=sharing open for comment if you have suggestions :D

  • @a_k__
    @a_k__ Před rokem

    Love these end to end stuff. Please do more of it

  • @a_k__
    @a_k__ Před rokem

    I love these streams. Keep it up! 👍

  • @a_k__
    @a_k__ Před rokem

    It would be very useful if you could do a stream on ur vim/emacs tooling for ppl with less experience like me

  • @vanexvillas2637
    @vanexvillas2637 Před rokem

    AWESOME!!!

  • @raghavguptavlogs
    @raghavguptavlogs Před rokem

    This is super cool 😎

  • @srinuvasu4164
    @srinuvasu4164 Před rokem

    After longtime. Hope your Ray book is done

  • @a_k__
    @a_k__ Před rokem

    I know its kinda fundamental but it would be very useful if you can do a live stream more focused on how you use emacs, packages, etc

  • @magoomba
    @magoomba Před 2 lety

    Looking forward to this book

  • @marwanla1870
    @marwanla1870 Před 2 lety

    Hello, Why dont you use an IDE ?

    • @HoldenKarau
      @HoldenKarau Před 2 lety

      It's a good question, I started programming back when IDEs were really slow on my computer so I've always stuck to text editors. I like emacs because I can add tools to it (and I do now sometimes use IDE like features like scalametals).

  • @penolove15
    @penolove15 Před 2 lety

    awesome holden, will these documents be shared?

  • @StrangerBaba86
    @StrangerBaba86 Před 2 lety

    Wonderful... We have spark jobs running on dataproc .. is it really nice to run them on gke ?? Kindly suggest

  • @Tako4047
    @Tako4047 Před 2 lety

    Thanks. I've been spending the past 2 days trying to get PySpark to work on ARM Kubernetes. Gives me hope to see it actually working. I was building from source within Raspberry Pi to get the ARM image, but I'll try your QEMU way instead and build multi-arch image.