What is Big Data? - Computerphile

Sdílet
Vložit
  • čas přidán 27. 07. 2024
  • With all this talk of Big Data, we got Rebecca Tickle to explain just what makes data into Big Data.
    MapReduce: • MapReduce - Computerphile
    / computerphile
    / computer_phile
    This video was filmed and edited by Sean Riley.
    Computer Science at the University of Nottingham: bit.ly/nottscomputer
    Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Komentáře • 271

  • @joonasfi
    @joonasfi Před 5 lety +194

    You can make any data big data by exporting it in XML

    • @Lodinn
      @Lodinn Před 5 lety +5

      It's hoooman-readable format though amirite? :>

  • @neuron1618
    @neuron1618 Před 5 lety +1171

    big data is anything that is too large to be opened in excel

    • @senkottuvelan
      @senkottuvelan Před 5 lety +12

      True.😂

    • @Rakkoonn
      @Rakkoonn Před 5 lety +37

      You say that, but many companies who say they use 'big data' really mean a huge spreadsheet.

    • @ubummer
      @ubummer Před 5 lety +19

      big data is anything too large to fit in pandas

    • @minihjalte
      @minihjalte Před 5 lety +10

      For thats there is microsoft access. Its just large excel :^)

    • @michaelsommers2356
      @michaelsommers2356 Před 5 lety +6

      _"big data is anything that is too large to be opened in excel"_
      So big data is any table with more than 100 rows?

  • @macchicken98
    @macchicken98 Před 5 lety +55

    Hands down best explanation of big data I have seen. I‘m coming from a business degree where we often learn about the 5Vs but don’t really touch on what infrastructure is actually used or needed for using/handling big data. Now I definitely have a better perspective on this!

  • @BaronSamedi1959
    @BaronSamedi1959 Před 5 lety +188

    According to management all big data can be reduced to one nice coloured 3D-pie chart!

    • @Walleggwp
      @Walleggwp Před 5 lety +21

      And if you don't have a nice upsloping line graph, well... I'm sorry but I'd like to speak to you in my office when are you finished.

    • @napillnik
      @napillnik Před 5 lety

      @@Walleggwp hockeystick!

    • @napillnik
      @napillnik Před 5 lety

      mmm, pie... I keep suggesting it but my team starts ignoring me after that.

  • @Manabender
    @Manabender Před 5 lety +364

    More V's of data!
    *Volatility*: How likely is it that this data is received intact? How often do the bits get flipped?
    *Velociraptors*: How much would this data scare xkcd?
    *Vaingloriousness*: How hard is the creator of this data trying to shove it in your face despite repeated attempts to get them to shut up?
    *Vanity*: How likely would the data be to win a beauty pageant?
    *Vampiricism*: When mirrored, does this data delete itself?
    *Vaccination*: Has the data been protected from viruses?
    *Vuvuzela*: Honestly, this one should describe itself.

    • @jackik1410
      @jackik1410 Před 5 lety +8

      this is glorious! nearly died here XD

    • @triton62674
      @triton62674 Před 5 lety

      Superb haha

    • @letMeSayThatInIrish
      @letMeSayThatInIrish Před 5 lety +35

      Vastness: Does 'huge volume' not even begin to describe the sheer size of the data?
      Verse: Is the data in verse form?
      Viscosity: Does the data flow effortlessly, or does it lump up like blood clots?
      Vikings: Does the data contain false information about vikings, such as them wearing horned helmets?
      Vendetta: Is the data vengeful? Viciously vindictive?
      Vincent van Gogh: Is it art?
      Vortex: Does the data rotate in ever more violent circular motions around the data center?
      Vulgarity: Must the data be censored for people in the US?
      Violas: Would a symphony orchestra make fun of the data?

    • @jackik1410
      @jackik1410 Před 5 lety +2

      @@letMeSayThatInIrish Holy data, this is even more ridiculus. The beauty is that each of the makes so much sense by itself and represents an actual (kinda) valid query!

    • @SniperSpy10
      @SniperSpy10 Před 5 lety +12

      Virginity: is it new and pure
      Violence Level: how likely is it to destroy other data
      Vocal: how easy is it to be heard
      Viagraity: can it give the reader a hard on

  • @koz857
    @koz857 Před 5 lety +115

    I picture a computer scientist somewhere thinking "Hmm gravity of the data is an important aspect that should define big data." and his friends are like "It doesn't start with a 'V' it won't work"

    • @galgrunfeld9954
      @galgrunfeld9954 Před 2 lety +1

      Value of importance - how important the data is
      Based on the value you can manage its position in a data pipeline - e.g what dataset you process first, how much computation power going into processing it, what data is sent to nodes in a network first, etc.

    • @Epic-so3ek
      @Epic-so3ek Před 8 měsíci

      I'm pretty sure them using all v's is to appeal to people who don't have a computer science background (aka managers and execs), or maybe people taking a first course in data science. I don't know that for sure but just the fact they used "velocity" instead of throughput makes me think that. If it was for people with a cs/IT background, that would just confuse them.

  • @senkottuvelan
    @senkottuvelan Před 5 lety +39

    8:32 Sean Ridley is an awesome editor. Used the word Process to add Pre Process in the video.💯🔥

  • @allluckyseven
    @allluckyseven Před 5 lety +167

    TIL a little bit about Big Data, but also learned that in England a truck is called a lorry.

    • @randallanderson4999
      @randallanderson4999 Před 5 lety +27

      And a highway is called a motorway.

    • @a.yashwanth
      @a.yashwanth Před 5 lety +8

      In India too.

    • @Jamie-st6of
      @Jamie-st6of Před 5 lety +10

      Ande Yashwanth well yeah, cause england invaded india

    • @lsmeteor4652
      @lsmeteor4652 Před 5 lety +19

      And in the us, you park on driveways and drive on parkways

    • @NoseyNick
      @NoseyNick Před 5 lety +10

      That's nothing, they come in different colours (with a u) too! Try saying "red lorry yellow lorry red lorry yellow lorry red lorry yellow lorry" really fast.

  • @MILCHMONSTER3D
    @MILCHMONSTER3D Před 5 lety +293

    my modded skyrim is big data
    too much for one computer to handle

    • @hattrickster33
      @hattrickster33 Před 5 lety +10

      I know what you mean. I literally have to run the game at my local rendering farm to get anything over 10 fps.

    • @manualvarado2212
      @manualvarado2212 Před 4 lety

      @@hattrickster33 At least you have a local rendering farm.

  • @edgekane958
    @edgekane958 Před 5 lety +7

    Every Computerphile video deserves a like.
    Change my mind.

  • @code-dredd
    @code-dredd Před 5 lety +45

    "Big Data" is the confusion that follows after marketing people end up describing technical stuff.

    • @WilliamAncich
      @WilliamAncich Před 5 lety +6

      Could not agree more.

    • @cmonkey63
      @cmonkey63 Před 5 lety +1

      Did you know? The term "Machine Learning" was an invention of the marketing team at IBM in 1959. Machines don't learn, silly. Well, neither do people, much of the time.

    • @MrCmon113
      @MrCmon113 Před 5 lety +13

      @@cmonkey63
      Machine learning describes precisely what it's about. Really, I cannot think of any better term for it. Computer aided reverse deduction? Knowledge discovery in databases? Automated stochastical analysis? Practical function fitting? Those are all obscurantist, *learning* is what it's about. And who learns? A machine.

    • @alkis2407
      @alkis2407 Před 5 lety +2

      @@MrCmon113 Statistical model estimation/fitting would be more accurate IMO. Optimization has been around for ages, why call it learning all of a sudden? (hint: money)

    • @napillnik
      @napillnik Před 5 lety

      @@alkis2407 algorithms learn. They adapt without code being rewritten, and produce outcomes that haven't been preprogrammed, and get better with experience. That's learning.

  • @fcs_96
    @fcs_96 Před 5 lety +7

    This channel is super informative. I'm super pleased that I was able to stumble upon it. Broadens my knowledge of Computer Science.

  • @HighestRank
    @HighestRank Před 5 lety

    That montage at the end is such a wax museum.

  • @MrFloris
    @MrFloris Před 5 lety +2

    Thank you for making these and sharing these lovely videos. They're a fantastic resource.

  • @Bnelen
    @Bnelen Před 3 lety +1

    She does a good job of covering many of the important basic concepts.

  • @sumitrana8114
    @sumitrana8114 Před 5 měsíci

    Let's take a moment and say that computerphile never disappoints.

  • @Shadow81989
    @Shadow81989 Před 5 lety +50

    Great to see more of Rebecca!
    This one was much better presented, seems like she's getting some practice (and confidence). :-)

    • @AndyH2O
      @AndyH2O Před 5 lety +27

      ...and is being patronised slightly less.

  • @Bordsteinpflaster
    @Bordsteinpflaster Před 5 lety +1

    I started to research to that topic today and was even on this yt channel to search for stuff ... and tadaaah I see this upload in my subbox, perfect timing :)

  • @RichardT2112
    @RichardT2112 Před 5 lety +49

    It’s not the size of your data that matters, rather how well you process it ...

    • @MrCmon113
      @MrCmon113 Před 5 lety +1

      No, it's both.
      We knew about lots of the best machine learning algorithms more than thirty years ago, but we didn't have the datasets to train them sufficiently.
      Deep neural networks are comparatively simple, but they perform miracles if you throw tons and tons of data at them.

    • @RichardT2112
      @RichardT2112 Před 5 lety +11

      Taxtro I see humour isn’t lost on you ... thanks for playing along!

    • @Monk-E
      @Monk-E Před 5 lety +1

      @@MrCmon113 wow you're cool

  • @vedi0boy
    @vedi0boy Před 5 lety

    Looking forward to the next video, thanks!

  • @leonleeds534
    @leonleeds534 Před 5 lety +6

    Great video and really well explained. Ms Tickle is one of my two fav presenters on this channel.

  • @AnonymousAccount514
    @AnonymousAccount514 Před 5 lety

    Long overdue...thank you

  • @derpimusmaximus8815
    @derpimusmaximus8815 Před 5 lety +57

    "This data is small, but the data over there is far away."

    • @recklessroges
      @recklessroges Před 5 lety +4

      Thanks Ted.

    • @bencrossley647
      @bencrossley647 Před 5 lety +2

      Best / most unexpected comment I’ve ever laughed at. I can see him looking so confused.

  • @DavidLindes
    @DavidLindes Před 5 lety +2

    Good stuff. While I knew each of the concepts, I'd not heard of the "5 Vs" (let alone the 10/whatever)... cool!
    And wait, is this map/reduce video out already? Must find it. I've been wanting a refresher, because I haven't used it in a while, but it could be useful for me soon.

  • @DantalionNl
    @DantalionNl Před 5 lety +4

    Can we also get videos on big data using none Spark based technologies?

  • @delusionnnnn
    @delusionnnnn Před 5 lety +7

    Every time I hear "data" as a singular noun ("data is") instead of a plural ("data are"), it seems like such a welcome change. The old plural usage seems so stilted and it's simply not how I hear most people talk unless they're very prescriptivist.

    • @teranokitty
      @teranokitty Před 5 lety +1

      "Data are as Data is."

    • @TheSam1902
      @TheSam1902 Před 5 lety

      Datum

    • @delusionnnnn
      @delusionnnnn Před 5 lety +1

      @@TheSam1902 That's the stilted usage I refer to which no actual person under 70 uses unless they're deliberately trying to sound awful.

    • @klaxoncow
      @klaxoncow Před 5 lety +2

      Whilst grammatically, the singular is "datum" and the plural is "data", and by linguistic pedantry it ought to be "data are", this ignores the intrinsically "uncountable" nature of data.
      A single bit could be legitimately described as a "datum", as you can't further decompose it. But, for anything more than that, the problem with the notion of singular and plural on data is that it's always composite.
      Is a byte a singular piece of data? Or is it 8 bits of data? Or is it 2 nibbles?
      Well, yes, exactly. The answer is "yes".
      So we've already hit the issue with any notion of plurality on "data". Any amount of it, beyond a singular bit, could be viewed as singular or plural. Depends on your metrics.
      Information = data + structure.
      "Data", by this definition, is without structure. So you cannot logically impose singularity / plurality onto it without implicitly providing structure, that makes it cease being "data" and becoming "information".
      More over, it's worth noting that, in English, "information" is uncountable. You can't have "informations". Information plus more information is still information - there's just more of it.
      It's a linguistic quirk. Shouldn't really be there. "Data" is, by nature, uncountable - whether English grammar wishes to agree or not.
      Therefore, for me, it's always "data is". Data plus more data is still data - there's just more of it. Exactly as uncountable as "information" already is.
      (And this ends up being even more so, if you actually spend any time with assembly language programming. As you're quite often doing things like grabbing the upper nibble of a byte to test for flags, or - to, for example, swap endianness - grab the individual bytes in, say, a 64-bit value and then swap the byte order around. The fluid interchangeability of how you interpret data - that, indeed, at the machine level, code is data too and you can create confusing self-modifying code that rewrites itself, even - becomes very apparent. Data, as data, has no inherent structure. No intrinsic plurality. Code implicitly provides the structure from how it treats the data, which turns it into useful information. In this view, data is, by nature, intrinsically uncountable - even if, by a quirk of history, the English language appears to disagree with this.)

    • @vnickleswitter
      @vnickleswitter Před 4 lety

      @@klaxoncow buried.

  • @satyris410
    @satyris410 Před 5 lety +11

    More than 16,384 columns = Big Data.

  • @stefanjooste3598
    @stefanjooste3598 Před 2 lety

    Love the use of old dot matrix printer paper to try and explain the basics of big data.

  • @Treviath
    @Treviath Před 5 lety

    Would it be possible for you to do a video on the piece of art that is called Wireguard?

  • @fruitfcker5351
    @fruitfcker5351 Před 5 lety +1

    01:28 I haven't seen that wide of a continuous paper in decades

  • @darylallen2485
    @darylallen2485 Před 3 měsíci

    Please bring this one back

  • @uristmcdani
    @uristmcdani Před 2 lety

    Thanks a lot for this explanation, very clear!

  • @etinosaizekor6533
    @etinosaizekor6533 Před rokem

    Clean and clear explanation

  • @johndripper
    @johndripper Před 4 lety

    i can listen to u all day :)

  • @peter_smyth
    @peter_smyth Před 5 lety +2

    2:52 That lorry is heading NNW, not NNE.

  • @moni7235
    @moni7235 Před 4 lety

    Thank you Rebecca!

  • @ecelon
    @ecelon Před 5 lety +5

    Big data for me is when any text editor I try crashes while opening it...

  • @ShankarSivarajan
    @ShankarSivarajan Před 5 lety +1

    A quote I heard last week about big data: "We are drowning in data but starved for information." (Paraphrasing John Naisbitt, 1982).

    • @MrCmon113
      @MrCmon113 Před 5 lety

      Information is just the complexity of the data. What you are looking for is knowledge.

  • @noredine
    @noredine Před 5 lety +29

    It's the opposite of ˢᵐᵃˡˡ data

  • @raffriff42
    @raffriff42 Před 5 lety +2

    CZcams views and likes are tracked by traditional databases. CZcams recommendation algorithms use "big data" (although they use views and likes as raw input)
    "Big Data" systems are mainly interested in the _patterns_ in the data (data = whatever information is fed into the system), and the integrity, or confidence in, the individual atom of data is not very important. OTOH, in traditional databases (bookkeeping, inventory, payroll) the integrity of each atom of data is (with some exceptions) very important indeed.

  • @willynebula6193
    @willynebula6193 Před 5 lety +11

    Candy crush is big data for my Amiga 500😉

  • @davidgillies620
    @davidgillies620 Před 5 lety

    Kafka is really easy to use in node.js. I like it.

  • @maulanaibnusabil5280
    @maulanaibnusabil5280 Před 4 lety

    Can someone explain me the difference between Big Data, ETL (Datawarehouse), and Data Engineer.
    I'm really confused

  • @TagetesAlkesta
    @TagetesAlkesta Před 5 lety +9

    Big Data is a great band 👍

  • @sooskca
    @sooskca Před 5 lety +4

    How many Apache projects are there?

    • @AndyVanee
      @AndyVanee Před 5 lety +3

      At the moment... exactly 367

    • @TheSam1902
      @TheSam1902 Před 5 lety +1

      As much as the number of feathers on a peacock.

  • @robertboran6234
    @robertboran6234 Před 5 lety

    Long time ago i was thinking that we can in theory use Big Data to create new electrical energy that can feed other machines or even the Big Data system itself. When we have huge amount of data, some of it is relevant information (this is used for processing) a second type of data is a second relevant data (this is used to train the Big Data system to improve itself) and the last type is total garbage data (this is still data that has 0 and 1). Now we know that when digital information is deleted from the machine the actual bits of information are not lost but transformed via thermodynamic effects into heat (this heat is raising the temperature of the machine) so when digital data is deleted the machine will heat up a little bit. Now we channel all the heat from all the machines and instead of disposing it we reuse it to produce electricity. So we recycle the "heat" from the machine.

    • @TheSam1902
      @TheSam1902 Před 5 lety

      But you forgot something, it's not the heat that is valuable, it's the heat **differential** . Some datacentres in northern countries uses the temperature difference between the inside of the server room and the outside air to power Sterling engines and produce electricity, but it's still not very efficient.
      Also iirc the swedish military won a wargame against the US because their submarine were (partially) powered by these Sterling engines making them stealthy than nuclear/diesel powered submarines.

    • @robertboran6234
      @robertboran6234 Před 5 lety

      @@TheSam1902 I agree with the inefficiency. Another way to improved this is by increasing the information density. But i still believe that this will be possible if the system is large enough. I am thinking about interplanetary internet where you need to process all the data of an entire planet. Also we know that information at a quantum level is stored in the surface not in volume. so i am thinking of using black holes as memory.

  • @kevind814
    @kevind814 Před 5 lety +1

    Big Data: The lifeblood of Big Brother

  • @jvne_
    @jvne_ Před 5 lety +4

    "How big is big data?"
    Me: big

  • @quratulain8396
    @quratulain8396 Před rokem

    Productive video

  • @911madza
    @911madza Před 5 lety +4

    0:02 Ron Graham is the right person to answer this.

  • @dirkdigglerswonderlandempo5170

    How times have changed in my day it was the 4F's now its the 5V's

  • @nathangek
    @nathangek Před 5 lety +7

    That's data but, like, really big.

  • @hillwin10
    @hillwin10 Před 5 lety +11

    Does size really matter?
    It is how the data is used.
    edit: or "data are"

    • @michaelsommers2356
      @michaelsommers2356 Před 5 lety +1

      It depends on whether you are referring to the data individually or collectively.

    • @thomaspearson8782
      @thomaspearson8782 Před 5 lety

      @@michaelsommers2356 wouldn't you use datum if it was singular, and data otherwise, using "is" for both?

    • @michaelsommers2356
      @michaelsommers2356 Před 5 lety

      @@thomaspearson8782 Sure, but I was mostly joking.

    • @MrCmon113
      @MrCmon113 Před 5 lety

      Ok so you have to tell me what distribution produced the following input-output pair: A -> 0
      Do you think your chances of guessing the right function improve if I give you more examples? If not, why do you think learning is even possible?

  • @keeganhoover8688
    @keeganhoover8688 Před 5 lety

    -> Rotate/Move the rocket
    ->Light

  • @Peds013
    @Peds013 Před 5 lety +2

    It's funny how people think of bug data, the company I work for can produce 100s TBs every few hours, we went to a 'big data' conference and got told we didn't count as it was a small problem :-/

    • @terohannula30
      @terohannula30 Před 5 lety +2

      "Bug data" 🦗🤔

    • @napillnik
      @napillnik Před 5 lety

      There are a lot of smug assholes in the industry. And there are a lot of people who push buzzwords for no reason. Don't mind them.

    • @BlackDragon31000
      @BlackDragon31000 Před 2 lety

      @@terohannula30 🐛 🐞 bug data

  • @gorgolyt
    @gorgolyt Před rokem

    There's only three Vs, the last two were clearly added on because somebody wanted five "Vs" but they really have nothing to do with whether something is big data.

  • @rednull8315
    @rednull8315 Před 5 lety +6

    640 kB

  • @farqueueman
    @farqueueman Před rokem

    "how big is big"
    giggles

  • @RAZREXE
    @RAZREXE Před 2 lety

    Big data is the study material folder in the d drive

  • @BrikoLage
    @BrikoLage Před 5 lety +10

    Thanks for enabling transcriber... oh, it's disabled...

    • @BrikoLage
      @BrikoLage Před 5 lety +1

      ​@@jamiecropley I don't know why they don't enable the transcriber, it's free and it helps people like me that English is not their mother language. It's too hard for me listening people talking in English, I understand some words, few phrases, but not all. On the other hand, I understand very well English written.
      I'm not lucky like others who born in countries where English is the first language, or where education system worries about teaching English to students.

  • @gabetower
    @gabetower Před 5 lety

    I won't be content until you have more V's than the speech from V for Vendetta. Voila!

  • @neddyladdy
    @neddyladdy Před 5 lety +1

    They could solve their problem with the simple expedient of not collecting data.

  • @dancingCamels
    @dancingCamels Před 5 lety +2

    Step 1: Rotate/hone the rocket
    Step 2: Light
    Step 3: ...
    Step 4: Profit!

    • @NoseyNick
      @NoseyNick Před 5 lety

      I think it's "rotate / move the rocket" but I hope we learn more about Rebecca's Rockets in a future computerphile video!

    • @dancingCamels
      @dancingCamels Před 5 lety

      @@NoseyNick oh yes, on looking again you're right.
      Hopefully we will find out what it's about!

  • @lmaoukiddin680
    @lmaoukiddin680 Před 2 lety +1

    3 inches is pretty big right?

  • @rock3tcatU233
    @rock3tcatU233 Před 5 lety

    It's not the size of the data that matters, but how you use it.

    • @MrCmon113
      @MrCmon113 Před 5 lety

      The size of the data matters a lot. Some things you can only learn from incredibly huge sets of data.

  • @azizalaliq8
    @azizalaliq8 Před 5 lety +5

    I am fully functional, programmed in multiple techniques and now *big*

  • @olik136
    @olik136 Před 5 lety

    I think data has to be at least this >| |< big... maybe even this >| |< big...

  • @mipmipmipmipmip
    @mipmipmipmipmip Před 5 lety +3

    Very clear video and explanation, but is Big Data still a relevant issue in 2019?

    • @randallanderson4999
      @randallanderson4999 Před 5 lety

      Just ask the NSA. They are listening to everything and everybody, all that data has to go somewhere.

    • @Fraznist5673
      @Fraznist5673 Před 5 lety +1

      mipmipmipmipmip That depends on what you mean by relevant issue. The problem of handling and analyzing big data is pretty much solved. With tons of working different solutions available, we are past the phase of making it possible and at the phase of improving. So the problem of whether big data can be put to use is not relevant anymore. However, all fields are trying to become data driven to increase profits, thus generating big data. So I would say yes, big data is a relevant issue, perhaps more than ever.

    • @MrCmon113
      @MrCmon113 Před 5 lety

      That's like asking: "Do we still want to learn things about the real world?"
      The only way you are going to learn things about the world is to collect data and the more data you have the more you can, in principle, learn. If you knew everything, you'd already know everything and wouldn't need a theory. As long as you don't know everything, you need a theory to predict what you don't know. And that theory can only be improved via training and testing examples.

  • @sabuein
    @sabuein Před 2 lety

    Thank you.

  • @strydomobile
    @strydomobile Před 10 měsíci

    Lorries are awesome.

  • @lasersimonjohnson
    @lasersimonjohnson Před 5 lety

    Mind tickled :p

  • @runningjoke_masterstroke

    Only the first 3 Vs given are actually particular to defining Big Data. If the data is such Volume, Velocity, and/or Variety that traditional data management can't handle it well, then it's Big Data. Value and Veracity apply just as well to a single data point. If the data (no matter its size or shape) has no value, then there is no reason to collect or store it. If the data (no matter its size or shape) lacks veracity, then its value is questionable.

  • @blackbox4214
    @blackbox4214 Před 5 lety

    Thumbnail a+

  • @modnode2869
    @modnode2869 Před 5 lety +1

    As a programmer.. Can someone please tell me how to meet girls like this?

  • @Dribbleondo
    @Dribbleondo Před 5 lety

    1TB.

  • @WickedMuis
    @WickedMuis Před 5 lety +24

    Ah the adorable one is back :D

  • @snake1625b
    @snake1625b Před 5 lety

    Generally, more than 10 terabytes is big Data usually

    • @Alex1891
      @Alex1891 Před 5 lety

      When I was a kid, I used to say things priced at $30 or greater were expensive, regardless of context. ;)

    • @snake1625b
      @snake1625b Před 5 lety

      @@Alex1891 most things in life are subjective and don't have a definitive answer. But it's definitely possible to give a generalized average answer. In this case you can say the AVERAGE server can only process less than 1 terabyte of typical data and thus you'll need multiple computers to process the data. The most unhelpful and pedantic answers you can give is something annoying like " it depends. It's subjective. It varies from problem to problem".

  • @StrangeIndeed
    @StrangeIndeed Před 3 lety

    I've realized that 5 V makes for a very nice mnemonic. V is 5 in roman numerals, so you can pretty easily remember that there are 5 Vs.
    It's probably just an accident, but makes it things a little easier to remember c:

  • @MoonMarshmallow
    @MoonMarshmallow Před 5 lety +9

    Rebecca is so cute!! ❤

    • @polygondwanaland8390
      @polygondwanaland8390 Před 5 lety +4

      @MichaelKingsfordGray What's your address and credit card number? Wouldn't want to be anonymous and cowardly, big man.

    • @inzanozulu
      @inzanozulu Před 5 lety +1

      Use your inside voice. It's not a problem to find somebody attractive, but did that really need to be in a comment on this video?

  • @edge4694
    @edge4694 Před 5 lety

    I hate how the sound of the pen lags behind the actual pen

    • @rendogsbiggestfan
      @rendogsbiggestfan Před 5 lety

      I didn't realize but now I can't not realize, you monster

  • @deanbrowne9557
    @deanbrowne9557 Před 5 lety

    A megabyte.

  • @SingularityofPower
    @SingularityofPower Před 5 lety

    Big if true

  • @mahdibrooz1982
    @mahdibrooz1982 Před 3 lety

    hi !

  • @mannycalavera121
    @mannycalavera121 Před 5 lety

    I like to pretend i'm smart enough to understand what's going on in this video :)

  • @Ubeogesh
    @Ubeogesh Před 5 lety

    so where's that map reduce video?

  • @realityveil6151
    @realityveil6151 Před 5 lety +3

    Hey, she's back! The cute nerdy chick!

  • @JanB1605
    @JanB1605 Před 5 lety +2

    How I love me some pretty, intelligent women in STEM. Great Video, was always wondering what big data really is.

  • @senkottuvelan
    @senkottuvelan Před 5 lety

    CZcams IS BIG.

  • @willhendrix86
    @willhendrix86 Před 5 lety

    In before your entire life and your rights are represented in a 5 star rating system;
    And yes I have seen that black mirror episode ( ' ', )

    • @gqh007
      @gqh007 Před 5 lety

      In before killer robot bees

  • @alittlebyte
    @alittlebyte Před 5 lety

    00:01 "How big is big?"
    LOL

  • @AtlasMTBRider
    @AtlasMTBRider Před 5 lety

    big data > small data

  • @LathosZan
    @LathosZan Před 5 lety +4

    Always like for gals in tech!

  • @xakkep9000
    @xakkep9000 Před 5 lety

    coool

  • @kenichimori8533
    @kenichimori8533 Před 5 lety

    Jumbo big data.

  • @kennb33
    @kennb33 Před 5 lety

    Isilon

  • @SephirothDL
    @SephirothDL Před 5 lety +1

    Splunk

  • @nosuchthing8
    @nosuchthing8 Před 5 lety

    Building Brainiacs Brain

  • @grainfrizz
    @grainfrizz Před 5 lety

    Vig data

  • @aadeshrana0
    @aadeshrana0 Před 5 lety

    Is it just me who cringes to the sound of the marker writing on that paper

  • @BrokebackBob
    @BrokebackBob Před 5 lety

    Data storage is now totally separate physically from the computers that access it. The idea of defining big data as the max that a single computer can process is laughable.

    • @szebohalasz7793
      @szebohalasz7793 Před 5 lety

      I dont thik so, given the fact that you mostly need the "computer" to process the data. Also its just metaphorical not absolute definition, as the BigData itself.