Database Design for Custom Fields

Sdílet
Vložit
  • čas přidán 9. 09. 2020
  • 📝 Get my free SQL Cheat Sheets: www.databasestar.com/get-sql-...
    🎓 Learn and become confident in SQL: databasestar.mykajabi.com/sql...
    The ability to allow custom fields to be added to a database is a common requirement. This could be something the user wants to add when using an application, or for defining reports, or something else.
    There are many different ways of allowing custom fields in your database design. There is no one right solution though, as it depends on your requirements, amount of data, and the application.
    In this video, I'll share 8 different solutions that can achieve this requirement. Each of them has a sample diagram to help visualise it, and a list of the advantages and disadvantages.
    The solutions are (with timestamp links):
    1:52 - EAV (Entity Attribute Value)
    5:17 - Modified EAV
    8:06 - Single Table
    10:34 - Class Table Inheritance
    12:21 - Concrete Table Inheritance
    13:36 - Normalised Tables
    15:09 - JSON
    16:34 - Dynamic Schema
  • Věda a technologie

Komentáře • 60

  • @jellehuibregtse9476
    @jellehuibregtse9476 Před 2 lety +2

    Thanks for this video, I really like your clear voice and concise explanations.

  • @marna_li
    @marna_li Před 2 lety +6

    In my project, I added a CustomFieldDefinitions table and a CustomFields table, for the Orders table.
    My idea was to get a clean structure. It was proven to be bad when I tried to implement filtering of Orders and thus wanted CustomFields to be included.
    I used Entity Framework and the idea was compile a structured query string arg into a query. But you cannot flatten all custom fields per Order to columns that the ORM can handle.
    My next attempt will be with a JSON field.

    • @DatabaseStar
      @DatabaseStar  Před 2 lety +3

      Thanks for sharing. Sometimes designs can work at one point but later on they can be hard to work with.

  • @jhfdjdhfdgfghjzrztku
    @jhfdjdhfdgfghjzrztku Před rokem +8

    JSON support in most major DBMSes has gone a long way, but if your data is highly dynamic to begin with, you are probably bringing a knife to a gun fight and should consider a NoSQL database for your application

    • @DatabaseStar
      @DatabaseStar  Před rokem +1

      Good point. Using JSON data for custom fields, in a NoSQL database, could be the way to go.

    • @mr.random8447
      @mr.random8447 Před 10 měsíci

      I need ACID, but also need custom fields. What do I do? Have an id column in SQL that points to a document in MongoDB. Two databases?

  • @rikyriky966
    @rikyriky966 Před 3 lety +5

    I prefer 3NF Normalized Tables (6th in the video) and the JSON (7th) for my large scale app’s database design.
    Thanks for the video. :)

  • @saran-ravi
    @saran-ravi Před rokem +1

    Great video

  • @ADAMSIVES
    @ADAMSIVES Před 2 lety

    Very helpful for me

  • @havenlin5633
    @havenlin5633 Před 2 lety +10

    I would prefer #1 EAV, and #7 JSON for alternative, thanks for sharing.

    • @mr.random8447
      @mr.random8447 Před rokem

      EAV bad, you chose the worse options. Best is Normalized

    • @DatabaseStar
      @DatabaseStar  Před 10 měsíci

      Thanks for sharing!

    • @DatabaseStar
      @DatabaseStar  Před 10 měsíci

      Yeah generally normalised is better

    • @FLAWLESSBEATS
      @FLAWLESSBEATS Před 7 měsíci

      @@DatabaseStar What if you need to create custom attributes for a certain class in your application? Normalized option obligates you modify your app to create a new class and a new table on the database in order to have a new type of attribute, so in that case using sort of an EAV solution is better right?

  • @mmaaaxxxxx
    @mmaaaxxxxx Před 3 lety +3

    For #3 the video missed the most important con, which is in multi-tenancy, each customer, with a click in the product, would change the basic database schema. That shuts down the solution right there. (if your situation is not multi-tenancy, then you probably don't even need to worry about custom fields)
    edit: and it's thousands or tens of thousands of columns, not 50-200

    • @DatabaseStar
      @DatabaseStar  Před 3 lety

      That's a good point, if there are many customers then the number of custom fields would increase a lot. This is something to consider.

  • @aubbusta
    @aubbusta Před rokem +2

    Json is the best solution for a dynamic fields even big companies use it on their api

  • @webwhale
    @webwhale Před 9 měsíci

    Magento cms indexing mechanism solves entity attribute value performance...that can also be applied to worpdress to solve performance issue...I think nosql can be used with sql to indexing the large public data, for eg: ecommerce product data & confidential records like order, stocks can be stored in sql for better secured storing...

    • @DatabaseStar
      @DatabaseStar  Před 9 měsíci

      That's a good point, good to see it can help for those systems.

  • @CallMe_PsPsPs
    @CallMe_PsPsPs Před 2 lety +2

    Working with norlamized (#6) all time. Json sometimes too (#7) but now I have to switch my articles to EAV.

  • @taamcyat
    @taamcyat Před rokem +2

    I've had much success with EAV using: * (1) MSSQL and the Variant type so that you only have 1 value table(or column). * (2) use strongly typed and optimized hand written SQL stored procedures(middle-tiered ORM generated SQL is not always a good idea for reporting queries). * (3) Create a dynamic-PIVOT operation stored procedure to generate a pivoted 'flat' table for reporting. Legally inject parameters(list of required attributes) from the reporting user interrace into the dynamic(sql) PIVOT stored procedure to generate the pivoted flat table. I used COUNT and /or MAX as the PIVOT aggregate function. * (4) Use optimized indices(indexes). You actually can index the tables easily. Also you don't really need to cast or convert values since the very nature of the query would limit the types (attributes) required and also that you are using the Variant data type that is directly sent straight up to the user interface reporting visualization elements. These alone solved my issues. I seeded the model(the value table) with millions of records and the performance is amazingly fast! As a bonus, you can further 'cube' the generated pivoted flat table to generate further summaries and statistics before presentation.

    • @DatabaseStar
      @DatabaseStar  Před rokem

      Great tip! That approach sounds good and it's good to hear it works for you.

  • @alankalbfleisch3141
    @alankalbfleisch3141 Před 3 měsíci

    I think the Modified EAV could be modified even more to be a bit more useful. Instead of a single customer_attributes table, why not three tables for the different data types (customer_string_attribute, customer_date_attribute, customer_number_attribute). This allows for better data validation compared to EAV and removes the risk of multiple columns being populated. It does add more complexity when trying to query all attributes.

    • @DatabaseStar
      @DatabaseStar  Před 3 měsíci

      Good point! I think that can work as well.

  • @ms77grz
    @ms77grz Před 8 měsíci

    👍👍

  • @ADAMSIVES
    @ADAMSIVES Před 2 lety +1

    I didn't think of the JSON (or potentially XML??? - is that a thing too?) version until you said it....

    • @DatabaseStar
      @DatabaseStar  Před 10 měsíci

      Good to know! Yeah XML could work but I don't see it used very often.

  • @kushagrajain2407
    @kushagrajain2407 Před rokem +1

    @DatabaseStar This is a bit difficult to understand without any table examples of how it looks like. Is there any blogpost where you have explained it with an example. thanks in advance :)

    • @DatabaseStar
      @DatabaseStar  Před rokem +4

      Thanks for the feedback! Yeah I've heard that before about this video and some others, so I try to add examples into some of my recent videos. I don't have any posts that has examples unfortunately - but I can create one in the future.

  • @ZachariBarnes
    @ZachariBarnes Před rokem

    Only a few of these options support custom fields and none of these support defining a custom field(column) (which would be different for each tenant) to which each row would have a value.

    • @DatabaseStar
      @DatabaseStar  Před rokem +1

      What do you mean by "only a few support custom fields"? And what do you mean by "none of these support defining a custom field"?
      Each of them allow users to define custom fields, where the users can determine what information can be captured for a record.
      If there's a need to have a new column where each row has a value, then this would be more like adding a new column to a table, and would be done with an Alter Table statement.

  • @martinjones8237
    @martinjones8237 Před rokem

    Nice video! Anybody know what "PK" and "RK" refers to in these diagrams? This is the only thing I'm missing!

    • @vivekbansal7207
      @vivekbansal7207 Před rokem

      primary key

    • @DatabaseStar
      @DatabaseStar  Před rokem

      Ah, PK = Primary Key (the unique identifier for the row), and FK = Foreign Key (a reference to a Primary Key in another table)

  • @mickinoz2005
    @mickinoz2005 Před rokem

    While I enjoyed the video I am none the wiser as to which is best or better maybe a list of don't do to best option might be good

  • @hassamulhaq7762
    @hassamulhaq7762 Před rokem

    Dynamic Design looks 😬 due to risk of columns change on live server. need strong validation for this type.
    I prefer JSON, EAV structure.

    • @DatabaseStar
      @DatabaseStar  Před rokem +1

      Thanks! Yeah there are some risks for this approach.

  • @gauravsrivastava17
    @gauravsrivastava17 Před měsícem

    Normalisation

  • @mouaad_games8
    @mouaad_games8 Před 7 měsíci

    Or just use no-sql DB

  • @faisalmushtaq2287
    @faisalmushtaq2287 Před 2 dny

    i think 1st solution was much better

    • @DatabaseStar
      @DatabaseStar  Před dnem

      Oh thanks, that’s good to know

    • @faisalmushtaq2287
      @faisalmushtaq2287 Před dnem

      @@DatabaseStar currently watching your "7 database design mistakes to avoid" video👍

  • @antoniocs8873
    @antoniocs8873 Před 2 lety +2

    I think you oversimplified the EAV by A LOT.

    • @DatabaseStar
      @DatabaseStar  Před 2 lety

      Yeah I did simplify it for the video, it can get out of hand pretty easily. What else would you add for EAV?

    • @antoniocs8873
      @antoniocs8873 Před 2 lety +1

      @@DatabaseStar Take some notes from this video: czcams.com/video/WneHTRZVbec/video.html