Longest Substring Without Repeating Characters - Leetcode 3 - Python

Sdílet
Vložit
  • čas přidán 19. 08. 2024

Komentáře • 319

  • @NeetCode
    @NeetCode  Před 3 lety +30

    🚀 neetcode.io/ - A better way to prepare for Coding Interviews

    • @adityavartak6990
      @adityavartak6990 Před 3 lety

      Why did you pick sliding window approach.I didnt understand the thinking process of this algo selection

    • @sinnyman2157
      @sinnyman2157 Před 3 lety +3

      well, great video, but i did not unerstand the last move - "r-l+1", could u explain

    • @igorverevkin7709
      @igorverevkin7709 Před 3 lety +5

      Could you please explain the last part (r-l+1). What did you mean by "we can make result larger than it is if the current window size is greater than it is right now"?
      Thanks!

    • @rajakrishna
      @rajakrishna Před 3 lety +8

      @@igorverevkin7709 r-l+1 is the size of the current substring.
      l,r are indexes which means in "abc"
      index(c)-index(a) will give 2 instead of 3 that's why you add 1 to it

    • @bruce716
      @bruce716 Před 2 lety

      @@rajakrishna Thanks for the details.

  • @dilln2158
    @dilln2158 Před 2 lety +156

    Amazing explanation as always. Just a little trick I found, instead of res = max(res, r - l + 1), we can simply do res = max(res, set.size()), because the size of the set at that point in the loop will be the size of the current substring

    • @brickndick
      @brickndick Před rokem +1

      Is that true? LC errors out when I call charSet.size()

    • @sk_4142
      @sk_4142 Před rokem +1

      @George standard implementation of size/len is O(1) time. The sizes/lengths of most data structures are usually kept as instance variables that can be retrieved and updated (when adding to or removing from, say, an array or a set) in constant time

    • @garyhu728
      @garyhu728 Před rokem +6

      @@brickndick If you're doing LC in Python, it should be len(charSet)

    • @wycliffeo4656
      @wycliffeo4656 Před rokem +1

      This is good because i was wondering why we created a set id we are not using it in any determining operation

    • @MrjavoiThe
      @MrjavoiThe Před rokem +25

      That’s more understandable, r - l + 1 is confusing.

  • @mostinho7
    @mostinho7 Před rokem +50

    Done thanks!
    Sliding window vs two pointer, in sliding window we keep expanding and shrinking the window dynamically, by moving left and right pointers (sliding left side in this case to reduce the window size) until we don’t have duplicates in our window. Then we can increase window by moving right pointer.
    Algorithm:
    Expand window until you encounter a duplicate, then keep shrinking the window until the duplicate is not in the window anymore. Keep track of longest substring.
    Pitfall:
    - when shrinking the window, you have to remove the character from your Set data structure as well
    - when you encounter a duplicate when expanding the window, the duplicate that needs to be removed won’t necessarily be the one on the left end, it can be inside the window so you have to keep popping till you reach it

    • @vezzolter
      @vezzolter Před 18 dny +1

      Thank you! Quite a great summary, especially important for me was your note about last pitfall.

  • @arunsp767
    @arunsp767 Před 2 lety +407

    A very important correction. Not in the code, but in the explanation: We use Set NOT because it stores unique characters, but because (in HashSet) the operation 'contains()' runs in O(1). We really don't need a data structure that maintains unique elements because we are doing that part explicitly using the 'contains()' operation. So the code would run fine even with a regular List. But 'contains()' in List is typically O(n). Hence the choice of (Hash)Set

    • @zionroar9066
      @zionroar9066 Před 2 lety +20

      For that matter we could of used a dictionary, because look up in a dictionary is 0(1)

    • @user-us4mc7ej3c
      @user-us4mc7ej3c Před rokem +17

      @@zionroar9066 but it takes more space than a set bc the keys. Always better to optimize time AND space complexity

    • @jaechanlee290
      @jaechanlee290 Před rokem +6

      ​@@user-us4mc7ej3c they are the same in terms of "space complexity".... O(N).

    • @dancingdragon1143
      @dancingdragon1143 Před rokem +5

      @@jaechanlee290 No key value pairs use a hashing algorithm so that each key is unique. Therefore spacial complexity will alway be less optimal than say a list where elements can be stored contiguously. If your keys aren't sufficiently spaced you risk a key collision which will amortize your lookup time to 0(n) .

    • @dancingdragon1143
      @dancingdragon1143 Před rokem

      I still don't understand. Our conditional checks every char in string s. our left pointer iterates based on each character. So are we not still o(n) given we're checking each element at o(1) time complexity?

  • @douglasfrb
    @douglasfrb Před 2 lety +82

    Hey dude! I really appreciate everything you've been doing with these videos teaching how to solve this algorithms. I have a good understanding of data structures in general but I never actually knew about some techniques that people use to solve these problems. I'm studying algos for almost a week now and when I come across with a complicated one and a I struggle to solve I always come here to see your point of view. Just a comment to show my gratitude.

  • @abebts20000
    @abebts20000 Před rokem +14

    You could just use hashmap to store the last index of the character and just move you start of the sliding window to the character next to the duplicate one. Rather than erasing all the characters from set.
    int lengthOfLongestSubstring(string s) {
    unordered_mapdict;
    int mx =0;
    int start =0,end=0;
    for(;end

    • @The6thProgrammer
      @The6thProgrammer Před 11 měsíci +1

      This solution will fail on certain cases. For "abba" this returns 3 instead of 2. The approach will still work however, just update the conditional statement to check that dict[s[end]] >= start as well, so you ignore any indices out of the current sliding window. (e.g. if (dict.find(s[end]) != dict.end() && dict[s[end]] >= start).

  • @AnmolSingh-sb5jj
    @AnmolSingh-sb5jj Před rokem +51

    You're the best. Got selected at Amazon because of your teaching.

  • @MrMainak90
    @MrMainak90 Před 4 lety +71

    One of the cleanest solutions i have come across, thanks!!

  • @cesaralcantara1341
    @cesaralcantara1341 Před 3 lety +41

    If you are having trouble understanding any parts, I would recommend writing the code in vs code and debugging it. Add some watch variables as well. Really helped me understand some parts of this algorithm. Great stuff man, really helpful!

    • @chilo7272
      @chilo7272 Před 2 lety

      The debugger in leetcode does the same or is vscode better?

    • @pmxi
      @pmxi Před 2 lety +22

      @@chilo7272 well, the vscode debugger is free

  • @licokr
    @licokr Před 5 měsíci +1

    all of a sudden, come to think of it, even though I'm watching this video to find the answer and how it works. I think this channel is not only talking about the answer of a coding interview problem. He is actually teaching people, not only telling us about an answer. I'mr really glad I found this channel. It helps me grow up as a software developer. I really hated algorithm but this channel makes me finding fun in solving problems. I really appreciate it with all of my heart.

  • @eddockery7118
    @eddockery7118 Před 3 lety +46

    alternatively you can use:
    res = max(res, len(found))
    I think that maybe a bit cleaner and intuitive for some.

  • @chyyeeah
    @chyyeeah Před 4 lety +26

    Appreciate the content! It might be useful at the end to do a quick summary and run through just to cement the idea. Thanks!

  • @letechnicaljames
    @letechnicaljames Před 2 lety +15

    If there is a duplicate, we need to continue to shift the leftmost pointer. Got it. And remove character from substring set. Sliding window technique is highly key to reduce time complexity to O(N) for substring/subarray problems.

  • @pinakadhara7650
    @pinakadhara7650 Před rokem +3

    Great explanation! I used dictionaries instead of set and moved the max comparison under an if condition so that it runs only when it comes across a duplicate character. It cut the runtime by almost 50%

  • @rhen4610
    @rhen4610 Před 2 lety +3

    Bruuuh i really got stuck for hours on this problem set and almost got 50 lines and this dude did it in 13 lines 😩😩😩😩😩

  • @jigglypikapuff
    @jigglypikapuff Před 11 měsíci +2

    Thanks a lot, great as always! I believe we can slightly improve by using the hashmap, instead of the set, to also keep track of the index where the character appeared last time so that we can instantly move the left pointer past that index, once the duplicate is found.

    • @xyz-vv5tg
      @xyz-vv5tg Před 11 měsíci

      Can you please provide the code using hashmap?

    • @jigglypikapuff
      @jigglypikapuff Před 11 měsíci

      @@xyz-vv5tg More or less
      l = 0
      duplicates = {}
      max_length = 0
      for r in range(len(s)):
      if duplicates.get(s[r], -1) >= l:
      l = duplicates[s[r]] + 1
      duplicates[s[r]] = r
      max_length = max(max_length, r - l + 1)
      return max_length

  • @jasonswift7468
    @jasonswift7468 Před rokem +3

    Using a hashtable to remeber the last index of every char, And keep tracking the starting point of a valid substring. That is a little bit fast compared to using a while loop.

  • @bereketyisehak5584
    @bereketyisehak5584 Před rokem +2

    Space complexity should be o(1) as there are limited number of characters in the alphabet,

  • @ciantim2996
    @ciantim2996 Před rokem +5

    It's not 100% correct right ? I'm not sure in Python but I think Set is not guarantee order so removing left most character in set might lead incorrect result. For example:
    input: "bcabefg" -> expected result: 6 ('cabefg')
    At the point where r = 3, the set is now contains ('b', 'c', 'a) but since it's not ordered it might appear like 'cab', thus we gonna remove all the characters currently in the set
    and leads wrong result 4 (befg)
    Please correct me if something wrong

    • @namankothari8390
      @namankothari8390 Před 2 měsíci

      I am also wondering the same thing, and trying to rack my brain to understand how is it removing items in an order. If you got to know how, please explain

  • @danyeun01
    @danyeun01 Před 2 měsíci

    checking the highest only when a duplicate is detected shaves off a ~10 milliseconds in python albeit the code isnt as compact
    class Solution:
    def lengthOfLongestSubstring(self, s: str) -> int:
    highest = 0
    l, r = 0, 0
    chars = set()
    while r < len(s):
    if s[r] in chars:
    highest = max(highest, len(chars))
    while s[r] in chars:
    chars.remove(s[l])
    l += 1
    chars.add(s[r])
    r += 1
    highest = max(highest, len(chars))

    return highest

  • @Ba2sik
    @Ba2sik Před 8 dny

    What I did, is storing in hashMap each character and its index.
    And then I check , if the left pointer is smaller than hash[letter], I set left = hash[letter]
    therefore skipping the inner while loop.

  • @valentinrafael9201
    @valentinrafael9201 Před měsícem

    Interesting, I ended up with this solution
    l = 0
    r = 0
    max_len = 0
    while r < len(s):
    if s[r] not in set_string:
    set_string.add(s[r])
    r+=1
    max_len = max(max_len, r-l)
    else:
    set_string.remove(s[l])
    l+=1
    return max_len
    Because max_len is updated before the else statement, which increments l, it will not require the ugly "r-l+1". It will require the ugly "r-l" which is one less uglier.

  • @shwethaks7994
    @shwethaks7994 Před 2 lety +5

    This solution does not work for test case 'pwwkew'. Any updates on why it doesn't pass.

    • @jhangtheman
      @jhangtheman Před 2 lety +5

      Make sure you use a WHILE and not an IF on line 8. If you run into multiple duplicate characters like "ww" it only runs the if once, when in fact you need it run twice.

  • @randomkoolstuff
    @randomkoolstuff Před 2 lety +5

    Isn't it better to use a hashmap (or dictionary) to store the index? That way instead of using a while loop, you can have some additional logic to jump the left pointer. And if a character shows up in the hashmap but is behind the left pointer, we can simply update the index in the hashmap and recognize it as part of the string. The math for the distance would be based on the left and right pointers instead of the length of the hashmap.

    • @NeetCode
      @NeetCode  Před 2 lety +3

      Yeah, I agree a hashmap is probably better in this case. That said, the worst-case time complexity should stay the same.

    • @randomkoolstuff
      @randomkoolstuff Před 2 lety

      class Solution:
      def lengthOfLongestSubstring(self, s: str) -> int:

      if len(s) == 0:
      return 0
      seenCharacters = {}
      left = 0
      right = 1
      maxLen = 1
      seenCharacters[s[left]] = left
      # For left, you do not need to go to the end. If right hits end, you
      # have already found the max length, checking from left to the end
      # is unnecessary even though there could be SMALLER valid substrings
      while right < len(s):
      newChar = s[right]
      # Checking if character is in the map and has to be greater than
      # left pointer. This is to allow the left pointer to jump/update without
      # having to delete/remove items from the hashmap
      if newChar in seenCharacters and seenCharacters[newChar] >= left:
      left = seenCharacters[newChar] + 1
      else:
      # Only get max when no duplicates
      maxLen = max(right - left + 1, maxLen)
      # Continuously update the mapping, even if seen before since the new
      # location of the existing character is important for the if
      # condition above
      seenCharacters[newChar] = right
      right = right + 1
      return maxLen

    • @randomkoolstuff
      @randomkoolstuff Před 2 lety

      @NeetCode In the code above, wouldn't it be slightly faster? For example if you had "abcdefghijklmnoppabdce". You wouldn't need to delete every character from the set before p. You would simply jump the left pointer to where 'p' is first time you saw the duplicate in the substring + 1. The reason I do this and not just update to where the new duplicate character is because maybe there is a larger substring from that point. Let me know what you think. I would have thought maybe that though the speed increased, the drawback now is the space complexity since a little more space is needed (and not deleted) for the indexes.

    • @oluwatomilola
      @oluwatomilola Před rokem

      @@randomkoolstuff hi! please why did you set the max length to be 1 initially in your code?

    • @chrischika7026
      @chrischika7026 Před 8 dny

      it isnt better

  • @lelandconn
    @lelandconn Před 4 lety +9

    Why does changing the solution from "While s[r] in charSet" to "If s[r] in charSet" change the result for the scenario when string is "pwwkew"?

    • @suhailahmed2310
      @suhailahmed2310 Před 3 lety +2

      while will run until the condition becomes false, if just checks once and exits

    • @MIDNightPT4
      @MIDNightPT4 Před 2 lety +12

      if only checks if the left-most character is the same as the current, but this is not always the case. For example: BCAC . Once you reach the right-most C (our current character), an if statement will only remove one character: B. But if we do that, it becomes CAC, and there is still a duplicate. So we use the while loop to loop UNTIL the duplicate letter is gone.

    • @87frasta
      @87frasta Před 2 lety +2

      @@MIDNightPT4 You saved my brain, that's exactly what I was trying to figure it out

  • @mistercosmin3542
    @mistercosmin3542 Před 4 lety +7

    I did hashset of chars+queue+sliding window technique, got 12 ms for C++(faster than 93%). Anyway clean and neat solution bro, congrats!

  • @aurelianoyepez5107
    @aurelianoyepez5107 Před 3 lety +4

    one of the best algo/ds interview prep channels out there!!!!

  • @anirbansarkar6306
    @anirbansarkar6306 Před 3 lety +5

    Thanks a lot. It was short, to the point and an easy to follow video. Really a great content to learn from.

    • @anirbansarkar6306
      @anirbansarkar6306 Před 3 lety +1

      Although if possible, can you add time and space complexity explanations too? Like how have you calculated them. It would help us more.

  • @danielbartolini115
    @danielbartolini115 Před 3 lety +5

    Very clever solution. But can anyone please tell me what the time complexity is? He has a while loop within a for loop so I'm not sure.

  • @shashikantdivekar7839
    @shashikantdivekar7839 Před 2 lety +5

    Very well described and helpful. Quality material. Thank you Sir.

  • @kafychannel
    @kafychannel Před rokem +2

    Thanks for your content, NeetCode. It is a great idea to draw the idea of solution thanks to this I coded this problem up on my own. Thank you so much!

  • @atheerabdullatif7557
    @atheerabdullatif7557 Před rokem

    the #2 while is not needed
    charSet = set()
    l = 0
    c=0
    big = 0
    while(c

  • @darcash1738
    @darcash1738 Před 9 měsíci +1

    I had a different approach that seemed more intuitive and I don’t think it was any slower.
    I had the variables longest currentlongest, and the set.
    Enumerate through the string, if current val in the set, set currlongest to 0, clear the set(O(1)).
    If not, add 1 to current longest, add the Val to the set and check if greater than longest, if so setting equal to longest

  • @xaapt
    @xaapt Před 11 měsíci +4

    I watched this 3 times, still no idea how is it working :(

  • @ap2s2000
    @ap2s2000 Před 2 lety +3

    You can change the way you compute the longest substring by just getting the length of the set

  • @efe3036
    @efe3036 Před rokem

    Thanks for explaining this topics, you made it simple as a,b,c .I have been struggling on data structure, have never solve any problem on my own without making mistakes and searching for help. you made things easy. thank you.

  • @noahmader7325
    @noahmader7325 Před 2 lety +5

    I think space complexity would be O(1) because there are a finitie number of possible characters that could go into your hash/set despite how long the string is.

    • @edenrosales8214
      @edenrosales8214 Před rokem +1

      I agree

    • @wol2231
      @wol2231 Před 10 měsíci

      True. Great observation.
      For anyone wondering, the question has this constraint "s consists of English letters, digits, symbols and spaces."
      By definition, a set can't contain duplicates. so, the set will be finite.

  • @juliomedina5446
    @juliomedina5446 Před 5 měsíci

    A slightly more elegant way (in my opinion!) would be to set res = max(res, charSet.size()). This way you would avoid the possibility of forgetting to add +1 and having a bug in your code during an interview!

  • @midicine2114
    @midicine2114 Před rokem

    Can also exchange the outer for loop with a while loop and manage the right pointer.

  • @rams2478
    @rams2478 Před 2 lety +5

    Nice idea.. but i didnt quite understand how we get 0(n) with inner while loop...wouldn't it be 0(n2)?

    • @nikhil_a01
      @nikhil_a01 Před rokem +2

      Nested loops doesn't automatically mean O(N^2). It matters how many iterations there are in the loops and the time complexity of the operations in each iteration.
      The outer loop is growing the sliding window by iterating the r pointer through the entire string. It's O(N) because it goes through the whole string.
      The inner loop is shrinking the window by iterating the l pointer. But the l pointer never goes past the r pointer. It's basically lagging the r pointer. After all, the left side of the window can't overtake the right side. In the worst case the l pointer will also iterate through the entire string, which is O(N). Overall we get O(2N) which is equal to O(N).
      (And of course the set operations are O(1) so we are only doing O(1) work inside each loop iteration).

    • @manasisingh294
      @manasisingh294 Před 8 dny

      @@nikhil_a01 Thanks for explaining! :D

  • @ElijahManor
    @ElijahManor Před 2 lety +1

    Great resource! I really like how you draw through the concept before coding.

  • @Basta11
    @Basta11 Před 4 měsíci

    The space complexity is not necessarily O(n). The number of characters is going to be finite even using the largest character set known to man. So its O(n) for when n is below the size of the character set and O(1) after that. Of course, if we've reached that point, then that is the max answer.

  • @siyahulhaq7062
    @siyahulhaq7062 Před 7 měsíci

    This approach is strikes on my mind when i see this video on my feed

  • @weiyihuang67
    @weiyihuang67 Před 2 lety +2

    Short and clean solution! Easy to follow, thanks a lot!

  • @amadakram5722
    @amadakram5722 Před rokem

    amazing explaination, but I came up with someting simpler I guess:
    def lengthOfLongestSubstring(self, s: str) -> int:
    if (len(s) == 0):
    return 0
    maxS = 1
    l = 0
    r = 1
    while r < len(s):
    substr = s[l:r]
    if (s[r] not in substr):
    r+=1
    substr = s[l:r]
    maxS = max(len(substr), maxS)
    else:
    l+=1
    return maxS

    • @harrywong4302
      @harrywong4302 Před 2 měsíci

      You shouldn't update the left pointer by simply plus 1, instead do l += 1 + substr.index(s[r])

  • @akinniyiakinyemi2737
    @akinniyiakinyemi2737 Před 7 měsíci

    Great content as always. Just to point out, our space complexity for the set could be o(1) since there'll only ever be 26 chars if we're only dealing with the English alphabets

  • @shaikhevogen7811
    @shaikhevogen7811 Před rokem +2

    umm don't u think this will be O(N^2) since we have a nested loop here ?

  • @ankitasharma4192
    @ankitasharma4192 Před 2 lety +5

    How is it O(n), as we are using a for and a while loop within
    ?

    • @salesrafael
      @salesrafael Před 2 lety +4

      The inner while loop is limited to 26 iterations (alphabet). So, worse case is O(n * 26) -> O(n)
      Here's an improved solution, though: leetcode.com/problems/longest-substring-without-repeating-characters/discuss/1889969/O(n)-faster-than-99.87-Sliding-Window-Optimized

    • @nikhil_a01
      @nikhil_a01 Před rokem +1

      That's completely wrong. It's got nothing to do with the size of the alphabet (which is not even 26 since this problem allows all English letters, digits, symbols and spaces).
      The reason it's O(N) is because the inner while loop across all its iterations is moving the l pointer through the whole string. It can never move the l pointer past the r pointer, and the r pointer iterates once through the entire string.

  • @davinanaya468
    @davinanaya468 Před 3 lety +1

    I love the way you explain, thanks for this explanation with real exercises

  • @chookingvid
    @chookingvid Před 2 lety +6

    Instead of a set, I used an array of length 128 to keep track of the characters via their ASCII value as index. This has terrible performance in Python (slower than half of all submissions), but the performance in C++ and Rust is great. I got 0ms 100% faster than all submissions for both of those languages. I realize that the space complexity is not good, but I was trying to see if I could get it faster than a set. Also, I am aware that this would not work at all for unicode.

  • @mertzorlu386
    @mertzorlu386 Před 2 lety

    guys print(charSet) in while loop after remove and print(charSet) after .add also add a print('after addition) and see the transition making lot of sense!

  • @ankitsablok952
    @ankitsablok952 Před 2 lety +2

    I had the same intuition but then I got hung up on making use of a TreeMap instead of a Set and then kinda lost the track of how to remove the element from the collection. But I guess I was very close to the same solution. Watching the first couple minutes of the video I was able to come up with the code myself. Nice explanation!

  • @rayhanmuktader1064
    @rayhanmuktader1064 Před 7 měsíci +1

    I really like the way you explain things. One question though, is it safe to just remove the leftmost char from charSet since python set does not preserve any order? Can we be sure that the right characters are being removed?

  • @jaidenlee7394
    @jaidenlee7394 Před 4 měsíci

    Wouldn't it be more efficient to use a hashmap/dictionary instead of a set, and store the last seen occurrence of each character. By doing this, you can adjust the left pointer in O(1) since you can just move it past the index of the last occurrence. This makes it so that you never have to go back to an already visited index, unlike using a set which might require you to revisit all previously visited indices.

  • @AlmightyWeick
    @AlmightyWeick Před 4 měsíci

    In the C# solution posted on the NeetCode website, it uses a HashSet to store the substring chars. Is there a reason for this? I assume it just stores the ASCII value, but is there a reason for using HashSet instead of HashSet (which seems more intuitive)?

  • @henrmota
    @henrmota Před rokem

    Thanks for this videos I am learning rust while I implement this algorithms :).

  • @Taranggpt6
    @Taranggpt6 Před 7 měsíci +1

    Can anyone help me to understand that why are we removing the left most character? aren't we supposed to remvoe the duplicate character which just came in while traversing? i.e. s[right]?? @neetcode

  • @Mpjadhav1144
    @Mpjadhav1144 Před 2 lety +1

    i think 12th line should be res=max(res,len(charSet))
    since res= max(res, r-l+1) is giving me the length of original string

  • @JamesBond-mq7pd
    @JamesBond-mq7pd Před 7 měsíci

    class Solution:
    def lengthOfLongestSubstring(self, s: str) -> int:
    array = []
    max_size = 0
    for i in range(len(s)):
    while s[i] in array:
    array.pop(0)
    array.append(s[i])
    max_size = max(max_size, len(array))
    return max_size

  • @ethannguyen6240
    @ethannguyen6240 Před 6 měsíci

    Wow. what a clean explanation. Thanks!

  • @marisaliang3335
    @marisaliang3335 Před 2 lety +2

    Thanks for the great video! A quick question on the space complexity of this problem: the leetcode details mention "s (input string) consists of English letters, digits, symbols and spaces." - would we then be able to consider this O(1) space complexity, as there is a finite max length for our charSet?

    • @arjunreddy2647
      @arjunreddy2647 Před 2 lety

      probably since it’s not a factor of string length

  • @tsw781
    @tsw781 Před 2 lety

    This was a great explanation for this problem. Thank you.

  • @nachiket9857
    @nachiket9857 Před rokem

    I found using a queue easier for internalising this problem better:
    class Solution:
    def lengthOfLongestSubstring(self, s: str) -> int:
    l = 0
    lst = collections.deque()
    res = 0
    for r in range(len(s)):
    while s[r] in lst:
    lst.popleft()
    l += 1
    lst.append(s[r])
    res = max(res, r - l + 1)
    return res

  • @sachins6196
    @sachins6196 Před 2 lety +1

    Beautifully done

  • @Syedaxox
    @Syedaxox Před 3 lety +1

    This was very well-explained. The only thing I don't understand is why you would use a while loop, rather than an if statement?

    • @Syedaxox
      @Syedaxox Před 3 lety +4

      Figured it out a second after asking lol. Nvm. Thank you for the video!

    • @mitramir5182
      @mitramir5182 Před rokem

      @@Syedaxox can you say why he's using a while? We only have one repetition of every character in a set. It's not a list.

    • @nikhil_a01
      @nikhil_a01 Před rokem +1

      ​@@mitramir5182 The left side of the window needs to shrink until there are no duplicates in the window. The window is defined by the l and r pointers. We calculate the max length based on those pointers so we can't have a substring that has duplicates.The set is just a convenient way to quickly check for duplicates.
      Let's say our input is "pwwkew" and right now our charSet contains {p, w}. When we take the second 'w' we see that we have a duplicate because it's already in the set.
      Now let's say we only remove s[l] and increment l once instead of using a while loop. We just removed the 'p' and incremented l to 1. Then we add the second 'w' to charSet. It's true that our charSet is only {w} because it's a set. But our window is "ww" which has a duplicate. This isn't valid, so we're not going to calculate the correct max length later on.
      What we needed to do was keep removing from our window until we get rid of that first 'w', which is why we used a while loop.

  • @shengbinwang5892
    @shengbinwang5892 Před 2 lety +2

    I think the time complexity is large than O(n). Since there is a while loop inside of for loop. Is that correct?

    • @SaifAli-nt5sh
      @SaifAli-nt5sh Před 2 lety

      No. The while at Max can run only n times. And that will not be the case for every loop's iteration.

  • @shanejacob9054
    @shanejacob9054 Před 2 lety

    Quick question.
    Why is the space complexity O(n)?
    The problem description states that s consists of English letters, digits, symbols and spaces. Since we're using a set which holds unique elements of S, the space complexity should be O(number of letters/digits/symbols/spaces), which is a finite number. Giving a space complexity of O(1)

  • @musayo9482
    @musayo9482 Před 2 lety

    It makes it so much clearer! Thanks!

  • @geld5220
    @geld5220 Před rokem

    right-left + 1 that is confusing me but I understood now that The expression right - left + 1 calculates the length of the current substring by subtracting the left pointer left from the right pointer right and adding 1 (because the length of a substring is the difference between the indices plus 1).

    • @geld5220
      @geld5220 Před rokem +2

      believe me it made sense when i was saying it but reading it again just confused the heck out of me

  • @akashvaidsingh
    @akashvaidsingh Před rokem +1

    we are running while loop inside for loop , so why the complexity is not N-square ?

  • @Taziamoma
    @Taziamoma Před 2 lety +1

    This is amazing :D
    Just one question. Why at the end (res = max(res, r- l + 1)), why are we adding 1 at the end?

    • @choleaoum1383
      @choleaoum1383 Před rokem +2

      The 1 is because the for loop begins at 0 otherwise if you have a single item array the subset size would be 0 instead of 1.

  • @SmoothCode
    @SmoothCode Před 3 lety +2

    How does the variable l act as a pointer when it is initialized as 0? What are we accomplishing by incrementing l by 1 each time we find a duplicate character?

    • @MichaelShingo
      @MichaelShingo Před rokem

      When you find a duplicate character, it means that you've found the maximum string length given that L value. So you increment L by 1 until that duplicate is gone. Now you have another L value, from which you can start increasing the window size, to potentially find a larger string length.

  • @sarmadkhawaja2669
    @sarmadkhawaja2669 Před 2 lety +1

    Why don't we use a list instead of set? When you said that set makes sure there is one of each element so does a list for this problem that is. Because you only add item at the right pointer to your list once you have removed it already using the while loop.
    I know appending to list or adding to set are both O(1). But what about removing form list vs set? Is removing from set more efficient and is that why a set is preferred? Asking cuz I was able to submit my solution with list instead of set and it still passed.

  • @H4RSHU17
    @H4RSHU17 Před 10 měsíci +1

    Check Out This one:
    class Solution:
    def lengthOfLongestSubstring(self, s: str) -> int:
    temp = ans = ""
    for i in s:
    if i in temp:
    temp = temp[temp.index(i) + 1:]
    temp += i
    ans = temp if len(temp) > len(ans) else ans
    return len(ans)

  • @sahilverma6160
    @sahilverma6160 Před rokem

    My solution a bit simple approach
    a="abcabcbb"
    b=[]
    for v in range(len(a)):
    for i in range(len(a[v:])):
    if len(a[v:][:i+1])==len(set(a[v:][:i+1])):
    b.append(a[v:][:i+1])
    print(len(max(b,key=len)))

  • @pradeethbathina3474
    @pradeethbathina3474 Před rokem

    Hey @NeetCode, I think it should be if clause instead of while.... because now you got complexity as O(n^2)

    • @nikhil_a01
      @nikhil_a01 Před rokem +1

      It's not O(N^2). There are already a couple of comments on this video explaining why it's O(N). And you can't just change the while loop to an if statement. The algorithm won't return the correct results. You have to shrink your window until there are no more duplicates, which you can't do with just an if statement.

  • @seanshaw3493
    @seanshaw3493 Před 3 měsíci

    I am just wondering why the space complexity is O(n) but not O(1)? Since we are always keeping each distinguished char appearing only once, the max length of the HashSet is fixed and depands on how many distinguished chars we have in the specific language. Thus the space complexity should be O(some fixed number) -> O(1). Please enlight me if I got something wrong

  • @beransantur7611
    @beransantur7611 Před měsícem

    I didn't quite understand why the worst time complexity is o(n) since we could potentially check the whole input size twice imagine an input such this. "abcddcbaabcd " don't we check the whole thing twice?

  • @momenwadood1342
    @momenwadood1342 Před 2 lety +1

    why use a set if we remove duplicates before we add the next char?

  • @patilashish
    @patilashish Před 2 lety +1

    Hey @NeetCode the while statement could result in O(n) operation for last char in a no duplicate string.
    Hence the time complexity would be O(n^2).
    A constant time operation N times is O(N) operation.

    • @andresivanmonterocassab5486
      @andresivanmonterocassab5486 Před 2 lety +6

      Hi, yes is a bit tricky when you have a while loop inside a for loop, but what helps me is to try to see how many times you will visit each character, you are right that the worst case for the while statement would be O(n) but this could only happen one time (the worst case, all the other cases you will only add and pop a few chars), so in the end you will visit each element once inside this while loop and you will visit each element once in the for loop so the time complexity is O(2n) which converges to O(n)

    • @rekcahY
      @rekcahY Před rokem

      @@andresivanmonterocassab5486 nice, I was scratching my head on this because it totally looked like O(n*2) to me... haha

  • @whitebaik
    @whitebaik Před 2 lety +1

    Thanks for the great solution. I'm a little rusty on the set and don't understand why the left pointer variable is declared outside the for-loop, instead of inside. What confuses me is that I thought the left pointer has to re-initialize to 0 every time we delete every duplicate and add the same character at the end of the set. For example, when we are at the iteration for the second "a" in string "abcab", the left pointer is incremented to 1 after removing the first 'a' and placing the second "a" at the end (which will result in "bca"). And then, if we want to access "b" in the next iteration, aren't we supposed to have left pointer at the 0th index instead of 1st?

    • @lronmate
      @lronmate Před 2 lety +15

      It's key to realize that we're not actually removing any indices from the original string, s. When we run into a duplicate char, we remove the instance of that char from the SET, and move the left pointer ahead by 1 in the original STRING to represent the new window. We can't modify the original string, otherwise it would mess with using the for loop.
      edit: I know this was asked a few months before me answering it, but I figured I'd still comment in case anyone else has the same question.

  • @ianokay
    @ianokay Před 10 měsíci

    What wasn't immediately intuitive to me is that we can stop when the right pointer hits the end and not care about the left doing any more things (shortening I guess).

  • @user-xm4lj4zc7u
    @user-xm4lj4zc7u Před 11 měsíci

    one of the best explanation

  • @vikrantsanke3554
    @vikrantsanke3554 Před 2 lety

    Thanks a lot man.. easy understable solution

  • @noturbiz4670
    @noturbiz4670 Před rokem

    It's a very beautiful solution.

  • @Jod4light
    @Jod4light Před 2 lety

    Great approach!

  • @venkatanishanthchaganty1697
    @venkatanishanthchaganty1697 Před 10 měsíci

    Just curious, Wouldn't the space be O(1) because even in the worst case, we know for a fact that the size of the set is not going to exceed 26 assuming every character is lowercase. Essentially, it is not scaling with input size and hence, we can say it has constant space complexity.

  • @MrWardo2009
    @MrWardo2009 Před 2 lety

    Thank you for doing this video!

  • @KhalidTubing
    @KhalidTubing Před 7 měsíci

    Hi, many thanks! however, there's probably a more optimal solution, as this approach beats 51% only of LeetCode Ruby users, and beats less in terms of memory :)

  • @yinglll7411
    @yinglll7411 Před 2 lety

    Thank you for your solution!

  • @derrickaddo5531
    @derrickaddo5531 Před 3 lety +2

    It is it really O(n) if you are looping again to remove the chars from the set? Why not use an if statement instead of the while loop?

    • @gamrygamer5690
      @gamrygamer5690 Před 3 lety

      you have to keep on removing elements to make the elements unique in the substring

    • @arturlamaka1400
      @arturlamaka1400 Před 3 lety +3

      Agree. It should be O(n^2) as we have loop in loop, isn't it?

    • @ZubairKhan-tt8ev
      @ZubairKhan-tt8ev Před 3 lety +6

      @@arturlamaka1400 that while loop is capped to 26 as there are only 26 possible letter (entire alphabet) that can constitute the set. so worst case is O(n * 26) -> O(n)

    • @nikhil_a01
      @nikhil_a01 Před rokem

      It's got nothing to do with the size of the alphabet (which is not even 26 since this problem allows all English letters, digits, symbols and spaces).
      The reason it's O(N) is because the inner while loop across all its iterations is moving the l pointer through the whole string. It can never move the l pointer past the r pointer, and the r pointer iterates once through the entire string.

  • @MichaelShingo
    @MichaelShingo Před rokem

    wow so concise

  • @LandoKalrissian
    @LandoKalrissian Před rokem

    @NeetCode The space complexity is O(1), not O(n), because the description says "s consists of English letters, digits, symbols and spaces.", which means the size of the set of different characters is fixed (less than 127, size of the ASCII table).

  • @arturozenteno7524
    @arturozenteno7524 Před 2 lety +1

    Why is it r - l + 1? Why the + 1?

  • @antoinenijhuis450
    @antoinenijhuis450 Před 6 měsíci

    Can someone explain me why the following code is slower runtime wise? It performs about 10 times slower than the solution provided by Neetcode. But I have no idea why.
    longestSS = 0
    firstLettIndex = 0
    collection = {}

    for i in range(len(s)):
    if s[i] not in collection:
    collection[s[i]] = i
    else:
    if len(collection) > longestSS:
    longestSS = len(collection)
    firstLettIndex = collection[s[i]] + 1
    collection[s[i]] = i
    collection = {}
    for j in range(firstLettIndex, i+1):
    collection[s[j]] = j

    if len(collection) > longestSS:
    longestSS = len(collection)
    return longestSS

  • @algosavage7057
    @algosavage7057 Před 2 lety

    Best explanations ever!

  • @tachyon7777
    @tachyon7777 Před 6 měsíci

    You don't need the inner while loop or the .remove operations.

  • @chandanagarwal2827
    @chandanagarwal2827 Před 3 lety

    Thanks for making it dude

  • @raximshokirov
    @raximshokirov Před rokem

    Solution is O(2n), may implemenet in O(n). I mean could be optimized.

  • @floydian25
    @floydian25 Před rokem

    so why use set? Cant we use Stack instead? do we require the ability if set that doesn't store duplicates?