When Claude 3.5 Sonnet Became The Better Chatbot

bycloud

zhlédnutí 37 860

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 3. 08. 2024
Scale your business easily now with AI/ML API to deploy any models you need aimlapi.com/?ref=bycloud
My newsletter
mail.bycloud.ai
Claude 3.5 Sonnet
[Blog] www.anthropic.com/news/claude...
[Website] claude.ai/
This video is supported by the kind Patrons & CZcams Members:
🙏Andrew Lescelius, alex j, Chris LeDoux, Alex Maurice, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth
[Discord] / discord
[Twitter] / bycloudai
[Patreon] / bycloud
[Music] massobeats - mango tea
[Profile & Banner Art] / pygm7
[Video Editor] Silas
Věda a technologie

Komentáře • 158

@bycloudAI Před 27 dny ⁺¹³
Scale your business easily now with AI/ML API to deploy any models you need! aimlapi.com/?ref=bycloud
and Claude 3.5 Opus is about to be glorious
@VigilVanDijk666 Před 26 dny ⁺⁹²
Saw someone get it to make a 3D fps shooter, it absolutely blew my mind
@springheeledjackofthegurdi2117 Před 26 dny ⁺⁷
Do you have a link?
@Deathington. Před 26 dny ⁺⁸
I had it make a voxel engine with raytracing.
@jaredf6205 Před 26 dny ⁺²
The only thing it needs next is to be able to have longer code and to be able to automatically read and fix bugs and errors.
@JonDoe-uq1mk Před 26 dny
Probably ripped off someone's code from GitHub
@Always.Smarter Před 26 dny ⁺¹
How?
@captaindryvids6909 Před 26 dny ⁺¹²⁰
ChatGPT Pro feels like such a rip-off with Claude Sonnet 3.5 being around and still no voice mode being released.
@Deathington. Před 26 dny ⁺⁸
Remove your subscription
@xthesayuri5756 Před 26 dny
cancel your subscription!
let OpenAI feel that they cant get away with cucking us and GPT-5
@ps3guy22 Před 26 dny ⁺⁷
Remove your subscription
@ckq Před 26 dny ⁺¹
I think the GPTs and integration with Dalle is it's value proposition.
@ytubeanon Před 26 dny ⁺³
hmm, with ChatGPT I know you can do talk and reply on their mobile app, but it's not as advanced as the gpt-4o demos... the API allows some voices for TTS, I had it create an audiobook, but yeah still not Scarlett Johansson quality
@yashizuko Před 26 dny ⁺⁷⁶
I am so happy I switched from chatgpt to Claude, like the amount of fucking stress I had generating and correcting prompts on chargpt was soo bad, Claude is a breeze working with, absolutely a pleasure, the same feeling I had when chatgpt 4 was actually smart and not lobotomised, and projects are amazing to manage shit codebases
@M-dv1yj Před 26 dny ⁺¹⁵
Claude is not much different once your level of complexity gets high enough both Claude and gpt respond the same.
For mid intensity use Claude is better. But once really pushed it breaks down just the same as gpt.
@jonathancarter5120 Před 26 dny ⁺²
@@M-dv1yj I agree with your assessment.
@Omega-mr1jg Před 26 dny ⁺³
I worked with claude, the prompting is easy
when i switched to chatgpt to test out some other things, it blew up on my face
not even chat gpt 4.0 fixed it,
claude was better since haiku came out
@SahilP2648 Před 26 dny
You should checkout out abacus AI. I am not rate limited when using it, and it also supports team related shared RAG like claude projects.
@bigglyguy8429 Před 26 dny ⁺³
I've had the total opposite experience. I cancelled my Claude sub as it was stressful trying not to offend the prudish thing. It was like talking to Karen, the biggest prude in the office, who's best friends with Helen in HR. I'd just rather not deal with her in the first place, than keep tip-toeing around the model's sensitivities. Heard of so many people being randomly banned that I pro-actively banned myself and stuck with GPT and running my own models locally. Claude isn't fit for purpose, unless your name is Karen... Plus GPT has web access, remembers my details and can make pics, for the same price.
@dennisnotamenace2047 Před 26 dny ⁺³¹
ClosedAI is coping like crazy
@WillHarmon Před 26 dny ⁺²⁷
I asked ChatGPT to make an image of a website, and then passed that image to Claude and simply asked it to build the website. I was surprised at how close it got on its first attempt. Not perfect, by far, but impressive still.
@carkawalakhatulistiwa Před 25 dny
2 ai working together
@Vysair Před 11 dny
@@carkawalakhatulistiwa multi-modal be like
@Rortti Před 26 dny ⁺⁴¹
Last banana was close
@meh2285 Před 26 dny ⁺⁷
I found that for visuals that Claude produces, often taking a screenshot of what it produced and uploading it so Claude can see what it's code made usually yields better results.
@DBonacich Před 26 dny ⁺⁹
FYI: As an avid user of both the Claude Pro subscription and the Claude API, I can confirm that you can use the API even if your Pro limits are reached. The subscription and the API use two completely different rate limiting and pricing mechanisms.
@ytubeanon Před 26 dny
are the quality of responses similar from the API to Pro/Artifacts?
@4ytherium Před 26 dny ⁺¹
Prepare to get your wallet fucked by using the API
@DBonacich Před 26 dny
@@ytubeanon It depends on the way you use the API. For the most part, the model is the same. I use the API for things like open-devin and TypingMind.
@DBonacich Před 26 dny ⁺¹
@@4ytherium The Claude 3.5 API cost is honestly pretty cheap. If you're a lite user, the costs would likely come out to be cheaper than the Pro subscription. But if you're a heavy user, or you're using the API for a program that makes a ton of calls, the costs definitely add up.
@countofst.germain6417 Před 26 dny
He was just talking about tos
@doingtime20 Před 23 dny ⁺³
I've been using sonnet for my web dev job and it works great. I'm working double the speed because of it.
Granted I don't do super complex programming, but still it's amazing and pretty accurate.
I also find myself bouncing off ideas with Claude, as if he was a coworker. We discuss the pros and cons of different approaches and such.
@Guedez1 Před 26 dny ⁺¹⁵
Available to everyone unless you are a Brazilian that is, which people assume it's due to our data privacy laws
@RonyPlayer Před 26 dny ⁺¹
Download opera, my fellow Brazilian
@Earthball_Productions Před 26 dny
@@RonyPlayer1st Opera VPN isnt a real VPN, It sends your data to china, 2nd, you need a phone number.
@seto007 Před 26 dny ⁺¹
Can't comment specifically on Brazil, but since Claude is not available in Canada, I've found that using an American VPN and processing my payment while connected to that VPN, despite the card being Canadian, allowed me to use it just fine
@gustavodutra3633 Před 26 dny ⁺²
Brazil mentioned 🇧🇷
@johndank2209 Před 26 dny ⁺¹
Brazil's got these super strict data privacy laws, but meanwhile there's a ton of brutal killings going down every day. Kinda makes you wonder if they've got their priorities straight.
@middle-agedmacdonald2965 Před 26 dny ⁺⁴
I think this hop scotch from one being the best, to another, is all by design. I mean the NSA, Apple, and Microsoft seem to think Open AI is "the one". Good for Anthropic, but obviously this will be short lived. In my opinion, these other companies are letting their users use up all of their compute, while Open AI is still the bar to beat, but they aren't "wasting" their compute on the end user.....which means they're definitely using it for advancement. I know nothing, but this seems very logical to me. If they released Sora, it'd bog their compute down, for example.
@picklechill Před 26 dny ⁺²
Oh, hey! Thanks for breaking this down.
@Seventhsu Před 26 dny ⁺⁸
I'm still waiting for Claude to be able to display LaTeX natively and do math using it. That and the lack of a mobile app are the only things holding me back from switching
@edenassos Před 22 dny
Lack of a mobile app? Have you been living in a cave?
@bananabatsy3708 Před 13 dny
@@edenassos The mobile app for Claude is out.
@poisonza Před 26 dny ⁺⁷
It seems like Sonnet 3.5 is trained using dialogue generated through a ReAct-CoT and DFSDT-like workflow. For example, it almost always includes phrases like "let me explain step by step" in response to general requests or questions. I can get a somewhat similar result with ChatGPT, but it requires more specific prompting.
@Leonhart_93 Před 25 dny
Yes, it requires specific prompting. So they can give Claude a specific hidden system prompt, telling it to explain it to you step by step when applicable. So it probably has nothing to do with the training, the data is too massive to ever be specific.
If you ever played with local LLMs, you can give them any system prompt making them behave in a very specific way. They can even impersonate characters like that.
@poisonza Před 25 dny
@@Leonhart_93 the "step-by-step" behavior of Sonnet 3.5 is likely a result of a combination of factors, including its training data, architecture, and possibly a hidden system prompt (as you pointed out). It's a complex interplay that makes it difficult to isolate the exact cause.
@Leonhart_93 Před 25 dny
@@poisonza The training data is immense. Likely TBs in size. You can filter the data, but there is no way to "arrange" the final model, the trainers don't have control over specifics, it's mostly a black box.
The control part comes when they set up alignment layers over the training data and then the specific hidden system prompts.
I know that very well from using models with the alignment completely removed, you can get it to be whatever you want it to be and easily change it with each chat.
@poisonza Před 25 dny
@@Leonhart_93 what do you mean by alignment layer? alignment is done by RLHF or DPO. There is no specific layer for this can you point out reference papers?
@Leonhart_93 Před 25 dny
@@poisonza How do think they censor models like ChatGPT and Gemini and everything else public facing? They only trained those once in the past, there is no way they would spend millions of dollars and days to re-traing them for each small modification.
Here come the alignment layers, basically system prompts that tell the model how to answer and what they should not answer.
And when you remove those layers programmatically, you get something capable of answering anything. Because under those layers, the model is unchanged.
@OperationDarkside Před 26 dny ⁺³
If Claude Sonnet 3.5 can write decent GLSL code, I might be able to continue my game project.
@rockochamp Před 26 dny ⁺⁴
and don't forget the new project feature, which is actually great and helped me gigantic hugely mega wooow with my master thesis...
@ytubeanon Před 26 dny
how did it help, were you collaborating with other people?
@micbab-vg2mu Před 26 dny
Claude 3.5 is amazing:)
@AbelShields Před 26 dny
I used Claude 3.5 to give coeing style feedback on a pwrsonal project and it made some really good suggestions for how to clean up the style and organise it in a more modular way, it was really impressive.
@SangoProductions213 Před 26 dny ⁺¹
I've personally had literally 0 problems with especially the latest chatgpt models, but .. I'm going to try and use both for my new project and see which helps me most. Worst case scenario is spending a bit longer on the accelerated process that ai provides. Best case would be to find where both well and truly shine.
@ainet8415 Před 26 dny
Good job 🎉🎉🎉🎉❤❤❤
@reipuradinata8766 Před 26 dny ⁺³
I'm not sure Claude good enough yet. Here's what I asked:
Me: someone said "Traps are full of androgen receptors", is that true?
Claude: That statement is not accurate. "Trap" is a slang term sometimes used to refer to transgender women or crossdressers, which many consider offensive. Regardless of gender identity or expression....
Me: WTF, traps refer to trapezius muscle
Claude: I apologize for the misunderstanding. You're absolutely correct, and I thank you for the clarification. In this context, "traps" indeed refers to the trapezius muscles. Let me address the statement with this correct interpretation: "Traps are full of androgen receptors" is generally true....
Maybe I'll try again in 6 months.
@gamershd6225 Před 24 dny ⁺¹
I think imma hold out till gpt5
@lumiey Před 26 dny ⁺¹
bruh it took 2 prompts for claude to copy someone's shader code (1st prompt was kinda nice tho)
@AndyIsHereBoi Před 24 dny
I thiught this was a fireship video by the thumbnail
@paulzhu7694 Před 24 dny
In fact, after using GPT-4 and Claude 3.5, the biggest impression is that GPT-4 seems like a blockhead, while Claude 3.5 feels a level above GPT-4 in terms of comprehension abilities in various aspects. It feels more like a true AI
@SandTiger42 Před 26 dny ⁺³
I kept waiting for the video to get to the roleplay portion because I read "chatbot". My bad. My ideal AI can not only help me with productivity tasks, but alo put me in a book adventure. I want it to create a novel where I get to direct the protagonist. I think the main use for RP AI now is for weird smut and girlfriends/boyfriends. I'm ok with that if it gets me to my choose your own adventure chatbot.
@ytubeanon Před 26 dny
I heard of a technique to write a novel where you use A.I. to write one paragraph and then you write the next and alternate back and forth
@cajampa Před 26 dny ⁺³
Have you explored Koboldai and Silly Tavern?
@SandTiger42 Před 26 dny
@@cajampaKobold uses some pretty small and puny LLM's. There are much better LLM's out there to use. They just cost either a lot of money to host, or crazy hardware to selfhost. I do use ST, and I plug in API's from various LLM's depending the use. But we're still years away from having novel length and novel quality choose your own adventures.
@cajampa Před 26 dny
@@SandTiger42 What are you talking about dude. The only limit is the hardware you have available. So where do you get this BS that Kobold is a fixed thing of small and puny LLM. It is just like most other open source implementations of llm hosting. You can use what ever llm you have available on what ever hardware you have available. And most of the time your money you can and want to spend on it is the limit.
@SandTiger42 Před 26 dny
@@cajampa I just checked out the github page for a tiny bit. They mainly only list 7b and 13b models. Not my fault they aren't very clear what their program does. *shrug*
@big_mac_love Před 24 dny
Wow, what a neat tool! 🤩😯 - - That would be something a bot would say, so I just say the following: I yet got not thaaat much experience with using Sonnet3.5 in my workflow (E2E test generation). The little I tested with it I would say that code quality is not better than what gpt4o returns, but at some point its hard to compare it by code quality since its a weird subjective metric... for that I used anthropic's API, but the webUI feature where it runs code automatically seems really cool tho... as long as it doesnt run malicious code for a reverse shell automatically or shit like that xD
@sofia.eris.bauhaus Před 24 dny
a square (typically with diagonal sides) is commonly called a "diamond" so i don't think it should be blamed for that. i think when generating code, you want to err on the side of small size that can later expanded upon, rather than go for a maximalist interpretation of the task at hand.
@CMak3r Před 24 dny
I found at least 5 repositories with 2D subway surfers game mechanics
@Kutsushita_yukino Před 26 dny
the response sounds so robotic though but pretty smart
@Matt-bp5vy Před 26 dny ⁺⁴
First comment not by bot!
@dv_interval42 Před 16 dny
I have never hit a message limit on Claude. I strictly follow the "new-thread" guidelines and non-repeating pasted content parts. I think they base their "message limits" more around tokens processed rather than pure messages like OpenAI does.
@onlyyoucanstopevil9024 Před 25 dny
Yeah
@ckq Před 26 dny
The limit is pretty fair (I think higher than GPT 4os limit)
@blackpiller3777 Před 25 dny
no its not
@Kotakill Před 26 dny ⁺¹
I am a power user of both. I didn’t like that Claude’s PRO subscription limit it waaay lower then ChatGPTs. Also I really don’t understand why Claude needs to write fixes to its code into a separate file(windows) each time, when I first saw this feature I assumed it will make changes in the same window. Also Claude does not accept a lot of file extensions. I was furious when it refused to accept my .cs files that were accepted fine by ChatGPT
@ytubeanon Před 26 dny ⁺²
agreed about usage limit, it sucks that Artifacts works by reviewing the entirety of everything written before every time it updates something
@cajampa Před 26 dny ⁺¹
I read here in the comments to this video that the limit is separate on the API side of things. So if you need more you can always use it that way.
@Laszer271 Před 25 dny
For LLM to work with data (like in .csv format), it shouldn't look at the data directly, it should interact with the data through code. That's how openai does this and I would bet that's how it works for claude too. Because of that 50 or 5000 records shouldn't make the difference, it all comes down to df.head(), df.groupby() and df.sum() anyway.
@starpause Před 25 dny
huhu i got it 😋😋
@TheMattaBase Před 26 dny
Projects are available for Pro users not just Teams.
@ShowmanSharma Před 26 dny ⁺¹
Need some suggestion here, please.
Claude doesn't currently connect to the internet, right? So from a utility standpoint, don't you think ChatGPT 4o (pro access) is more useful than Claude 3.5 (pro access)?
I have a master's thesis to work on and a SaaS product to build. Please help me make a decision.
@countofst.germain6417 Před 26 dny ⁺³
ChatGPT is probably the better bet, Claude doesn't search the net, ChatGPT has far more features and you hit the limits with Claude far far easier
@trench01 Před 26 dny
i cant get on it. also any updates on alpaca ?
@Dannnneh Před 26 dny ⁺¹
They let *everyone* use it? Even those in Europe like me?
@gaggix7095 Před 26 dny ⁺¹
Not having a problem using it in the EU
@NighttimeJuneau Před 26 dny
@@gaggix7095And you just signed up like that, no VPN, with a European state credit card? Because it’s been restricted for a very long time.
@Kynatosh Před 26 dny ⁺¹
Still waiting on a good AI that can know my entire codebase and help me code.
If anyone knows any, please do tell
@alansmithee419 Před 26 dny
Will do as soon as one exists in, say, 2025/6?
@danacosta5237 Před 13 dny
There are some already like codeium (free), they still have the same issues that they all do have with hallucinations. Codeium's autocomplete can speed you up sometimes i've found especially with boilerplate
@Khari99 Před 26 dny ⁺¹
I've been using Claude Opus and Sonnet for a while now. Sonnet is great but model size definitely still matters. Sonnet falls apart when it comes to long complex prompts. Opus on the other hand handles complexity better than any other model I've ever used. For most things Sonnet is fine though. I can't wait until Opus 3.5 drops
@TheSonOfDumb Před 26 dny ⁺¹
This is what happens when you don't waste time having Kenyan contractors RLHF your LLM to death.
@RedOneM Před 25 dny
0:50 nothing is uniquely synthesized by an AI model...
@nuvotion-live Před 26 dny
I’m not paying for Claude unless they give the same amount of usage as ChatGPT. It’s way way too little. I’m coding, and if I’m working for hours I never want to hit a limit. And never have for ChatGPT Teams. I do use the Claude API for integration in my editor though.
@Purpbatboi Před 26 dny ⁺¹
for free- IF you live in the US
if not then not free
@gaggix7095 Před 26 dny ⁺²
I'm not in the US and it's free.
@Pawnsappsee Před 26 dny ⁺¹
Its free😊
@Purpbatboi Před 26 dny
@@Pawnsappsee not*
@FinalMorningstar Před 22 dny
@@Purpbatboiit is lol. im not in the US, and also if this was such a huge problem, just use a vpn...
@murillodaniel9208 Před 26 dny ⁺¹
Shame I can't use Claude since there is no phone option of my countrie
@powervidsful Před 23 dny
Code doesn't get stolen just copy and pasted.
@Arcticwhir Před 26 dny ⁺¹
wow yeah, tired several prompting tricks - even uploaded pic of a banana for reference, it couldnt draw an svg of a banana correctly.
@illyaeater Před 26 dny ⁺¹
where is gemini bruh 2 mil context code executor filter disableler juiced out the wazoo
@Mobay18 Před 25 dny
I would highly advice against uploading anything to this. Your files are not private.
@HasimFN Před 26 dny
gpt has less context but you can just start new conversation. claude u need to wait 5 hours
@brg2147 Před 26 dny
I also like gemini 1.5 pro for coding, much better than gpt
@desmondmailswhite8023 Před 26 dny ⁺⁴
"... allows everyone to access their latest model"
No they don't. It's not accessible in China, Russia, Iran, Cuba, Venezuela, Iran, Afghanistan, Ethiopia, Egypt, Myanmar (maybe somewhere else, lazy to check them all), which is almost two billion people.
It's not like I care, they can restrict it all they want. But it's not pleasant to hear myself being excluded from "everyone on earth".
@Alice_Fumo Před 26 dny ⁺²
While your statement is true (I assume), you could probably use a VPN to access the model as a free user at least? As a european I used to not be able to subscribe to Claude, but was happy to find this was changed in the meantime. Free tier isn't enough for a power user, but it might be for someone who only occasionally throws some queries at a language model throughout the day. I don't know your use cases.
@Earthball_Productions Před 26 dny
You are required to use a phone number @@Alice_Fumo
@nathanb011 Před 26 dny
You can't really get past import restrictions unfortunately
@gaggix7095 Před 26 dny
China (1.5 bln people) would block the site regardless
@willienel4906 Před 19 dny
Fireship rip off 😂
@NormTurtle Před 26 dny
your late dude
@bikkihsn Před 17 dny
Until Claude is censored i would never pay for it, such a waste of potential
@thetabletopskirmisher Před 21 dnem
I'd use Claude in a flash if it wasn't so useless for creative writing. I tried using one of the Claudes to help me with worldbuilding for my fantasy novel.
I asked it to describe the Pale Man in Pan's Labyrinth for me. It refused. 'I dont feel comfortable bla bla bla'. Useless!
@aidajam3294 Před 25 dny
It is garbage. I have really long experience using different GPTs (in programming). Claude (at least Sonnet) is making up stuff, therefore you should pay more attention how precise it is. In the end you will spend more time. On contrary ChatGPT (4o) is verbosy (even with your custrom instruction) but more accurate. IMHO, it is better to get annoying verbosy results instead of keeping constantly in eye on result's precison.
@mkultravictim0 Před 26 dny
30 mins no likes, bro fell off
@JohnnysaidWhat Před 26 dny ⁺¹
chatgpt has become absolute trash. There is a new king.
@override367 Před 25 dny
Claude is an absolutely worthless chatbot lol, it can't do anything other than G rated fiction. You can't even use it to like, create D&D characters, it says that it's "potentially harmful power imbalances" to create characters, just absolutely worthlessly hobbled
@human_shaped Před 26 dny
It's not "codes" it's "code". It is source code, it is not source codes. If you don't want to sound clueless, start saying "code" even in the plural context.
@jld-ni3vf Před 26 dny
Chill out man, its only a word
@freedom_aint_free Před 26 dny
Yes, but it's far more nannying and woke for sure!
@4.0.4 Před 15 dny
That fractal animation stolen from @kishimisu was pretty blatant! In fact, I wonder if Claude learned it from the many people who uploaded their versions of it (myself included) to various online code repositories / showcase sites.

Další v pořadí

Automatické přehrávání