When Claude 3.5 Sonnet Became The Better Chatbot
Vložit
- čas přidán 3. 08. 2024
- Scale your business easily now with AI/ML API to deploy any models you need aimlapi.com/?ref=bycloud
My newsletter
mail.bycloud.ai
Claude 3.5 Sonnet
[Blog] www.anthropic.com/news/claude...
[Website] claude.ai/
This video is supported by the kind Patrons & CZcams Members:
🙏Andrew Lescelius, alex j, Chris LeDoux, Alex Maurice, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth
[Discord] / discord
[Twitter] / bycloudai
[Patreon] / bycloud
[Music] massobeats - mango tea
[Profile & Banner Art] / pygm7
[Video Editor] Silas - Věda a technologie
Scale your business easily now with AI/ML API to deploy any models you need! aimlapi.com/?ref=bycloud
and Claude 3.5 Opus is about to be glorious
Saw someone get it to make a 3D fps shooter, it absolutely blew my mind
Do you have a link?
I had it make a voxel engine with raytracing.
The only thing it needs next is to be able to have longer code and to be able to automatically read and fix bugs and errors.
Probably ripped off someone's code from GitHub
How?
ChatGPT Pro feels like such a rip-off with Claude Sonnet 3.5 being around and still no voice mode being released.
Remove your subscription
cancel your subscription!
let OpenAI feel that they cant get away with cucking us and GPT-5
Remove your subscription
I think the GPTs and integration with Dalle is it's value proposition.
hmm, with ChatGPT I know you can do talk and reply on their mobile app, but it's not as advanced as the gpt-4o demos... the API allows some voices for TTS, I had it create an audiobook, but yeah still not Scarlett Johansson quality
I am so happy I switched from chatgpt to Claude, like the amount of fucking stress I had generating and correcting prompts on chargpt was soo bad, Claude is a breeze working with, absolutely a pleasure, the same feeling I had when chatgpt 4 was actually smart and not lobotomised, and projects are amazing to manage shit codebases
Claude is not much different once your level of complexity gets high enough both Claude and gpt respond the same.
For mid intensity use Claude is better. But once really pushed it breaks down just the same as gpt.
@@M-dv1yj I agree with your assessment.
I worked with claude, the prompting is easy
when i switched to chatgpt to test out some other things, it blew up on my face
not even chat gpt 4.0 fixed it,
claude was better since haiku came out
You should checkout out abacus AI. I am not rate limited when using it, and it also supports team related shared RAG like claude projects.
I've had the total opposite experience. I cancelled my Claude sub as it was stressful trying not to offend the prudish thing. It was like talking to Karen, the biggest prude in the office, who's best friends with Helen in HR. I'd just rather not deal with her in the first place, than keep tip-toeing around the model's sensitivities. Heard of so many people being randomly banned that I pro-actively banned myself and stuck with GPT and running my own models locally. Claude isn't fit for purpose, unless your name is Karen... Plus GPT has web access, remembers my details and can make pics, for the same price.
ClosedAI is coping like crazy
I asked ChatGPT to make an image of a website, and then passed that image to Claude and simply asked it to build the website. I was surprised at how close it got on its first attempt. Not perfect, by far, but impressive still.
2 ai working together
@@carkawalakhatulistiwa multi-modal be like
Last banana was close
I found that for visuals that Claude produces, often taking a screenshot of what it produced and uploading it so Claude can see what it's code made usually yields better results.
FYI: As an avid user of both the Claude Pro subscription and the Claude API, I can confirm that you can use the API even if your Pro limits are reached. The subscription and the API use two completely different rate limiting and pricing mechanisms.
are the quality of responses similar from the API to Pro/Artifacts?
Prepare to get your wallet fucked by using the API
@@ytubeanon It depends on the way you use the API. For the most part, the model is the same. I use the API for things like open-devin and TypingMind.
@@4ytherium The Claude 3.5 API cost is honestly pretty cheap. If you're a lite user, the costs would likely come out to be cheaper than the Pro subscription. But if you're a heavy user, or you're using the API for a program that makes a ton of calls, the costs definitely add up.
He was just talking about tos
I've been using sonnet for my web dev job and it works great. I'm working double the speed because of it.
Granted I don't do super complex programming, but still it's amazing and pretty accurate.
I also find myself bouncing off ideas with Claude, as if he was a coworker. We discuss the pros and cons of different approaches and such.
Available to everyone unless you are a Brazilian that is, which people assume it's due to our data privacy laws
Download opera, my fellow Brazilian
@@RonyPlayer1st Opera VPN isnt a real VPN, It sends your data to china, 2nd, you need a phone number.
Can't comment specifically on Brazil, but since Claude is not available in Canada, I've found that using an American VPN and processing my payment while connected to that VPN, despite the card being Canadian, allowed me to use it just fine
Brazil mentioned 🇧🇷
Brazil's got these super strict data privacy laws, but meanwhile there's a ton of brutal killings going down every day. Kinda makes you wonder if they've got their priorities straight.
I think this hop scotch from one being the best, to another, is all by design. I mean the NSA, Apple, and Microsoft seem to think Open AI is "the one". Good for Anthropic, but obviously this will be short lived. In my opinion, these other companies are letting their users use up all of their compute, while Open AI is still the bar to beat, but they aren't "wasting" their compute on the end user.....which means they're definitely using it for advancement. I know nothing, but this seems very logical to me. If they released Sora, it'd bog their compute down, for example.
Oh, hey! Thanks for breaking this down.
I'm still waiting for Claude to be able to display LaTeX natively and do math using it. That and the lack of a mobile app are the only things holding me back from switching
Lack of a mobile app? Have you been living in a cave?
@@edenassos The mobile app for Claude is out.
It seems like Sonnet 3.5 is trained using dialogue generated through a ReAct-CoT and DFSDT-like workflow. For example, it almost always includes phrases like "let me explain step by step" in response to general requests or questions. I can get a somewhat similar result with ChatGPT, but it requires more specific prompting.
Yes, it requires specific prompting. So they can give Claude a specific hidden system prompt, telling it to explain it to you step by step when applicable. So it probably has nothing to do with the training, the data is too massive to ever be specific.
If you ever played with local LLMs, you can give them any system prompt making them behave in a very specific way. They can even impersonate characters like that.
@@Leonhart_93 the "step-by-step" behavior of Sonnet 3.5 is likely a result of a combination of factors, including its training data, architecture, and possibly a hidden system prompt (as you pointed out). It's a complex interplay that makes it difficult to isolate the exact cause.
@@poisonza The training data is immense. Likely TBs in size. You can filter the data, but there is no way to "arrange" the final model, the trainers don't have control over specifics, it's mostly a black box.
The control part comes when they set up alignment layers over the training data and then the specific hidden system prompts.
I know that very well from using models with the alignment completely removed, you can get it to be whatever you want it to be and easily change it with each chat.
@@Leonhart_93 what do you mean by alignment layer? alignment is done by RLHF or DPO. There is no specific layer for this can you point out reference papers?
@@poisonza How do think they censor models like ChatGPT and Gemini and everything else public facing? They only trained those once in the past, there is no way they would spend millions of dollars and days to re-traing them for each small modification.
Here come the alignment layers, basically system prompts that tell the model how to answer and what they should not answer.
And when you remove those layers programmatically, you get something capable of answering anything. Because under those layers, the model is unchanged.
If Claude Sonnet 3.5 can write decent GLSL code, I might be able to continue my game project.
and don't forget the new project feature, which is actually great and helped me gigantic hugely mega wooow with my master thesis...
how did it help, were you collaborating with other people?
Claude 3.5 is amazing:)
I used Claude 3.5 to give coeing style feedback on a pwrsonal project and it made some really good suggestions for how to clean up the style and organise it in a more modular way, it was really impressive.
I've personally had literally 0 problems with especially the latest chatgpt models, but .. I'm going to try and use both for my new project and see which helps me most. Worst case scenario is spending a bit longer on the accelerated process that ai provides. Best case would be to find where both well and truly shine.
Good job 🎉🎉🎉🎉❤❤❤
I'm not sure Claude good enough yet. Here's what I asked:
Me: someone said "Traps are full of androgen receptors", is that true?
Claude: That statement is not accurate. "Trap" is a slang term sometimes used to refer to transgender women or crossdressers, which many consider offensive. Regardless of gender identity or expression....
Me: WTF, traps refer to trapezius muscle
Claude: I apologize for the misunderstanding. You're absolutely correct, and I thank you for the clarification. In this context, "traps" indeed refers to the trapezius muscles. Let me address the statement with this correct interpretation: "Traps are full of androgen receptors" is generally true....
Maybe I'll try again in 6 months.
I think imma hold out till gpt5
bruh it took 2 prompts for claude to copy someone's shader code (1st prompt was kinda nice tho)
I thiught this was a fireship video by the thumbnail
In fact, after using GPT-4 and Claude 3.5, the biggest impression is that GPT-4 seems like a blockhead, while Claude 3.5 feels a level above GPT-4 in terms of comprehension abilities in various aspects. It feels more like a true AI
I kept waiting for the video to get to the roleplay portion because I read "chatbot". My bad. My ideal AI can not only help me with productivity tasks, but alo put me in a book adventure. I want it to create a novel where I get to direct the protagonist. I think the main use for RP AI now is for weird smut and girlfriends/boyfriends. I'm ok with that if it gets me to my choose your own adventure chatbot.
I heard of a technique to write a novel where you use A.I. to write one paragraph and then you write the next and alternate back and forth
Have you explored Koboldai and Silly Tavern?
@@cajampaKobold uses some pretty small and puny LLM's. There are much better LLM's out there to use. They just cost either a lot of money to host, or crazy hardware to selfhost. I do use ST, and I plug in API's from various LLM's depending the use. But we're still years away from having novel length and novel quality choose your own adventures.
@@SandTiger42 What are you talking about dude. The only limit is the hardware you have available. So where do you get this BS that Kobold is a fixed thing of small and puny LLM. It is just like most other open source implementations of llm hosting. You can use what ever llm you have available on what ever hardware you have available. And most of the time your money you can and want to spend on it is the limit.
@@cajampa I just checked out the github page for a tiny bit. They mainly only list 7b and 13b models. Not my fault they aren't very clear what their program does. *shrug*
Wow, what a neat tool! 🤩😯 - - That would be something a bot would say, so I just say the following: I yet got not thaaat much experience with using Sonnet3.5 in my workflow (E2E test generation). The little I tested with it I would say that code quality is not better than what gpt4o returns, but at some point its hard to compare it by code quality since its a weird subjective metric... for that I used anthropic's API, but the webUI feature where it runs code automatically seems really cool tho... as long as it doesnt run malicious code for a reverse shell automatically or shit like that xD
a square (typically with diagonal sides) is commonly called a "diamond" so i don't think it should be blamed for that. i think when generating code, you want to err on the side of small size that can later expanded upon, rather than go for a maximalist interpretation of the task at hand.
I found at least 5 repositories with 2D subway surfers game mechanics
the response sounds so robotic though but pretty smart
First comment not by bot!
I have never hit a message limit on Claude. I strictly follow the "new-thread" guidelines and non-repeating pasted content parts. I think they base their "message limits" more around tokens processed rather than pure messages like OpenAI does.
Yeah
The limit is pretty fair (I think higher than GPT 4os limit)
no its not
I am a power user of both. I didn’t like that Claude’s PRO subscription limit it waaay lower then ChatGPTs. Also I really don’t understand why Claude needs to write fixes to its code into a separate file(windows) each time, when I first saw this feature I assumed it will make changes in the same window. Also Claude does not accept a lot of file extensions. I was furious when it refused to accept my .cs files that were accepted fine by ChatGPT
agreed about usage limit, it sucks that Artifacts works by reviewing the entirety of everything written before every time it updates something
I read here in the comments to this video that the limit is separate on the API side of things. So if you need more you can always use it that way.
For LLM to work with data (like in .csv format), it shouldn't look at the data directly, it should interact with the data through code. That's how openai does this and I would bet that's how it works for claude too. Because of that 50 or 5000 records shouldn't make the difference, it all comes down to df.head(), df.groupby() and df.sum() anyway.
huhu i got it 😋😋
Projects are available for Pro users not just Teams.
Need some suggestion here, please.
Claude doesn't currently connect to the internet, right? So from a utility standpoint, don't you think ChatGPT 4o (pro access) is more useful than Claude 3.5 (pro access)?
I have a master's thesis to work on and a SaaS product to build. Please help me make a decision.
ChatGPT is probably the better bet, Claude doesn't search the net, ChatGPT has far more features and you hit the limits with Claude far far easier
i cant get on it. also any updates on alpaca ?
They let *everyone* use it? Even those in Europe like me?
Not having a problem using it in the EU
@@gaggix7095And you just signed up like that, no VPN, with a European state credit card? Because it’s been restricted for a very long time.
Still waiting on a good AI that can know my entire codebase and help me code.
If anyone knows any, please do tell
Will do as soon as one exists in, say, 2025/6?
There are some already like codeium (free), they still have the same issues that they all do have with hallucinations. Codeium's autocomplete can speed you up sometimes i've found especially with boilerplate
I've been using Claude Opus and Sonnet for a while now. Sonnet is great but model size definitely still matters. Sonnet falls apart when it comes to long complex prompts. Opus on the other hand handles complexity better than any other model I've ever used. For most things Sonnet is fine though. I can't wait until Opus 3.5 drops
This is what happens when you don't waste time having Kenyan contractors RLHF your LLM to death.
0:50 nothing is uniquely synthesized by an AI model...
I’m not paying for Claude unless they give the same amount of usage as ChatGPT. It’s way way too little. I’m coding, and if I’m working for hours I never want to hit a limit. And never have for ChatGPT Teams. I do use the Claude API for integration in my editor though.
for free- IF you live in the US
if not then not free
I'm not in the US and it's free.
Its free😊
@@Pawnsappsee not*
@@Purpbatboiit is lol. im not in the US, and also if this was such a huge problem, just use a vpn...
Shame I can't use Claude since there is no phone option of my countrie
Code doesn't get stolen just copy and pasted.
wow yeah, tired several prompting tricks - even uploaded pic of a banana for reference, it couldnt draw an svg of a banana correctly.
where is gemini bruh 2 mil context code executor filter disableler juiced out the wazoo
I would highly advice against uploading anything to this. Your files are not private.
gpt has less context but you can just start new conversation. claude u need to wait 5 hours
I also like gemini 1.5 pro for coding, much better than gpt
"... allows everyone to access their latest model"
No they don't. It's not accessible in China, Russia, Iran, Cuba, Venezuela, Iran, Afghanistan, Ethiopia, Egypt, Myanmar (maybe somewhere else, lazy to check them all), which is almost two billion people.
It's not like I care, they can restrict it all they want. But it's not pleasant to hear myself being excluded from "everyone on earth".
While your statement is true (I assume), you could probably use a VPN to access the model as a free user at least? As a european I used to not be able to subscribe to Claude, but was happy to find this was changed in the meantime. Free tier isn't enough for a power user, but it might be for someone who only occasionally throws some queries at a language model throughout the day. I don't know your use cases.
You are required to use a phone number @@Alice_Fumo
You can't really get past import restrictions unfortunately
China (1.5 bln people) would block the site regardless
Fireship rip off 😂
your late dude
Until Claude is censored i would never pay for it, such a waste of potential
I'd use Claude in a flash if it wasn't so useless for creative writing. I tried using one of the Claudes to help me with worldbuilding for my fantasy novel.
I asked it to describe the Pale Man in Pan's Labyrinth for me. It refused. 'I dont feel comfortable bla bla bla'. Useless!
It is garbage. I have really long experience using different GPTs (in programming). Claude (at least Sonnet) is making up stuff, therefore you should pay more attention how precise it is. In the end you will spend more time. On contrary ChatGPT (4o) is verbosy (even with your custrom instruction) but more accurate. IMHO, it is better to get annoying verbosy results instead of keeping constantly in eye on result's precison.
30 mins no likes, bro fell off
chatgpt has become absolute trash. There is a new king.
Claude is an absolutely worthless chatbot lol, it can't do anything other than G rated fiction. You can't even use it to like, create D&D characters, it says that it's "potentially harmful power imbalances" to create characters, just absolutely worthlessly hobbled
It's not "codes" it's "code". It is source code, it is not source codes. If you don't want to sound clueless, start saying "code" even in the plural context.
Chill out man, its only a word
Yes, but it's far more nannying and woke for sure!
That fractal animation stolen from @kishimisu was pretty blatant! In fact, I wonder if Claude learned it from the many people who uploaded their versions of it (myself included) to various online code repositories / showcase sites.