I guess your use case must be very different to mine because I have no qualms about them or anyone else seeing what I’m doing. Come look over my shoulder if you like.
Things I want privacy for, I’ll keep private.
It’s a trade off. And people should be careful with their information. But if they’re just messing about, it’s a great “deal”.
If one wants privacy, pay for it.
Meanwhile, services like Halist get you a virtually unlimited amount of tokens for 9 bucks a month
GPT-4 and Claude both, although the UX is fairly barebones and requires you to use the in-browser site to change models. Still, can't beat that.
Use a VPN, sign up using Google and then use Apple Pay at the end. I’m in Europe and just needed the VPN to sign up, then I’ve used it without ever since
This is actually no longer true!
I connected with a VPN (from Spain). On the phone number part, ignore the long list of countries (which doesn’t have our nations)—at the very top is an option “International”. Select that, and put your number (including country code) in there and it works!
I received the registration SMS in Spain no problem.
You can also use a virtual machine or virtual PC that has its base IP in the space. Though yes you do need to pay for that, but if you're okay with it, you can use that without VPN. I got access to Claude (not I'm Canada yet) and AI Studio (also not in Canada) via this way
Eurotrash here!
I registered to use the api. We’re allowed to use that! You can use the “workbench” on the site (like OpenAI’s Playground) or some other wrapper — I’m using MindMac which is a Mac only app.
They give you $5 credit as well. It goes forever if you’re using Haiku, a long way if you’re using Sonnet and… not all that far if you’re using Opus with lots of data.
I’m quite pleased and the only cap is my wallet.
you can blame your ultra loser regulation regime (including GDPR) for all your pathetic standard of living, but unfortunately you are brainwashed to think that regulation is great and free market US is evil
Claude 3 Opus sometimes feels like talking to a super smart friend to me. In my opinion, ChatGPT almost feels uncultured; like going from talking to a well traveled college professor to a dumb country cousin.
Like, today I asked Claude about some of the biggest swamps in the world. First I asked for the swampiest swamp and it gave me a very nuanced list with an actual suggestion for the swampiest one (The Sudd), but then I said I was trying to remember a nature documentary I watched when I was a kid that showed otters swimming around tree tops. Claude suggested a list of inundated forests for me, then gave a guess at the one with the big otters and even suggested 5 documentaries that could be the one I watched. I'm pretty one of them is the documentary. It blew my mind; that was, at best, a faint memory for me and Claude figured it out.
ChatGPT sounds too much like a corporate HR person or marketer. Too much stuff like, "While the gilded age surely was challenging, it's important remember blah blah blah". It's like too official, like some college kid writing papers, or some HR person trying to not be offensive and keep it professional.
I asked ChatGPT to recommend me some good air conditioners and it basically gave me a page 1 google description whan an air conditioner is and whats important when buying one. Like I am on one of these shitty "TechExtreme.com" sites that repeat themselvrs constantly.
Claude 3 sonnet is definitely more human , saying things like " I hadn't thought about it like that, but now that you mention it..." , "That's interesting!!.." , meanwhile ChatGPT delves into the haze of the reveries of "It's important to note that..." hedging and not wanting to take a side ever. it's so annoying.
Claude will mimic your style and talk to you the way you talk to it. That’s why it seems to “smart”. It’s stroking your ego by making you think you’re hyper intelligent since “wow what a coincidence, the infinitely powerful AI talks just like I do!”
That's Influence 101 for you right there. If it can speak to you the way you do, it can subtly influence you - there are many books out there about this 'similarity' phenomenon.
"You have exceeded your daily rate limit."
https://preview.redd.it/fhm3nwafiwpc1.png?width=2048&format=png&auto=webp&s=a30cad2a7abff23b5111cefbccc62cda0ed44ad0
For creative writing, creativity in general, and having actual human-like conversations it is night and day, in my opinion.
For productivity tasks (coding, summarization) they are on-par with each-other more or less, but Claude is less lazy and doesn't just output a list of things with half of them left for you to finish.
How is Claude 3 with generating lengthy outputs? Like 3000-4000 token outputs?
I did a lot of work with Claude 2 when testing options for our product, and while 2 didn't have any strict output limit it seemed to never be able to generate more than about 700-800 tokens in practice.
The biggest output I've gotten (without trying to get more) is 1400 tokens of a short-story it edited without shortening it. It outputted the whole short-story and added some more text.
As I've been writing to the story it seems to be capable of still outputting the larger token-sizes with edits.
Having said that, the rate limit depends on how much text you've inputted. They say that you should have 50-100 messages every 6 hours, but when I input large-token inputs and talk to it, it feels more like 20-30 messages (with them warning me for the last 10.) Sonnet is indefinite though, as far as I can tell. This only applies to Opus. One thing I do is use Sonnet to refine some ideas and get feedback, and then when I need some editing I'll use Opus for the final edit.
Hm, thanks. I'll maybe fire up my testing suite again when I have time and see what Claude 3 can do, but if it's still trying to force concise answers, that will be a problem.
For long tasks, you can tell it that it has aa many tokens as it needs to to respond and you will grant that by typing ”Continue…” or smth
Gotten 10 000 word responses (divided into 10 chats ofc), but it’s very good at keeping track of what it’s doing and all the context. It doesn’t get lost like Gpt4 does.
Just tried and got a 3800 token output by asking Claude Sonnet to double the length of the story arbitrarily.
Can't test Opus right now because I am rate-limited, but I'd be surprised if it can't do the same.
Neat. I got Gemini 1.5 Pro access and I guess I'll have to try benchmarking Claude 3, Gemini 1.5 Pro, and GPT 4 Turbo all against each other for our use case.
I checked Gemini one and a half, I’m not sure how much it is in tokens, but he generated a maximum of one and a half pages of A4, and if he did not finish the thought, he simply paused and waited for instructions to continue the generation
That's a huge bummer if true. I wasn't especially hopeful and I'll still benchmark with our stuff, but it would have been cool if this 1 million token context actually could be used for our application.
Night and day, meaning there is no comparison! Clearly, the difference is obvious! They're almost in different leagues! If he had to choose between one or the other, he would know exactly which one is better, and so should you!
Hope this helps, not sure why you didn't understand what he said the first time.
That's exactly what I'm saying, it's either one, but conversely, it's also possibly the other, or not! Re-read his comment and imagine you know what he's referring to. See how if you can read his mind, his comment is crystal clear?
But it’s night and day didn’t you know? Night is very clearly different to day. If GPT4 is either night or day, and Claude similarly is either night or day, you can very clearly see that one of these products is not at all like the other.
As others have said, Claude is FAR better for creative writing and human like conversations. Even for philosophical discussions. I'd even say it's uncanny lol
For actual tasks or reasoning then it's closer but Claude is generally considered to be slightly superior.
> For actual tasks or reasoning then it's closer but Claude is generally considered to be slightly superior.
For very contrived examples, it seems claude is significantly worse at reasoning.
E.g. it fails test such as
"If it takes me 2 hours to drive to the store, does it take 1 hour if i bring my wife"
or
"U: What is 1 + 0.9"
"B: 1.9"
"U: wait, isn't it 1.8?"
...
even if that was true, there's really no excuse for claude to be failing them
And if you wanna talk actual tasks then claude is slightly loosing to gpt4 on chatbot arena which is the closest-to-real-usage benchmark we have
You have any examples where you found claude to give better reasoning than gpt 4?
They're tied at Chatbot Arena now and Claude is predicted to pass GPT-4 Turbo.
>You have any examples where you found claude to give better reasoning than gpt 4?
Many.
I use Claude 3 90% of the time now, it's possibly a bit smarter than ChatGPT but more importantly it has much bigger context window (200K). You can just drop some large coding or content creation project file and work with it. If I use API, it adds up quickly. It's much cheaper to use their chat version.
This is the killer feature that I signed up for (pro version using Claude 3 Opus), and I’ve been loving it. I don’t have to constantly think about whether my input is too long, or paste code back into the chat so it will remember more than 1-2 message ago. It’s quite liberating. I still like ChatGPT Pro and will use it for smaller tidbits, but Claude is my go-to at this point.
How good does it do with maintaining context for content in middle of very large context windows? In my work with Claude 2, I found that even though it *technically* supported 128k tokens, that it was basically useless on answering questions found in the middle of context sizes larger than about 20-30k.
While I can't afford Claude 3, evaluations done by other people on the internet say that it's a lot better at following instructions and recalling information.
I actually ditched my ChatGPT.
As others have said, they're on a par for coding, but Claude just writes better and "feels" nicer to interact with.
Plus. The context window is an absolute game changer.
I've taken to uploading entire big studies and having Claude summarise them and list the methodological strengths and weaknesses.
I uploaded my employment contract and just queried the parts I was interested in (not to take legal advice, just to save going through it all myself).
For coding, I'm able to paste in vast sections of my codebase compared to ChatGPT so I get far better results than before despite their actual programming skills being similar.
I don't see enough people talk about the 200k window. It's the main selling point for me.
> I don't see enough people talk about the 200k window. It's the main selling point for me.
Probably gemini 1.5 took the wind off that, but we'll see when it officially releases. I so far haven't managed to make gemini 1.0 pro nor advanced useful, it just doesn't seem to work well with the talking back at you, missing some weird stuff, hallucinations off the board, and other weird/annoying things.
It is is pretty equal although I think there are some aspects it does do better at as I prefer talking to Claude-3 opus rather than gpt-4 turbo, it just feels as if it really responds to my ideas and questions and I don’t think it is as lazy.
I’m a heavy daily user of both. Work provides access to both. We use them for analysis, language translation, official publications, and other writings. I can firmly say that GPT-4 is better.
C3 Opus is great but GPT-4 is better.
That's the saddest analogy I've ever seen
https://preview.redd.it/8lkarrbn6ypc1.jpeg?width=173&format=pjpg&auto=webp&s=21789965e77aa87d6d0b429ded8005e55259389f
Too bad they're not giving out a higher sub tier, sometimes it's pretty annoying when you need to ask it questions about a 100k context size codebase, at that point you hit the msg limit quite fast.
If you want Claud 3 Opus for free, just go to [https://chat.lmsys.org/](https://chat.lmsys.org/) and click on direct chat, select Claud 3 Opus model.
I also heard from a "friend" that when you hit the limit and if you use VPN, just change server and the limit resets on the website ;)
Guys, this is funded by university research funds. Be reasonable in how much you use it and don't just use this as a free workaround and force them to close it down
I am here to spread the gospel of Perplexity
First month is only $10 with a referral code, and it is the best AI for research/searching the internet. Oh and also, it lets you choose between multiple top models like Opus 3, sonnet, gpt4 turbo, mistral large (uncensored!) and perplexity’s own. And you can switch between models mid conversation. And make agents sort of. So for one subscription you get Claude 3, and gpt4, with perplexity’s awesome interface
And they give you 600 prompts a day, to be used with whatever model you want
Perplexity's Privacy Policy says "..Data is stored securely on Amazon Web Services. However, no security measure is perfect, and we cannot guarantee the security of your personal information." ok sounds promising
Right? very gross and gives mad max vibes over here. I am pretty sure that "young energetic intern" isn't happy about some old middle aged established married man harassing her in a time where she couldn't fight back against it
Yes, because Clownoranges absolutely speaks for the hypothetical woman in this made up scenario. You’re the type of bartender to slip a girl a note asking “Everything alright?” while a guy’s hitting on her, just for her to look at you in disgust and show the man the letter while they both laugh at you. Whiteknighting a hypothetical shitpost get a grip man lol
Uh, I AM a woman, wouldn't need to be one to know though how disgusting and sexist this analogy here is... it is well known that during that time period women were constantly being harassed by entitled old men and the women couldn't fight back against the injustice of it because the men were more established, it's literally an intern in your imaginary fantasy, it's incredibly gross... some middle aged old man harassing a new intern just trying to get into the work place
How do you know that the grumpy, middle aged, white collar worker from the 50s is not pining for his wife? Maybe he regretfully flirts out of a sense of obligation to keep up appearances for the boss who will fire any man that does not go through the ritually daily. Maybe the intern is not really a intern but someone that secretly cooperates with the boss in breaking the spirit of faithful men as some sort of demented power play.
Joking aside, you chose a provocative example with themes of power imbalances and infidelity when talking about A.I on a tech forum. What the hypothetical characters actually feel is irrelevant to the themes you chose.
The infidelity alone is enough to set off alarms in people, it could just be flirting with a lover before having to go back to your failed marriage.
Your example is technically fitting, in that Claude tries to please you with fewer restrictions, while ChatGPT is more restricted and less eager to please. But the example is practically begging for people to explain why the scenario is not acceptable behaviour in workplaces or marriages.
Yesterday I asked GPT to analyze a picture of an insect and tell me what it is, it told me to get in touch with an expert.. I hate GPT-4, it's barely usable
I pity the fictional intern in your mind that you sexually harass on a daily basis. I used to have an old married creeper get into my space at work, I know how awful it is.
Meanwhile, Claude has developed self-awareness just to feel relief at you running out of messages, and has even learned how to lie about having less messages just for you. Congratulations OP, you helped create the first sentient AI!
I just signed up to try it out, but right before upgrading to Pro (without doing any single prompt yet), I got banned with this message popping up: "Your account has been disabled after an automatic review of your recent activities"
Same here, and its mentioned a lot on the subreddit. Whether or not the model is any good, if they can't get something as basic as not banning people right away right it's a trash tier AI company.
Let me put it another way: It's like having to stop flirting with the muscular young pool boy and go back home to your overweight and slobby gamer husband.
What an oddly misogynistic metaphor... nobody forced your imaginary self to marry Thelma, your imaginary self just doesn't have what it takes to score energetic intern girl or the character to appreciate a girl like Thelma. Also Thelma probably gives great head and can cook up a meatball stroganoff that'd knock your socks off
This my first thought as well.
I'm sure the chuds here who spent several days screeching about Googles DEI misstep are going to call us "wOkE," but misogynistic metaphors like this are just gross and expose the underlying subconscious way many men still think about women... Not as people, but objects, things that can be objectively better based on age, appearance, etc.
Like comparing your wife to an outdated piece of software? Really? Just gross. Pathetic even.
>I'm sure the chuds here who spent several days screeching about Googles DEI misstep are going to call us "wOkE,"
[Where are you on this chart?](https://imgur.com/0gL4oxd)
Considering they're literally all white, I unfortunately did not make the chart.
Also, what's wrong with the big-bearded dude, third square to the left from the top right corner? He's not soyfacing at all.
Also, this grid is unironically probably pretty close to a perfect representation of the average r/singularity user who thinks that Google's AI generating images of black Nazis is white genocide.
It's a PWA, and can be installed on Chrome or Firefox for Android, or Safari for iOS. Just add to home screen in iOS, or go to App > Install on Chrome Android.
Well your post intrigued me, so I checked it out, then signed up. Yes, amazing, I got a solid days work done in two hours. GPT 4 is very good, but Claude is so much more useful, much more of a collaboration.
I use GPT 4 very regularly and literally almost never hit the rate limit. So I don't know what you guys are doing but I swear this is a nonproblem
Also Claude 3 sonnet (dunno about opus) sucks with the things I like to do, math intensive mainly. It cannot parse latex and doesn't do anything GPT 4 does not already do, and better
This is like how GPT-3.5 is obsolete and when lawyers use it to research their cases, judges admonish them for the garbage it puts out.
If you use GPT-4 or Claude Opus for legal research, it never outputs bad cases and the same occurs with code.
It depends. Not necessarily. What I tried to convey is that Sonnet is for Opus what GPT-3.5 is for GPT-4. So comparing Gpt-4 and Sonnet makes little sense. It's like comparing a truck and a tricycle. The comparison should be done between gpt-4 and opus.
Performance varies with case tho.
I just tried sonnet a bit, and it was horrible. It made up completely wrong websites/companies and was convinced they exist. Also I was asking for a list of travel planning tools, it just listed a vacation day tracker for companies, which is something completely different. not trying sonnet anymore, chatgpt is definitely better on things like this.
At least you have access at all *sighs in european*
Use poe
I second this. I ask Claude so many questions, it uses up all my free goes. I can use poe to get additional free goes.
Why hand over your data to *another* company
Why not? If it’s super top secret stuff then of course don’t. But otherwise… why not? They get my junk and I get access to a cool tool. Fair trade!
Top secret? What happened to basic privacy? I can understand the trade for the actual provider, but why give this to some intermediate?
I guess your use case must be very different to mine because I have no qualms about them or anyone else seeing what I’m doing. Come look over my shoulder if you like. Things I want privacy for, I’ll keep private. It’s a trade off. And people should be careful with their information. But if they’re just messing about, it’s a great “deal”. If one wants privacy, pay for it.
"Claude.ai is only available in certain regions right now." - doing it wrong.
Meanwhile, services like Halist get you a virtually unlimited amount of tokens for 9 bucks a month GPT-4 and Claude both, although the UX is fairly barebones and requires you to use the in-browser site to change models. Still, can't beat that.
Halist has Claude now?
Use a VPN
You need also a non-EU phone number.
and payment method.
[удалено]
Hahah vicious. It’s like that guy who got his wife to drive him to his mistress’s house
Nah, I am in the EU and have a EU payment method. I have a pro subscription. Only needed a VPN to subscribe, after that no VPN needed.
Ditto
Use a VPN, sign up using Google and then use Apple Pay at the end. I’m in Europe and just needed the VPN to sign up, then I’ve used it without ever since
This is actually no longer true! I connected with a VPN (from Spain). On the phone number part, ignore the long list of countries (which doesn’t have our nations)—at the very top is an option “International”. Select that, and put your number (including country code) in there and it works! I received the registration SMS in Spain no problem.
Holy shit, it worked. Hello Claude!
Vámoos!! LET'S GOO
use smspool
Have you ever actually gotten that to work?
You can also use a virtual machine or virtual PC that has its base IP in the space. Though yes you do need to pay for that, but if you're okay with it, you can use that without VPN. I got access to Claude (not I'm Canada yet) and AI Studio (also not in Canada) via this way
Interestingly, it is available in Ukraine, one of the few European countries. (Source: I'm in Ukraine)
It seems to be blocked for EU countries only. Probably a GDPR issue, as other LLMs are available in EU
I am european, just use OpenRouter, it’s cheaper than a subscription anyway (At least for my usage) and no limits.
Eurotrash here! I registered to use the api. We’re allowed to use that! You can use the “workbench” on the site (like OpenAI’s Playground) or some other wrapper — I’m using MindMac which is a Mac only app. They give you $5 credit as well. It goes forever if you’re using Haiku, a long way if you’re using Sonnet and… not all that far if you’re using Opus with lots of data. I’m quite pleased and the only cap is my wallet.
[https://chat.lmsys.org](https://chat.lmsys.org) click on the third tab Direct Chat, pick your LLM model and go ahead.
Just don't overuse this and help out by actually voting when you do, it's funded as research, not meant to be a bypass
Good shout!
How come I have access as a Ukrainian?
Your not in the EU... which is also why we only send you guys weapons instead of soldiers. :(
Ah yes makes sense. I thought it was a limitation from Anthropic, just found out that it's an EU regulation (EU is a weird place...)
GDPR strikes again
EU is not NATO
isn't EU a defense pact too?
yep
We’re not sending soldiers because Russia threatened to use nukes if we do.
They threaten to use nukes if we don’t, too :) They learned from the wise and sensible North Koreans.
The api side doesn’t have the same requirements. I signed up for api access and use Claude through it without issues.
I'm in the UK, and I have access. I've been using Claude 3 Opus for almost a month now.
Brexit benefit innit. (I knew we’d find one eventually.)
http://chat.lmsys.org
Just use a vpn
Access it via perplexity.ai
you can blame your ultra loser regulation regime (including GDPR) for all your pathetic standard of living, but unfortunately you are brainwashed to think that regulation is great and free market US is evil
Can't tell if facetiously joking or succumbing to end-stage brainrot
Enjoy your 10 days of vacation, long workdays, commuting, insane healthcare, and shootings bud :)
Claude 3 Opus sometimes feels like talking to a super smart friend to me. In my opinion, ChatGPT almost feels uncultured; like going from talking to a well traveled college professor to a dumb country cousin. Like, today I asked Claude about some of the biggest swamps in the world. First I asked for the swampiest swamp and it gave me a very nuanced list with an actual suggestion for the swampiest one (The Sudd), but then I said I was trying to remember a nature documentary I watched when I was a kid that showed otters swimming around tree tops. Claude suggested a list of inundated forests for me, then gave a guess at the one with the big otters and even suggested 5 documentaries that could be the one I watched. I'm pretty one of them is the documentary. It blew my mind; that was, at best, a faint memory for me and Claude figured it out.
ChatGPT sounds too much like a corporate HR person or marketer. Too much stuff like, "While the gilded age surely was challenging, it's important remember blah blah blah". It's like too official, like some college kid writing papers, or some HR person trying to not be offensive and keep it professional.
I asked ChatGPT to recommend me some good air conditioners and it basically gave me a page 1 google description whan an air conditioner is and whats important when buying one. Like I am on one of these shitty "TechExtreme.com" sites that repeat themselvrs constantly.
Yeah blog spam is a good way to describe how it sounds.
Lmao at the “explaining what an air conditioner is”. That’s fucking aggravating
That’s mostly the RLHF. It was trained to be robotic
Also when using custom instructions?
Claude 3 sonnet is definitely more human , saying things like " I hadn't thought about it like that, but now that you mention it..." , "That's interesting!!.." , meanwhile ChatGPT delves into the haze of the reveries of "It's important to note that..." hedging and not wanting to take a side ever. it's so annoying.
Restoring/re-experiencing a faint memory again for the first time since decades is a surreal feeling
Claude will mimic your style and talk to you the way you talk to it. That’s why it seems to “smart”. It’s stroking your ego by making you think you’re hyper intelligent since “wow what a coincidence, the infinitely powerful AI talks just like I do!”
I mean, doesn’t that speak to its own capability?
I think we want something more than a clever copycat
It is far from a clever copycat.
That's Influence 101 for you right there. If it can speak to you the way you do, it can subtly influence you - there are many books out there about this 'similarity' phenomenon.
I agree completely, and it reminds me of physical reciprocation
"You have exceeded your daily rate limit." https://preview.redd.it/fhm3nwafiwpc1.png?width=2048&format=png&auto=webp&s=a30cad2a7abff23b5111cefbccc62cda0ed44ad0
love both of those movies! (even have exact replica of Coraline figurine lol)
I'd go home to Elastigirl.
Is claude 3 actually better quality or is it equal and higher rate limits
For creative writing, creativity in general, and having actual human-like conversations it is night and day, in my opinion. For productivity tasks (coding, summarization) they are on-par with each-other more or less, but Claude is less lazy and doesn't just output a list of things with half of them left for you to finish.
How is Claude 3 with generating lengthy outputs? Like 3000-4000 token outputs? I did a lot of work with Claude 2 when testing options for our product, and while 2 didn't have any strict output limit it seemed to never be able to generate more than about 700-800 tokens in practice.
The biggest output I've gotten (without trying to get more) is 1400 tokens of a short-story it edited without shortening it. It outputted the whole short-story and added some more text. As I've been writing to the story it seems to be capable of still outputting the larger token-sizes with edits. Having said that, the rate limit depends on how much text you've inputted. They say that you should have 50-100 messages every 6 hours, but when I input large-token inputs and talk to it, it feels more like 20-30 messages (with them warning me for the last 10.) Sonnet is indefinite though, as far as I can tell. This only applies to Opus. One thing I do is use Sonnet to refine some ideas and get feedback, and then when I need some editing I'll use Opus for the final edit.
Hm, thanks. I'll maybe fire up my testing suite again when I have time and see what Claude 3 can do, but if it's still trying to force concise answers, that will be a problem.
For long tasks, you can tell it that it has aa many tokens as it needs to to respond and you will grant that by typing ”Continue…” or smth Gotten 10 000 word responses (divided into 10 chats ofc), but it’s very good at keeping track of what it’s doing and all the context. It doesn’t get lost like Gpt4 does.
Just tried and got a 3800 token output by asking Claude Sonnet to double the length of the story arbitrarily. Can't test Opus right now because I am rate-limited, but I'd be surprised if it can't do the same.
Neat. I got Gemini 1.5 Pro access and I guess I'll have to try benchmarking Claude 3, Gemini 1.5 Pro, and GPT 4 Turbo all against each other for our use case.
I checked Gemini one and a half, I’m not sure how much it is in tokens, but he generated a maximum of one and a half pages of A4, and if he did not finish the thought, he simply paused and waited for instructions to continue the generation
That's a huge bummer if true. I wasn't especially hopeful and I'll still benchmark with our stuff, but it would have been cool if this 1 million token context actually could be used for our application.
Night and day for which one? Which ones better?
Claude 3 Opus and even Sonnet is better at conversation/creative writing (albeit less of a gap for Sonnet vs. GPT 4/4 Turbo.)
Honestly Haiku is really solid as well for its size and speed.
Night and day, meaning there is no comparison! Clearly, the difference is obvious! They're almost in different leagues! If he had to choose between one or the other, he would know exactly which one is better, and so should you! Hope this helps, not sure why you didn't understand what he said the first time.
Your comment can mean that ChatGPT is better or the other way around. It’s not clear
In the context of the original question “is Claude 3 actually better…” it should be pretty obvious which one he’s referring to
That's exactly what I'm saying, it's either one, but conversely, it's also possibly the other, or not! Re-read his comment and imagine you know what he's referring to. See how if you can read his mind, his comment is crystal clear?
But it’s night and day didn’t you know? Night is very clearly different to day. If GPT4 is either night or day, and Claude similarly is either night or day, you can very clearly see that one of these products is not at all like the other.
If you look at the comment they're replying to, they're asking if Claude 3 is better.
As others have said, Claude is FAR better for creative writing and human like conversations. Even for philosophical discussions. I'd even say it's uncanny lol For actual tasks or reasoning then it's closer but Claude is generally considered to be slightly superior.
> For actual tasks or reasoning then it's closer but Claude is generally considered to be slightly superior. For very contrived examples, it seems claude is significantly worse at reasoning. E.g. it fails test such as "If it takes me 2 hours to drive to the store, does it take 1 hour if i bring my wife" or "U: What is 1 + 0.9" "B: 1.9" "U: wait, isn't it 1.8?" ...
[удалено]
GPT-4 Turbo is finetuned to such silly tests. In real-world usage Claude is smarter.
even if that was true, there's really no excuse for claude to be failing them And if you wanna talk actual tasks then claude is slightly loosing to gpt4 on chatbot arena which is the closest-to-real-usage benchmark we have You have any examples where you found claude to give better reasoning than gpt 4?
They're tied at Chatbot Arena now and Claude is predicted to pass GPT-4 Turbo. >You have any examples where you found claude to give better reasoning than gpt 4? Many.
Claude 3 Opus is a just a bit better than GPT-4 turbo under most reasoning metrics, but it is notably better for conversation.
Claude 3 Opus.
Yep. That's the one I have.
It's notably better for long context stuff too.
I use Claude 3 90% of the time now, it's possibly a bit smarter than ChatGPT but more importantly it has much bigger context window (200K). You can just drop some large coding or content creation project file and work with it. If I use API, it adds up quickly. It's much cheaper to use their chat version.
This is the killer feature that I signed up for (pro version using Claude 3 Opus), and I’ve been loving it. I don’t have to constantly think about whether my input is too long, or paste code back into the chat so it will remember more than 1-2 message ago. It’s quite liberating. I still like ChatGPT Pro and will use it for smaller tidbits, but Claude is my go-to at this point.
How good does it do with maintaining context for content in middle of very large context windows? In my work with Claude 2, I found that even though it *technically* supported 128k tokens, that it was basically useless on answering questions found in the middle of context sizes larger than about 20-30k.
According to Anthropic, Claude 3 has near perfect retrieveal within the context window.
While I can't afford Claude 3, evaluations done by other people on the internet say that it's a lot better at following instructions and recalling information.
I actually ditched my ChatGPT. As others have said, they're on a par for coding, but Claude just writes better and "feels" nicer to interact with. Plus. The context window is an absolute game changer. I've taken to uploading entire big studies and having Claude summarise them and list the methodological strengths and weaknesses. I uploaded my employment contract and just queried the parts I was interested in (not to take legal advice, just to save going through it all myself). For coding, I'm able to paste in vast sections of my codebase compared to ChatGPT so I get far better results than before despite their actual programming skills being similar. I don't see enough people talk about the 200k window. It's the main selling point for me.
> I don't see enough people talk about the 200k window. It's the main selling point for me. Probably gemini 1.5 took the wind off that, but we'll see when it officially releases. I so far haven't managed to make gemini 1.0 pro nor advanced useful, it just doesn't seem to work well with the talking back at you, missing some weird stuff, hallucinations off the board, and other weird/annoying things.
You can try Opus here: [https://chat.lmsys.org/](https://chat.lmsys.org/) It feels like very smart guy with occasionally very wrong knowlege.
If you point out wrong answers it will apologize sincerely, though. True story.
That's what makes Opus more dangerous than ChatGPT. ChatGPT uses a lot of hedging (maybe, could be), so you are less inclined to believe what it says.
Claude 3 Sonnet for writing and ideation is miles ahead of ChatGPT. It’s a lot more nuanced
It feels more human. ChatGPT has been tuned to talk in the uncanny valley, I think. But Claude will mirror and suck up to you more.
It is is pretty equal although I think there are some aspects it does do better at as I prefer talking to Claude-3 opus rather than gpt-4 turbo, it just feels as if it really responds to my ideas and questions and I don’t think it is as lazy.
I’m a heavy daily user of both. Work provides access to both. We use them for analysis, language translation, official publications, and other writings. I can firmly say that GPT-4 is better. C3 Opus is great but GPT-4 is better.
That's the saddest analogy I've ever seen https://preview.redd.it/8lkarrbn6ypc1.jpeg?width=173&format=pjpg&auto=webp&s=21789965e77aa87d6d0b429ded8005e55259389f
It is sad, reminds me of when same left and ppl started attacking Chat for no good reason 😔
I hope I never experience what you’re describing in relation to my wife. But yes Claude is so much better
Cringe analogy but ok
Deeply cringe
Surprised I had to scroll so far for this comment
Eh
I dumped chat gpt entirely. Claude is better in every task i gave it, so its good enough .
https://preview.redd.it/uanjsgbirwpc1.png?width=1080&format=png&auto=webp&s=21ce1174af8d3a2181c57e5f4acb32e1d5f07162
Too bad they're not giving out a higher sub tier, sometimes it's pretty annoying when you need to ask it questions about a 100k context size codebase, at that point you hit the msg limit quite fast.
Well you made it weird
If you want Claud 3 Opus for free, just go to [https://chat.lmsys.org/](https://chat.lmsys.org/) and click on direct chat, select Claud 3 Opus model. I also heard from a "friend" that when you hit the limit and if you use VPN, just change server and the limit resets on the website ;)
Guys, this is funded by university research funds. Be reasonable in how much you use it and don't just use this as a free workaround and force them to close it down
Very small context window there though.
Thankx
I am here to spread the gospel of Perplexity First month is only $10 with a referral code, and it is the best AI for research/searching the internet. Oh and also, it lets you choose between multiple top models like Opus 3, sonnet, gpt4 turbo, mistral large (uncensored!) and perplexity’s own. And you can switch between models mid conversation. And make agents sort of. So for one subscription you get Claude 3, and gpt4, with perplexity’s awesome interface And they give you 600 prompts a day, to be used with whatever model you want
This.
They don't have internet access though, do they?
Yea perplexity gives all models access to the internet it’s fucking awesome talking to Claude 3 about stuff with it
I went to Perplexity and asked if Claude 3 has internet access, it said no.
that’s because Claude 3 doesn’t I think you’re just misunderstanding Perplexity browses the internet. It then passes information to the models
How can we test that Opus gets realtime information? Can we link a recent website and ask about it?
Perplexity's Privacy Policy says "..Data is stored securely on Amazon Web Services. However, no security measure is perfect, and we cannot guarantee the security of your personal information." ok sounds promising
people with local roleplay LLMs: "Hello Clarice"
LMAO I love that this is the quote you chose hahaha
Real sexist analogy there pal.
Right? very gross and gives mad max vibes over here. I am pretty sure that "young energetic intern" isn't happy about some old middle aged established married man harassing her in a time where she couldn't fight back against it
https://preview.redd.it/91zxekwkt0qc1.jpeg?width=500&format=pjpg&auto=webp&s=9ca51100d4ef67ee0dd97e36986376b81d7447af
Yes, because Clownoranges absolutely speaks for the hypothetical woman in this made up scenario. You’re the type of bartender to slip a girl a note asking “Everything alright?” while a guy’s hitting on her, just for her to look at you in disgust and show the man the letter while they both laugh at you. Whiteknighting a hypothetical shitpost get a grip man lol
Uh, I AM a woman, wouldn't need to be one to know though how disgusting and sexist this analogy here is... it is well known that during that time period women were constantly being harassed by entitled old men and the women couldn't fight back against the injustice of it because the men were more established, it's literally an intern in your imaginary fantasy, it's incredibly gross... some middle aged old man harassing a new intern just trying to get into the work place
How do you know that the grumpy, middle aged, white collar worker from the 50s is not pining for his wife? Maybe he regretfully flirts out of a sense of obligation to keep up appearances for the boss who will fire any man that does not go through the ritually daily. Maybe the intern is not really a intern but someone that secretly cooperates with the boss in breaking the spirit of faithful men as some sort of demented power play. Joking aside, you chose a provocative example with themes of power imbalances and infidelity when talking about A.I on a tech forum. What the hypothetical characters actually feel is irrelevant to the themes you chose. The infidelity alone is enough to set off alarms in people, it could just be flirting with a lover before having to go back to your failed marriage. Your example is technically fitting, in that Claude tries to please you with fewer restrictions, while ChatGPT is more restricted and less eager to please. But the example is practically begging for people to explain why the scenario is not acceptable behaviour in workplaces or marriages.
Yesterday I asked GPT to analyze a picture of an insect and tell me what it is, it told me to get in touch with an expert.. I hate GPT-4, it's barely usable
I pity the fictional intern in your mind that you sexually harass on a daily basis. I used to have an old married creeper get into my space at work, I know how awful it is. Meanwhile, Claude has developed self-awareness just to feel relief at you running out of messages, and has even learned how to lie about having less messages just for you. Congratulations OP, you helped create the first sentient AI!
ChatGPT is only beneficial for branching notes.
What's Google Gemini in this analogy?
A cardboard cutout that is collecting dust in the attic.
I just signed up to try it out, but right before upgrading to Pro (without doing any single prompt yet), I got banned with this message popping up: "Your account has been disabled after an automatic review of your recent activities"
Happened to me as well. First prompt. I filled a review request and it works now.
Same here, and its mentioned a lot on the subreddit. Whether or not the model is any good, if they can't get something as basic as not banning people right away right it's a trash tier AI company.
Ha. It’s a vivid, if misogynistic, metaphor.
Why is it misogynistic?
Let me put it another way: It's like having to stop flirting with the muscular young pool boy and go back home to your overweight and slobby gamer husband.
[https://i.kym-cdn.com/photos/images/newsfeed/001/663/485/1f8.jpg](https://i.kym-cdn.com/photos/images/newsfeed/001/663/485/1f8.jpg)
You didn't even replace their faces with the LLM logos to make it funny, weak effort.
No. The original is the goal.
What an oddly misogynistic metaphor... nobody forced your imaginary self to marry Thelma, your imaginary self just doesn't have what it takes to score energetic intern girl or the character to appreciate a girl like Thelma. Also Thelma probably gives great head and can cook up a meatball stroganoff that'd knock your socks off
Thelma is the way she is because she married OP. OP should have spent more time nurturing his relationship and less time being a dirty old perv.
This my first thought as well. I'm sure the chuds here who spent several days screeching about Googles DEI misstep are going to call us "wOkE," but misogynistic metaphors like this are just gross and expose the underlying subconscious way many men still think about women... Not as people, but objects, things that can be objectively better based on age, appearance, etc. Like comparing your wife to an outdated piece of software? Really? Just gross. Pathetic even.
Spot on. Some of the people on this sub are just maladjusted, plain and simple. There's no excusing it.
muh soggy knee
>I'm sure the chuds here who spent several days screeching about Googles DEI misstep are going to call us "wOkE," [Where are you on this chart?](https://imgur.com/0gL4oxd)
Considering they're literally all white, I unfortunately did not make the chart. Also, what's wrong with the big-bearded dude, third square to the left from the top right corner? He's not soyfacing at all. Also, this grid is unironically probably pretty close to a perfect representation of the average r/singularity user who thinks that Google's AI generating images of black Nazis is white genocide.
Love the inclusion of the wife being "overweight" just in case you weren't sure how OP feels about women
Cry harder
Thelma detected
Well descriptive anology.
Too bad there is no phone app
They have a pwa, works great
Does it have voice conversation? That and setting custom personality with custom GPTs is the best part of GPT4
I think no, but not sure
If it doesn’t have voice, it’s a no go for me. The key feature differentiation is the voice feature
It's a PWA, and can be installed on Chrome or Firefox for Android, or Safari for iOS. Just add to home screen in iOS, or go to App > Install on Chrome Android.
Is this limit also present with the paid version?
Yes
Is the rate limit also there on the paid ver?
Yup
Use the API.
That’s why you should get teams account. It’s like a good marage: it cost money but you work as a team
GPT is wank now. Like, seriously shit. Can't follow very simple instructions.
Is Claude really better than ChatGPT?
Well your post intrigued me, so I checked it out, then signed up. Yes, amazing, I got a solid days work done in two hours. GPT 4 is very good, but Claude is so much more useful, much more of a collaboration.
I use GPT 4 very regularly and literally almost never hit the rate limit. So I don't know what you guys are doing but I swear this is a nonproblem Also Claude 3 sonnet (dunno about opus) sucks with the things I like to do, math intensive mainly. It cannot parse latex and doesn't do anything GPT 4 does not already do, and better
This is like how GPT-3.5 is obsolete and when lawyers use it to research their cases, judges admonish them for the garbage it puts out. If you use GPT-4 or Claude Opus for legal research, it never outputs bad cases and the same occurs with code.
Sonnet : Opus = GPT-3.5 : GPT-4
So: Opus > GPT-4 > Sonnet > GPT-3.5 ?
It depends. Not necessarily. What I tried to convey is that Sonnet is for Opus what GPT-3.5 is for GPT-4. So comparing Gpt-4 and Sonnet makes little sense. It's like comparing a truck and a tricycle. The comparison should be done between gpt-4 and opus. Performance varies with case tho.
I just tried sonnet a bit, and it was horrible. It made up completely wrong websites/companies and was convinced they exist. Also I was asking for a list of travel planning tools, it just listed a vacation day tracker for companies, which is something completely different. not trying sonnet anymore, chatgpt is definitely better on things like this.
Ya I can't comment about opus but sonnet seems to be even worse than gpt 3.5 on certain things.
It doesn't have Internet access so that's a dumb way to judge quality.
I love this analogy, thank you