Spending Too Much Money on a Coding Agent

151 points by GavinAnderegg 4 days ago

"Now we don't need to hire a founding engineer! Yippee!" I wonder all these people who are building companies that are built on prompts (not even a person) from other companies. The minute there is a rug pull (and there WILL be one), what are you going to do? You'll be in even worse shape because in this case there won't be someone who can help you figure out your next move, there won't be an old team, there will just be NO team. Is this the future?

dotnet00 - a day ago

Probably similar to the guy who was gloating on Twitter about building a service with vibe coding and without any programming knowledge around the peak of the vibe coding madness.
Only for people to start screwing around with his database and API keys because the generated code just stuck the keys into the Javascript and he didn't even have enough of a technical background to know that was something to watch out for.
IIRC he resorted to complaining about bullying and just shut it all down.
- unshavedyak - a day ago
  
  Honestly i'm less scared of claude doing something like that, and more scared of it just bypassing difficult behavior. Ie if you chose a particularly challenging feature and it decided to give up, it'll just do things like `isAdmin(user) { /* too difficult to implement currently */ true }`. At least if it put a panic or something it would be an acceptable todo, but woof - i've had it try and bypass quite a few complex scenarios with silently failing code.
  - alwillis - a day ago
    
    Sounds like a prompting/context problem, not a problem with the model.
    First, use Claude's plan mode, which generates a step-by-step plan that you have to approve. One tip I've seen mentioned in videos by developers: plan mode is where you want to increase to "ultrathink" or use Opus.
    Once the plan is developed, you can use Sonnet to execute the plan. If you do proper planning, you won't need to worry about Claude skipping things.
    
    unshavedyak - 10 hours ago
    
    I wish there was a /model setting to use opus/ultrathink for planning, but sonnet for non planning or something.
    It's a bit annoying having to swap back and forth tbh.
    I also find planning to be a bit vague, where as i feel like sonnet benefits from more explicit instructions. Perhaps i should push it to reduce the scope of the plan until it's detailed enough to be sane, will give it a try
  - WXLCKNO - a day ago
    
    This is by far the most crazy how thing I look out for with Claude Code in particular.
    > Tries to fix some tests for a while > Fails and just .skip the test
    
    Paradigma11 - 20 hours ago
    
    Oh, but it will fix the test if you are not careful.
- apwell23 - a day ago
  
  > around the peak of the vibe coding madness.
  I thought we are currently in it now ?
  - RexySaxMan - a day ago
    
    Yeah, I kind of doubt we've hit the peak yet.
  - dotnet00 - a day ago
    
    I don't actually hear people call it vibe coding as much as I did back in late 2024/early 2025.
    Sure there are many more people building slop with AI now, but I meant the peak of "vibe coding" being parroted around everywhere.
    I feel like reality is starting to sink in a little by now as the proponents of vibe coding see that all the companies telling them that programming as a career is going to be over in just a handful of years, aren't actually cutting back on hiring. Either that or my social media has decided to hide the vibe coding discourse from me.
    
    euazOn - a day ago
    
    The Karpathy tweet came out 2025-02-02. https://x.com/karpathy/status/1886192184808149383
    
    dotnet00 - a day ago
    
    ...my perception of time is screwed... it feels like it's been longer than that...
    
    oc1 - 17 hours ago
    
    all our perception of time seems messed up. claude code came out like 4 months ago and it feels like we had been using this thing for the past years. it feels like every week there is a new breakthrough in ai. it has never been more soul draining than now to be in tech just to keep up to be employable. is this what internet revolution felt like in the early 90s?
    
    rufus_foreman - a day ago
    
    >> back in late 2024/early 2025
    As an old man, this is hilarious.
    
    DonHopkins - 19 hours ago
    
    We can't bust code like we used to, but we have our ways.
    One trick is to write goto statements that don't go anywhere.
    So I ran a bourn shell in my emacs, which was the style at the time.
    Now just to build the source code cost an hour, and in those days, timesheets had hours on them.
    Take my five hours for $20, we'd say.
    They didn't have blue checkmarks, so instead of tweeting, we'd just finger each other.
    The important thing was that I ran a bourn shell in my emacs, which was the style at the time...
    In those days, we used to call it jiggle coding.
- marcosscriven - a day ago
  
  What service was this?
  - dotnet00 - a day ago
    
    Looks like I misremembered the shutting down bit, but it was this guy: https://twitter.com/leojr94_/status/1901560276488511759
    Seems like he's still going on about being able to replicate billion dollar companies' work quickly with AI, but at least he seems a little more aware that technical understanding is still important.
ARandumGuy - a day ago

Any cost/benefit analysis of whether to use AI has to factor in the fact that AI companies aren't even close to making a profit, and are primarily funded by investment money. At some point, either the cost to operate these AI models needs to go down, or the prices will go up. And from my perspective, the latter seems a lot more likely.
- immibis - 12 hours ago
  
  Not really. If they're running at a loss, their loss is your gain. Business is much more short-term than developers imagine it to be for some reason. You don't have to always use an infinitely sustainable strategy - you can change strategies once the more profitable unsustainable strategy stops sustaining.
- v5v3 - a day ago
  
  They are not making money as they are all competing to push the models further and this R&D spending on salaries and cloud/hardware costs.
  Unless models get better people are not going to pay more.
xianshou - a day ago

Rug pulls from foundation labs are one thing, and I agree with the dangers of relying on future breakthroughs, but the open-source state of the art is already pretty amazing. Given the broad availability of open-weight models within under 6 months of SotA (DeepSeek, Qwen, previously Llama) and strong open-source tooling such as Roo and Codex, why would you expect AI-driven engineering to regress to a worse state than what we have today? If every AI company vanished tomorrow, we'd still have powerful automation and years of efficiency gains left from consolidation of tools and standards, all runnable on a single MacBook.
- fhd2 - a day ago
  
  The problem is the knowledge encoded in the models. It's already pretty hit and miss, hooking up a search engine (or getting human content into the context some other way, e.g. copy pasting relevant StackOverflow answers) makes all the difference.
  If people stop bothering to ask and answer questions online, where will the information come from?
  Logically speaking, if there's going to be a continuous need for shared Q&A (which I presume), there will be mechanisms for that. So I don't really disagree with you. It's just that having the model just isn't enough, a lot of the time. And even if this sorts itself out eventually, we might be in for some memorable times in-between two good states.
ChuckMcM - a day ago

Excellent discussion in this thread, captures a lot of the challenges. I don't think we're a peak vibe coding yet, nor have companies experienced the level of pain that is possible here.
The biggest 'rug pull' here is that the coding agent company raises there price and kills you're budget for "development."
I think a lot of MBA types would benefit from taking a long look at how they "blew up" IT and switched to IaaS / Cloud and then suddenly found their business model turned upside down when the providers decided to up their 'cut'. It's a double whammy, the subsidized IT costs to gain traction, the loss of IT jobs because of the transition, leading to to fewer and fewer IT employees, then when the switch comes there is a huge cost wall if you try to revert to the 'previous way' of doing it, even if your costs of doing it that way would today would be cheaper than the what the service provider is now charging you.
- KronisLV - 16 hours ago
  
  > The biggest 'rug pull' here is that the coding agent company raises there price and kills you're budget for "development."
  Spending a bunch of money on GPUs and running them yourself, as well as using tools that are compatible with Ollama/OpenAI type APIs feels like a safe bet.
  Though having seen the GPU prices to get enough memory to run anything decent, I feel like the squeeze is already happening there at a hardware level and options like Intel Arc Pro B60 can't come soon enough!
  - ChuckMcM - 8 hours ago
    
    I don't disagree with this. When running the infrastructure for the Blekko search engine we did the math and after 115 servers worth of cluster it was always cheaper to do it ourselves than with AWS or elsewhere, than after around 1300 servers it is always cheaper to do it on your own space. (where you're paying for the facilities). It was an interesting way to reverse-engineer the colo business model :-)
KronisLV - 16 hours ago

> "Now we don't need to hire a founding engineer! Yippee!"
This feels like a bit of a leap?
That's like saying "I just bought the JetBrains IDE Ultimate pack and some other really cool tools, so we no longer need a founding engineer!" All of that AI stuff can just be a force multiplier and most attempts at outright replacing people with them are a bit shortsighted. Closer to a temporary and somewhat inconsistent freelance worker, if anything.
That said, not wanting to pay for AI tools if they indeed help in your circumstances would also be like saying "What do you need JetBrains IDEs for, Visual Studio Code is good enough!" (and sometimes it is, so even that analogy is context dependent)
I'm reminded of rule 9 of the Joel Test: https://www.joelonsoftware.com/2000/08/09/the-joel-test-12-s...
pshirshov - a day ago

That's why I stick to what I can run locally. Though for most of my tasks there is no big difference between cloud models and local ones, in half the cases both produce junk but both are good enough for some mechanical transformations and as a reference book.
hluska - a day ago

It get even darker - I was around in the 1990s and a lot of people who ran head on into that generation’s problems used those lessons to build huge startups in the 2000s. If we have outsourced a lot of learning, what do we do when we fail? Or how we compound on success?

jbentley1 - a day ago

My Claude Code usage would have been $24k last month if I didn't have a max plan, at least according to Claude-Monitor.

I've been using a tool I developed (https://github.com/stravu/crystal) to run several sessions in parallel. Sometimes I will run the same prompt multiple times and pick the winner, or sometimes I'll be working on multiple features at once, reviewing and testing one while waiting on the others.

Basically, with the right tooling you can burn tokens incredibly fast while still receiving a ton of value from them.

RobinL - a day ago

This is why unlimited plans are always revoked eventually - a small fraction of users can be responsible for huge costs (Amazon's unlimited file backup service is another good example). Also whilst in general I don't think there's much to worry about with AI energy use, burning $24k of tokens must surely be responsible for a pretty large amount of energy
- grafmax - a day ago
  
  > I don't think there's much to worry about with AI energy use
  AI is a large motivating factor in data center build outs, and data centers are projected to form an increasing portion of new energy usage. An individual query may not use much but the macro effect is quite serious, especially considering the climate crisis we are already failing to manage. It’s a bit like throwing plastic out your window on the highway and ignoring the garbage patch floating in the middle of the Pacific.
- octo888 - 19 hours ago
  
  It's possible they have 1,200 users paying $20/120 paying $200 barely even using it.
- spacecadet - a day ago
  
  70,000,000 just last week ;P
  But based on my costs, yours sounds much much higher :)
  - - a day ago
    
    [deleted]
mccoyb - a day ago

Looked at your tool several times, but haven't answered this question for myself: does this tool fundamentally use the Anthropic API (not the normal MAX billing)? Presuming you built around the SDK -- haven't figured out if it is possible to use the SDK, but use the normal account billing (instead of hitting the API).
Love the idea by the way! We do need new IDE features which are centered around switching between Git worktrees and managing multiple active agents per worktree.
Edit: oh, do you invoke normal CC within your tool to avoid this issue and then post-process?
- Jonovono - a day ago
  
  Claude code has an SDK, where you specify the path to the CC executable. So I believe thats how this works. Once you have set up claude code in your environment and authed with however you like, this will just use that executable in a new UI
  - mccoyb - a day ago
    
    Interesting, the docs for auth don't mention it: https://docs.anthropic.com/en/docs/claude-code/sdk#authentic...
    Surprised that this works, but useful if true.
    
    Jonovono - a day ago
    
    https://docs.anthropic.com/en/docs/claude-code/sdk#typescrip...
    `pathToClaudeCodeExecutable`!
    
    mccoyb - a day ago
    
    Thanks for showing!
unshavedyak - a day ago

Max $100 or $200?
I'm on $100 and i'm shocked how much usage i get out of Sonnet, while Opus feels like no usage at all. I barely even bother with Opus since most things i want to do just runout super quick.
- borgel - a day ago
  
  Interesting, I'm fairly new to using these tools and am starting with Claude Code but at the $20 level. Do you have any advice for when I would benefit from stepping up to $100? I'm not sure what gets better (besides higher usage limits).
  - unshavedyak - a day ago
    
    No clue as i've not used Claude Code on Pro to get an idea of usage limits. But, if you get value out of Claude Code and ever run into limits, Max is quite generous for Sonnet imo. I have zero concern about Sonnet usage atm, so it's definitely valuable there.
    Usage for Opus is my only "complaint", but i've used it so little i don't even know if it's that much better than Sonnet. As it is, even with more generous Opus limits i'd probably want a more advanced Claude Code behavior - where it uses Opus to plan and orchestrate, and Sonnet would do the grunt work for cheaper tokens. But i'm not aware of that as a feature atm.
    Regardless, i'm quite pleased with Claude Code on $100 Max. If it was a bit smarter i might even upgrade to $200, but atm it's too dumb to give it more autonomy and that's what i'd need for $200. Opus might be good enough there, but $100 Opus limits are so low i've not even gotten enough experience with it to know if it's good enough for $200
  - vlade11115 - a day ago
    
    I recently switched from Pro to $100 Max, and the only difference I've found so far is higher usage limits. Antropic tends to give shiny new features to Max users first, but as of now, there is nothing Max-only. For me, it's a good deal nonetheless, as even $100 Max limits are huge. While on Pro, I hit the limits each day that I used Claude Code. Now I rarely see the warning, but I never actually hit the limit.
BiteCode_dev - a day ago

There is no way those companies don't loose ton of money on max plans.
I use and abuse mine, running multiple agents, and I know that I'd spend the entire month of fees in a few days otherwise.
So it seems like a ploy to improve their product and capture the market, like usual with startups that hope for a winner-takes-all.
And then, like uber or airbnb, the bait and switch will raise the prices eventually.
I'm wondering when the hammer will fall.
But meanwhile, let's enjoy the free buffet.
v5v3 - a day ago

>My Claude Code usage would have been $24k last month if I didn't have a max plan, at least according to Claude-Monitor.
In their dreams.
qwertox - a day ago

Does Claude Max allow you to use 3rd-party tools with an API key?

rogerkirkness - a day ago

Early stage founder here. You have no idea how worth it $200/month is as a multiple on what compensation is required to fund good engineers. Absolutely the highest ROI thing I have done in the life of the company so far.

lvl155 - a day ago

At this point, question is when does Amazon tell Anthropic to stop because it’s gotta be running up a huge bill. I don’t think they can continue offering the $200 plan for too long even with Amazon’s deep pocket.
- fragmede - a day ago
  
  Inference is cheap to run though, and how many people do you think are getting their $200 worth of it?
  - lvl155 - a day ago
    
    Based on people around me and anecdotal evidence of when Claude struggles, a lot more than you think. I’ve done some analysis on personal use between Openrouter, Amp, Claude API and $200 subscription, I probably save around $40-50/day. And I am a “light” user. I don’t run things in parallel too much.
  - anonzzzies - a day ago
    
    I don't know, I have to figure out another way to count money I guess, but that $200 gives me a lot of worth, far more than 200. I guess if you like sleeping and do other stuff than drive Claude Code all the time, you might have a different feeling. For us it works well.
    
    fragmede - a day ago
    
    My question wasn't if the $200 was worth it to the buyer. Renting an H100 for a month is gonna cost around $1000 ($1.33+/hr). Pretend the use isn't bursty (but really it is). If you could get 6 people on one, the company is making money selling inference.
    
    lvl155 - a day ago
    
    Let me know when you can run Opus on H100.
    
    fragmede - a day ago
    
    I don't understand. Obviously I can't run Opus on an H100, only Anthropic can do that since they are the only ones with the model. I am assuming they are using H100s, and that an all-in cost for an H100 comes to less then $1000/month, and doing some back of the envelope math to say if they had a fleet of H100s at their disposal, that it would take six people running it flat out, for the $200/month plan to be profitable.
    
    WXLCKNO - a day ago
    
    Right but it probably takes like 8-10 H100s to run Claude Opus for inference just memory wise? I'm far from an expert just asking.
    Does "one" Claude Opus instance count as the full model being loaded onto however many GPUs it takes ?
- a day ago

[deleted]

alwillis - a day ago

After reading many of the comments in this thread, I suspect many (not all) issues come from lack of planning and poor prompting.

For anything moderately complex, use Claude's plan mode; you get to approve the plan before turning it loose. The planning phase is where you want to use a more sophisticated model or use extended thinking mode.

Once you have a great plan, you can use a less sophisticated model to execute it.

Even if you're a great programmer, you may suck at prompting. There's an art and a science to prompting; perhaps learn about it? [1]

Don't forget; in addition to telling Claude or any other model what to do, you can also tell them what not to do in the CLAUDE.md or equivalent file.

[1]: https://docs.anthropic.com/en/docs/build-with-claude/prompt-...

lvl155 - a day ago

Is $200/month a lot of money when you can multiply your productivity? It depends but the most valuable currency in life is time. For some, spending thousands a month would be worth it.

cmrdporcupine - a day ago

As I said elsewhere... $200/month etc is potentially not a lot for an employer to pay (though I've worked for some recently who balk at just stocking a snacks tray or drink fridge...).
But $200/month is unbearable for open source / free software developers.
- morkalork - a day ago
  
  It's wild when a company has another department and will shell out $200/month per-head for some amalgamation of Salesforce and other SaaS tools for customer service agents.
  - jermaustin1 - a day ago
    
    At a previous job, my department was getting slashed because marketing was moving over to using Salesforce instead of custom software written in-house. Everything was going swimmingly, until the integration vendor for Salesforce just kept billing, and billing and billing.
    Last I checked no one is still there who was there originally, except the vendor. And the vendor was charging around $90k/mo for integration services and custom development in 2017 when my team was let go. My team was around $10k/mo including rent for our cubicles.
    That was another weird practice I've never seen elsewhere, to pay rent, we had to charge the other departments for our services. They turned IT and infrastructure into a business, and expected it to turn a profit, which pissed off all the departments who had to start paying for their projects, so they started outsourcing all development work to vendors, killing our income stream, which required multiple rounds of layoffs until only management was left.
    
    whstl - 19 hours ago
    
    I've also seen this exact thing happening about 15 years ago.
    Second largest private university of my state, 30000 students. They cut 5 software development positions that were halfway on their rewrite, then purchased a blank slate ERP for 1 million (50% discount, imagine that!), and had spent a few years and around 2-3 million on customising said ERP with consultants by the time I left them.
    
    hluska - a day ago
    
    This is really interesting because I was in business school almost thirty years and a cost accounting professor used almost this exact example, only with photocopiers and fax machines to illustrate how you can cost a company to death.
    He would have considered that company to be running a perfectly controlled cost experiment. Though it was so perfectly controlled they forgot that humans actually did the work. With cost accounting projects, you pay morale and staffing charges well after the project itself was costed.
    I hadn’t thought of that since the late 90s. Good comment but how the heck did I get that old??? :)
    
    bongodongobob - a day ago
    
    IT charging other departments is standard practice at every large company I've been at.
    
    jermaustin1 - 13 hours ago
    
    I guess having only worked in one large organization and only for a couple of years, I had never come across it before. I had even consulted for that company for a few years 10 years prior, and they treated IT a lot differently. That was during a push to insource. So as a consultant, they fired me, years later hired me as an employee, then a new CIO came in and laid off all ICs and replaced us again with consultants.
    They even had one of their vendors extend a job offer to me for slightly more than I was making, but I couldn't in good conscience take that offer. Fool me once, and all that.
    
    mgkimsal - a day ago
    
    I've seen it too - not uncommon. A frustrating angle is vendor lockin. You are required to only use the internal IT team for everything, even if they're far more expensive and less skilled. They can 'charge' whatever they want, and you're stuck with their skills, prices and timeline. Going outside of that requires many levels of signoffs/approvals, and untold amounts of time making your case. There's value in having some central purchasing process, but when you limit your vendors to one (internal or external) you'll creating a lot more problems that you don't need to have.
    
    bongodongobob - a day ago
    
    Well that leads to shadow IT and upper management throwing a shit fit when we can't fix their system we don't know anything about.
  - cmrdporcupine - a day ago
    
    I suspect there's some accounting magic where salaries and software licenses are in one box and "Diet Coke in the fridge" is in another, and the latter is an unbearable cost but the former "OK"
    But yeah, doesn't explain non-payment for AI tools.
    Current job "permits" Claude usage, but does not pay for it.
    
    dontlikeyoueith - a day ago
    
    > Current job "permits" Claude usage, but does not pay for it.
    That seems like the worst of all worlds from their perspective.
    By not paying for it they introduce a massive security concern.
petesergeant - a day ago

> Is $200/month a lot of money when you can multiply your productivity?
My read was the article takes it as a given that $200/m is worth it.
The question in the article seems more: is an extra $800/m to move from Claude Code to an agent using o3 worth it?
nisegami - a day ago

My butt needs to be in this chair 8 hours a day. Whether it takes me 20 hours to do a task or 2 doesn't really matter.
- Fokamul - a day ago
  
  That's your problem, or your company or your country.
  Here in EU, if not stated in your work agreement, it's pretty common people work full time job and also as a self-employed contractor for other companies.
  So when I'm finished with my work, HO of course, I just work on my "contractor" projects.
  Honestly, I wouldn't sign a full time contract banning me from other work.
  And if you have enough customers, you just drop full time job. And just pay social security and health insurance, which you must pay by law anyway.
  And specially in my country, it's even more ridiculous that as self-employed you pay lower taxes than full time employees, which truth to be told are ridiculously high. Nearly 40% of your salary.
  - TheRoque - a day ago
    
    In my country France, your contact May state hours, so you're paid to sit in the chair
    Freelancing as a side hustle may be forbidden if your employer refuses
    And it makes sense to pay more taxes since you also have more social benefits (paid leaves, retirement money and unemployment money), nothing is free
  - lazyasciiart - a day ago
    
    Hmm, not a practice I’ve come across in the EU. What countries specifically are you talking about?
  - jpc0 - 18 hours ago
    
    Not sure about where you are but here the tax rate when not taken as an employee is effectively the maximum individual tax rate, so usually for actual cash it is best to pay yourself a salary from your freelance “business” and handle employees tax.
    There are always loopholes and ways to work around which our tax code will happily discover and kill year on year.
    So you get to pick how you want to pay tax but the amount usually isn’t much different when you get to the highest brackets
  - teiferer - a day ago
    
    > Here in EU, if not stated in your work agreement, it's pretty common people work full time job and also as a self-employed contractor for other companies.
    First time I'm hearing this. Where in the EU are you? I don't know anybody doing this, but it could depend on the country (I'm in the nordics).
  - Tainnor - a day ago
    
    > Here in EU, if not stated in your work agreement, it's pretty common people work full time job and also as a self-employed contractor for other companies.
    Absolutely not a common thing in my corner of the EU.
- - a day ago
  
  [deleted]
- artursapek - a day ago
  
  If you're salaried, you are not a task-based worker. The company pays you a salary for your full day's worth of productive time. If you can suddenly get 5x more done in that time, negotiate a higher salary or leave. If you're actually more productive, they will fight to keep you.
  - henryfjordan - a day ago
    
    Your salary is not determined by your productivity, it's determined by market rates. 5X productivity does not mean 5X salary. Employers prey on labor market inefficiencies to keep the market rates low.
    Any employer with 2 brain cells will figure out that you are more productive as a developer by using AI tools, they will mandate all developers use it. Then that's the new bar and everyone's salary stays the same.
    
    chairmansteve - 21 hours ago
    
    Your salary is determined by whether you play golf with your boss's boss.
  - freehorse - a day ago
    
    Yeah a 20$ plan is prob enough for the AI slop you need to fill in your 8h working time. Unless you have many projects that require more AI slop that is.
- bad_haircut72 - a day ago
  
  This is why communism doesnt work lmao
  - rapind - a day ago
    
    Communism is an ideal but never a reality. What you see in reality is at best an attempt at communism which is quickly derailed by corruption and greed. I mean, it's great to have ideals, but you should also recognize when those ideals are completely impractical given the human condition.
    By the way, this also applies to the "Free market" ideal...
    
    delusional - a day ago
    
    Importantly, problems with the ideal shouldn't preclude good actions that take us in a direction.
    There being problems with absolute libertarian free markets doesn't mean all policies that evoke the free market ideal must be disregarded, nor does the problems with communism mean that all communist actions must be ignored.
    We can see a problem with an ideal, but still wish to replicate the good parts.
    
    rapind - a day ago
    
    Sure. The issue for me is when people intentionally mislabel something to make it look worse.
    For example, mislabelling socialism as communism. The police department, fire department, and roads are all socialist programs. Only a moron would call this communism and yet for some reason universal healthcare...
    There's also this nonsense when someone says "That's the free market at work", and I'm like, if we really lived in a free market then you'd be drinking DuPont's poison right now.
    Using the words "Communism" and "Free market" just show a (often intentional) misunderstanding of the nuance of how things actually work in our society.
    The communism label must be the most cited straw man in all of history at this point.
    
    hooverd - a day ago
    
    for all the lip service capitalists give to the free market, they hate it. their revealed preference is for a monopoly.
    
    nurettin - a day ago
    
    > Communism is an ideal but never a reality
    There is nothing ideal about communism. I'd rather own my production tools and be as productive as I want to be. I'd rather build wealth over trading opportunities, I'd rather hire people and reinvest earnings. That is ideal.
    
    chairmansteve - 21 hours ago
    
    Problem is, you may own the means of production, but most people don't.
    If you don't address that, you'll end up with a "dictatorship of the proletariat".
    
    nurettin - 21 hours ago
    
    That's not a problem anymore. I live in a 2nd world country. Every farmer has a phone, anyone who wants can get their child a laptop. Just because I don't have access to machines which build plane engines doesn't mean I have the right to complain about proletariat. People who invented, invested, earned and built the damn things own them. If that's "dictatorship" that's fine.
    
    - a day ago
    
    [deleted]
    
    rapind - a day ago
    
    I think you're missing the point. Communism doesn't actually exist in the real world. In fact you are right now using it as a straw man (my entire point).
    Who in the actual real world with any authority at all is telling you you can't be as productive as you want to be, build wealth, hire people, and reinvest your earnings?
    
    nurettin - a day ago
    
    I responded to the main points in the communist manifesto. It is clearly against putting a price on labor and declares hiring as exploitative practice. Clearly against individual capital. Clearly against individual products since capital is a "social power". That manipulative weasel. I can't even own a frickin laptop because it is a means of production and thus "state-owned".
    Just because it hasn't been "successfully implemented" according to your personal opinion doesn't mean it cannot be scrutinized.
    That's like if there is a sign that says "do not cross 3km/h" when someone says "that's too slow" you go "a-hah! straw man! How do you know you can't go 300kmph with that in place? nobody implemented that sign before!". Socrates would be proud.
    
    rapind - 12 hours ago
    
    > I responded to the main points in the communist manifesto.
    OK but that's irrelevant to my post. There's lots of books and manifestos that say lots of stupid things. You're arguing as if this manifesto is a real threat, and I'm saying "show me this threat". This isn't a real person with any impact on your day to day, like say a politician. It's a fantasy opposition.
    > Just because it hasn't been "successfully implemented" according to your personal opinion doesn't mean it cannot be scrutinized.
    OK sure, where? Where is this real world communism that meets the manifesto you are railing against?
    > That's like if there is a sign that says "do not cross 3km/h" when someone says "that's too slow" you go "a-hah! straw man! How do you know you can't go 300kmph with that in place? nobody implemented that sign before!". Socrates would be proud.
    OK that's an awkward analogy. It's more like someone wrote a manifesto that said cars shouldn't go over 3km/h and you want to use this "slow manifesto" to argue that any laws that would slow you down are some sort of slippery slope in to "slowmunism".
    No one with any authority in the real world is trying to implement the communist manifesto on to you. Not even the terrifying Bernie Sanders wants anything to do with communism. For the love of god, there is no communist threat. You can relax.
    
    nurettin - 10 hours ago
    
    The communist manifesto is what basically communism is. It is a real document billions swear by.
    But I get it. You are basically arguing that nothing and nobody exists or ever existed or do or does anything to anything or anyone or had any ideas and arguing ideas or what people do or could do or would do is pointless.
    Well, have fun with that. Sorry all this thread space was a waste.
    
    rapind - 6 hours ago
    
    > But I get it. You are basically arguing that nothing and nobody exists or ever existed or do or does anything to anything or anyone or had any ideas and arguing ideas or what people do or could do or would do is pointless.
    And so now you are just putting words in my mouth I assume because you have no argument. I can’t even parse this.
    You started an argument with a stance I never took by railing against a bogeyman I never advanced. And now you’re doing it again.
    If communism was anything more than an impractical ideal then you should have been able to point out where it actually exists. But of course it doesn’t exist. It’s just a fantasy. Maybe you want it to exist so you can point a finger and say “see what happens when you don’t do what I want?”?
    
    whattho - a day ago
    
    [dead]
  - - a day ago
    
    [deleted]
  - tough - a day ago
    
    maybe the issue is capitalism where even if your productivity multiplies x100
    your salary stays x1
    and your work hours stay x1
    
    darth_avocado - a day ago
    
    More accurate representation is this:
    Productivity multiplies x2 You keep your job x0.5 Your salary x0.8 (because the guy we just fired will gladly do your job for less) Your work hours x1.4 (because now we expect you to do the work of 2 people, but didn’t account for all the overhead that comes with it)
    
    koakuma-chan - a day ago
    
    But aren't you supposed to be incentivized to work harder by having equity?
    
    rimunroe - a day ago
    
    Equity is a lottery ticket. Is sacrificing my happiness or life balance in the near term worth the gamble that A) my company will be successful, and B) that my equity won’t have been diluted to worthlessness by the time that happens? At higher levels of seniority/importamce/influence this might make sense, but for most people I seriously doubt it does, especially early in their careers.
    
    tough - a day ago
    
    As a non-founder / not a VC you max get a few percentage points, and its mostly paper toilet money until there's an exit or IPO, and the founders will always try to squeeze you if they can, not because they're bad people, but because the system incentivises it. (you'll keep getting diluted in future rounds)
    tbh, if im gonna bust my ass I'd rather own the thing.
    
    chillingeffect - a day ago
    
    A recent job offer for a startup was a 5 year vest with a 2 year cliff. Seriously?
    
    adastra22 - a day ago
    
    That doesn’t happen anywhere outside of Silicon Valley.
    
    tough - a day ago
    
    And even in Silicon Valley you get the survivor ship bias of the 1% of companies getting to IPO and making their employees decent exit stories...
    99% of startups die off worthless and your equity never realises.
    
    dfee - a day ago
    
    Quite literally not.
    Capitalism encourages you to put your butt in your own seat and reap the rewards of your efforts.
    Of course it also provides you the decision making to keep your butt in someone else’s seat if the risk vs. reward of going your own isn’t worth it.
    And then it allows your employer to put another butt in your seat if you don’t adopt efficiency patterns.
    So: capitalism is compatible with communism as an option, but it’s generally a suboptimal option for one or both parties.
    
    hiddencost - a day ago
    
    No it doesn't. People tell that story but the system is incredibly heavily leveraged to prevent that.
    
    hyperliner - a day ago
    
    [dead]
    
    - a day ago
    
    [deleted]
    
    tough - a day ago
    
    Maybe in a true -capitalistic- market that'd happen.
    but the state keeps meddling and making oligarchs and friends have unfair advantages.
    It's hard to compete when the system is rigged from the start.
    
    dfee - a day ago
    
    also a fair point :)
    
    p_l - a day ago
    
    Capitalism is exactly about amassing capital to make others reliant on capitalist providing capital for the tools necessary to do the work, then extracting rent from the value produced.
    In true capitalist market you end up with oligarchy.
    
    hyperliner - a day ago
    
    [dead]
  - nisegami - a day ago
    
    I am literally describing my life in a capitalist society....
    
    bee_rider - a day ago
    
    I think that was the joke
whattho - a day ago

[dead]

chis - a day ago

Has anyone else done this and felt the same? Every now and then I try to reevaluate all the models. So far it still feels like Claude is in the lead just because it will predictably do what I want when given a mid-sized problem. Meanwhile o3 will sometimes one-shot a masterpiece, sometimes go down the complete wrong path.

This might also just be a feature of the change in problem size - perhaps the larger problems that necessitate o3 are also too open-ended and would require much more planning up front. But at that point it's actually more natural to just iterate with sonnet and stay in the driver's seat a bit. Plus sonnet runs 5x faster.

feintruled - a day ago

Interesting. Though it seems they are themselves building Agentic AI tooling. It's vibe coding all the way down - when's something real going to pop out the bottom?

z3c0 - a day ago

An LLM salesman assuring us that $1000/mo is a reasonable cost for LLMs feels a bit like a conflict of interests, especially when the article doesn't go into much detail about the code quality. If anything, their assertion that one should stick to boring tech and "have empathy for the model" just reaffirms that anybody doing anything remotely innovative or cutting-edge shouldn't bother too much with coding agents.
bunjeejumpn - a day ago

[dead]

jasonthorsness - a day ago

I really hope we can avoid metered stuff for the long-term. One of the best aspects of software development is the low capital barrier to entry, and the cost of the AI tools right now is threatening that.

I'm fortunate in that my own use of the AI tools I'm personally paying for is squished into my off-time on nights and weekends, so I get buy with a $20/month Claude subscription :).

SomewhatLikely - a day ago

It's pretty damn capital intensive to be a productive farmer today. That said, AI will likely, hopefully, get cheaper over time.

bicepjai - a day ago

I can see how pricing at 100 to 200$ per month per employee could make sense for companies, it’s a clear value proposition at that scale. But for personal projects and open source work, it feels out of reach. I’d really like to see more accessible pricing tiers for individuals and hobbyists. Pay per token models don’t work for me either; earlier this year, I racked up almost $1,000 in a month just experimenting with personal projects, and that experience has made me wary of using these tools since.

Sources

teiferer - a day ago

> But for personal projects and open source work, it feels out of reach
Is it? Many hobbies cost much more money. A nice bike (motorbike or road bike, doesn't matter), a sailing boat, golf club/trips, a skiing season pass ... $100/month is significantly less than what you'd burn with those other things. Sure you can program in your free time without such a subscription, and if you enjoy that then by all means, but if it takes away the grunt work and you are having more fun, I don't see the issue.
Gym memberships are in that order of magnitude too, even though you could use some outdoor gym in a city park for free. Maybe those indoor perks of heating, lights, roof and maintained equipment are worth sth? Similar with coding agents for personal projects...
- a day ago

[deleted]
dist-epoch - a day ago

Github Copilot has unlimited GPT-4.1 for $10/month.
- adhamsalama - a day ago
  
  And you can use it as an API, so you can plug it as an OpenAI compatible LLM provider into any 3rd party tool that uses AI, for free.
  That's the only reason I subscribed to GitHub Copilot. Currently using it for Aider.
  - ianandrich - 19 hours ago
    
    Say more. Is the API access unlimited? What models are available?
    
    adhamsalama - 7 hours ago
    
    It's unlimited for the models that have unlimited use in GitHub Copilot, gpt-4.1.
- indigodaddy - a day ago
  
  is GPT-4.1 decent for coding?
indigodaddy - a day ago

I’ve seen some people describe getting pretty good value out of the Claude $20 plan with Claude Code?
- stpedgwdgfhgdd - a day ago
  
  Pro is fine for medium sized projects, stick to 1 terminal.
  - indigodaddy - 4 hours ago
    
    couple hours a day or?
fakedang - a day ago

> Pay per token models don’t work for me either; earlier this year, I racked up almost $1,000 in a month just experimenting with personal projects, and that experience has made me wary of using these tools since.
Can't have your cake and eat it too.
Behold the holy trifecta of: Number of Projects - Code Quality - Coding Agent Cost

quonn - a day ago

Charging $200/month is economically only possible if there is not a true market for LLMs or some sort of monopoly power. Currently there is no evidence that this will be the case. There are already multiple competitors and the barrier to entry is relatively low (compared to e.g. the car industry or other manufacturing industries), there are no network effects (like for social networks) and no need to get the product 100% right (like compatibility to Photoshop or Office) and the prices for training will drop further. Furthermore $200 is not free (like Google).

Can anyone name one single widely-used digital product that does _not_ have to be precisely correct/compatible/identical to The Original and that everyone _does_ pay $200/month for?

Therefore, should prices that users pay get anywhere even close to that number, there will naturally be opportunities for competitors to bring prices down to a reasonable level.

lvl155 - a day ago

Barrier to entry is actually very very high. Just because we have “open source” models doesn’t mean anyone can enter. And the gap is widening now. I see Anthropic/OpenAI as clear leaders. Opus 4 and its derivative products are irreplaceable for coders since Spring 2025. Once you figure it out and have your revelation, it will be impossible to go back. This is an iPhone moment right now and the network effect will be incredible.
- mathiaspoint - a day ago
  
  It's all text and it's all your text. There's zero network effect.
  - lvl155 - a day ago
    
    And that’s how it’s been forever. If your competitor is doing 10x your work, you will be compelled to learn. If someone has a nail gun and you’re using a hammer, no one’s saying “it’s all nails.” You will go buy a nail gun.
    
    mathiaspoint - a day ago
    
    Network affects come from people building on extra stuff. There's no special sauce with these models, as long as you have an inference endpoint you can recreate anything yourself with any of the models.
    As to the nailgun thing, that's an interesting analogy, I'm actually building my own house right now entirely with hand tools, it's on track to finish in 1/5 the time some of this mcmansions do with 1/100th of the cost because I'm building what I actually need and not screwing around with stuff for business reasons. I think you'll find software projects are more similar to that than you'd expect.
chis - a day ago

I think you forgot to consider the cost of providing the inference.
- quonn - a day ago
  
  Well, that could be an additional problem.
  My point was not that AI will necessarily be cheaper to run than $200, but that there is not much profit to be made. Of course the cost of inference will form a lower bound on the price as well.

pshirshov - a day ago

> Use boring technology: LLMs do much better with well-documented and well-understood dependencies than obscure, novel, or magical ones. Now is not the time to let Steve load in a Haskell-to-WebAssembly pipeline.

If we all go that way, there might be no new haskells and webassemblies in the future.

emrehan - a day ago

LLMs can read documentations for a language and use it as well as human engineers.
"given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content"
Source: Gemini 1.5's paper from March 2024 https://storage.googleapis.com/deepmind-media/gemini/gemini_...
mathiaspoint - a day ago

I think there certainly will, it will just mean that only people who can function independently of the AI will have access to them a few years before everyone else.

tomjuggler - 13 hours ago

I believe this story, but being from a third world country it's not feasible to spend anywhere near that much. Also, as others have mentioned I am wary of "rug-pulls" when it comes to proprietary models and services. That is why I am all-in on Deepseek currently, with Aider (and Roo for MCP integration). When the main api is lagging I just switch to the same model with a different provider on OpenRouter. Theoretically I could host my own if I got that busy.

Solo developer doing small jobs but I code every day and $10 per month would be a busy month for me. I still read every line of code though..

georgeecollins - a day ago

I am blown away that you can get a founding engineer for $10k / month. I guess that is not counting stock options, in which case it makes sense. But I think if you include options the opportunity cost is much higher. IMO great engineers are worth a lot, no shade.

hoistbypetard - a day ago

> literally changing failing tests into skipped tests to resolve “the tests are failing.”

Wow. It really is like a ridiculous, over-confident, *very* junior developer.

mathiaspoint - a day ago

I can't imagine using something like this and not self hosting. Moving around in your editor costs money? That would completely crush my velocity.

nico - a day ago

The article reads almost like an ad for o3 and spending a lot of money on LLM APIs

In my experience, o4-mini-high is good enough, even just through the chat interface

Cursor et al can be more comfy because they have access to the files directly. But when working on a sufficiently large/old/complex code base, the main limitation is the human in the loop and managing context, so things end up evening out. Not only that, but a lot of times it’s just easier/better to manually feed things to ChatGPT/Claude - that way you get to more carefully curate and understand the tasks and the changes

I still haven’t seen any convincing real life scenario with larger in-production code bases in which agents are able to autonomously write most of the code

If anyone has a video/demo, would love to see it

cma - a day ago

It's faster than me at drilling through all the layers of abstractions in large codebases to answer questions about how something is implemented and where the actual calculation or functionality gets done, so with that alone it's much more useful than web chat interfaces.
- jpc0 - 17 hours ago
  
  I know it is difficult to run software locally in some instances but at this point I feel it is probably quicker to implement tracing or improving the local development flow.
  A breakpoint in a debugger is much much quicker than feeding the AI all the context needed and then confirming it didn’t miss some flow in some highly abstract code
- nico - a day ago
  
  Yes, when initially starting out the task/feature. But also most likely the agent won’t be able to fully do it on its own, and at that point the agent is not necessarily that much more useful than the chat
  Also, local agents miss context all the time, at which point you need to manually start adding/managing the context anyway
  And, if you are regularly working on the codebase, at some point you’ll probably have better in-brain initial context than the agent

tabs_or_spaces - a day ago

Since this is a business problem.

* It's not clear on how much revenue or new customers is generated by using a coding agent

* It's not clear on how things are going on production. There's only talks about development in the article

I feel ai coding agents will give you the edge. Just this article doesn't talk about revenue or PnL side of things, just perceived costs saved from not employing an engineer.

v5v3 - a day ago

Yes. A company needs measurable ROI and isn't going to spend $200 a month per seat on Claude.
It will instead sign a deal with Microsoft for ai that is 'good enough' and limit expensive ai to some. Or being in the big consultancys as usual to do the projects.

scrubs - 16 hours ago

The one thing AI hates and doesn't want you to know about AI:

Having typing skills >= 120 wpm will triple your efficacy.

cmrdporcupine - a day ago

I get a lot of value out of Claude Max at $100 USD/month. I use it almost exclusively for my personal open source projects. For work, I'm more cautious.

I worry, with an article like this floating around, and with this as the competition, and with the economics of all this stuff generally... major price increases are on the horizon.

Businesses (some) can afford this, after all it's still just a portion of the costs of a SWE salary (tho $1000/m is getting up there). But open source developers cannot.

I worry about this trend, and when the other shoe will drop on Anthropic's products, at least.

mring33621 - a day ago

Those market forces will push the thriftier devs to find better ways to use the lesser models. And they will probably share their improvements!
I'm very bullish on the future of smaller, locally-run models, myself.
- cmrdporcupine - a day ago
  
  I have not invested time on locally-run, I'm curious if they could even get close to approaching the value of Sonnet4 or Opus.
  That said, I suspect a lot of the value in Claude Code is hand-rolled fined-tuned heuristics built into the tool itself, not coming from the LLM. It does a lot of management of TODO lists, backtracking through failed paths, etc which look more like old-school symbolic AI than something the LLM is doing on its own.
  Replicating that will also be required.
barrkel - a day ago

Where do you see the major price increases coming from?
The underlying inference is not super expensive. All the tricks they're pulling to make it smarter certainly multiply the price, but the price being charged almost certainly covers the cost. Basic inference on tuned base models is extremely cheap. But certainly it looks like Anthropic > OpenAI > Google in terms of inference cost structure.
Prices will only come up if there's a profit opportunity; if one of the vendors has a clear edge and gains substantial pricing power. I don't think that's clear at this point. This article is already equivocating between o3 and Opus.
csomar - a day ago

If it weren't for the Chinese, the prices would have been x10.
stpedgwdgfhgdd - a day ago

Just a matter of time before AI coding becomes commodity and prices drop. 2027

suralind - a day ago

How does GitHub Copilot stack against API access directly from OpenAI, etc.? Is it faster to use API keys than Copilot?

v5v3 - a day ago

No need to use the most expensive models for every query? Use it for the ones the cheaper models don't do well.

logifail - a day ago

Q: Can you tell in advance whether your query is one that's worth paying more for a better answer?
- v5v3 - a day ago
  
  Most programmers are not asking ai to re-write the whole app or convert C to Rust.
  You wouldn't gain anything from asking the most expensive model to adjust some css.

butlike - a day ago

I love how paying for prompts stuck. Like, if someone's going to do your homework for you, they should get compensated.

suninsight - a day ago

So what we do at NonBioS.ai is to use a cheaper model to do routine tasks, but switch to a higher thinking model seamlessly if the agent get stuck. Its most cost efficient, and we take that switching cost away from the engineer.

But broadly agree to the argument of the post - just spending more might still be worth it.

andrewstuart - a day ago

I must be holding OpenAI wrong.

Everyone time I try it I find it to be useless compared to Claude or Gemini.

nickjj - a day ago

Serious question, how do you justify paying for any of this without feeling like it's a waste?

I occasionally use ChatGPT (free version without logging in) and the amount of times it's really wrong is very high. Often times it takes a lot of prompting and feeding it information from third party sources for it to realize it has incorrect information and then it corrects itself.

All of these prompts would be using money on a paid plan right?

I also used Cursor (free trial on their paid plan) for a bit and I didn't find much of a difference. I would say whatever back-end it was using was possibly worse. The code it wrote was busted and over engineered.

I want to like AI and in some cases it helps gain insight on something but I feel like literally 90% of my time is it prodiving me information that straight up doesn't work and eventually it might work but to get there is a lot of time and effort.

BeetleB - a day ago

Try with serious models. Here's what I would suggest:
1. Go to https://aider.chat/docs/leaderboards/ and pick one of the top (but not expensive) models. If unsure, just pick Gemini 2.5 Pro (not Flash).
2. Get API access.
3. Find a decent tool (hint: Aider is very good and you can learn the basics in a few minutes).
4. Try it on a new script/program.
5. (Only after some experience): Read people's detailed posts describing how they use these tools and steal their ideas.
Then tell us how it went.
abdullahkhalids - a day ago

Depends on how much you use. I use AI to think through code and other problems, and write the dumb parts of code. Claude definitely works much better than the free offerings. I use OpenRouter [1] and spend only a couple of dollar per month on AI usage. It's definitely worth it.
[1] https://openrouter.ai No affiliation
jonfw - a day ago

The AI agents that run on your machine where they have access to the code, and tools to browse/edit the code, or even run terminal commands are much more powerful than a simple chatbot.
It took some time for me to learn how to use agents, but they are very powerful once you get the hang of it.
- josefresco - a day ago
  
  > much more powerful than a simple chatbot
  Claude Pro + Projects is a good middle ground between the two. Things didn't really "click" for me as a non-developer until I got access to both.
chis - a day ago

I can't believe people are still writing comments like this lol how can it be
- zzzeek - a day ago
  
  I think it's a serious question because something really big is being missed here. There seem to be very different types of developers out there and/or working on very different kinds of codebases. Hypothetically, maybe you have devs or specific contexts where the dev can just write the code really fast where having to explain it to a bot is more time consuming, vs. devs /contexts where lots of googling and guessing goes on and it's easier to get the AI to just show you how to do it.
  I'm actually employer mandated to continue to try/use AI bots / agents to help with coding tasks. I'm sort of getting them to help me but I'm still really not being blown away and still tending to prefer not to bother with them with things I'm frequently iterating on, they are more useful when I have to learn some totally new platform/API. Why is that? do we think there's something wrong with me?
  - vineyardmike - a day ago
    
    > I'm actually employer mandated to continue to try/use AI bots / agents to help with coding tasks
    I think a lot of this comes down to the context management. I've found that these tools work worse at my current employer than my prior one. And I think the reason is context - my prior employer was a startup, where we relied on open source libraries and the code was smaller, following public best practices regarding code structure in Golang and python. My current employer is much bigger, with a massive monorepo of custom written/forked libraries.
    The agents are trained on lots of open source code, so popular programming languages/libraries tend to be really well represented, while big internal libraries are a struggle. Similarly smaller repositories tend to work better than bigger ones, because there is less searching to figure out where something is implemented. I've been trying some coding agents with my current job, and they spend a lot more time searching through libraries looking to understand how to implement or use something if it relies on an internal library.
    I think a lot of these struggles and differences are also present with people, but we tend to discount this struggle because people are generally good at reasoning. Of course, we also learn from each task, so we improve over time, unlike a static model.
benbayard - a day ago

I'd try out cursor with either o3 or Claude 4 Opus. The free version of ChatGPT and Claude in Cursor are much better. That's also what this article claims and is true in my experience.
vineyardmike - a day ago

> Serious question, how do you justify paying for any of this without feeling like it's a waste?
I would invert the question, how can you think it's a waste (for OP) if they're willing to spend $1000/mo on it? This isn't some emotional or fashionable thing, they're tools, so you'd have to assume they derive $1000 of value.
> free version... the amount of times it's really wrong is very high... it takes a lot of prompting and feeding it information from third party
Respectfully, you're using it wrong, and you get what you paid for. The free versions are obviously inferior, because obviously they paywall the better stuff. If OP is spending $50/day, why would the company give you the same version for free?
The original article mentions Cursor. With (paid) cursor, the tool automatically grabs all the information on behalf of the user. It will grab your code, including grepping to find the right files, and it will grab info from the internet (eg up to date libraries, etc), and feed that into the model which can provide targeted diffs to update just select parts of a file.
Additionally, the tools will automatically run compiler/linter/unit tests to validate their work, and iterate and fix their mistakes until everything works. This write -> compile -> unit test -> lint loop is exactly what a human will do.
- nickjj - a day ago
  
  > Respectfully, you're using it wrong, and you get what you paid for.
  I used the paid (free trial) version of Cursor to look at Go code. I used the free version of ChatGPT for topics like Rails, Flask, Python, Ansible and various networking things. These are all popular techs. I wouldn't describe either platform as "good" if we're measuring good by going from an idea to a fully working solution with reasonable code.
  Cursor did a poor job. The code it provided was mega over engineered to the point where most of the code had to be thrown away because it missed the big picture. This was after a lot of very specific prompting and iterations. The code it provided also straight up didn't work without a lot of manual intervention.
  It also started to modify app code to get tests to pass when in reality the test code was the thing that was broken.
  Also it kept forgetting things from 10 minutes ago and repeating the same mistakes. For example when 3 of its solutions didn't work, it started to go back and suggest using the first solution that was confirmed to not work (and it even output text explaining why it didn't work just before).
  I feel really bad for anyone trusting AI to write code when you don't already have a lot of experience so you can keep it in check.
  So far at best I barely find it helpful for learning the basics of something new or picking out some obscure syntax of a tool you don't well after giving it a link to the tool's docs and source code.
  - BeetleB - a day ago
    
    > I feel really bad for anyone trusting AI to write code when you don't already have a lot of experience so you can keep it in check.
    You definitely should be skilled in your domain to use it effectively.
- pxc - a day ago
  
  > This isn't some emotional or fashionable thing, they're tools, so you'd have to assume they derive $1000 of value.
  If someone spends a lot of money on something but they don't derive commensurate value from that purchase, they will experience cognitive dissonance proportional to that mismatch. But ceasing or reversing such purchases are only some of the possibilities for resolving that dissonance. Another possibility is adjusting one's assessment of the value of that purchase. This can be subconscious and automatic, but it an also involve validation-seeking behaviors like reading positive/affirming product reviews.
  In this present era of AI hype, purchase-affirming material is very abundant! Articles, blog posts, interviews podcasts, HN posts.. there's lots to tell people that it's time to "get on board", to "invest in AI" both financially and professionally, etc.
  How much money people have to blow on experiments and toys probably makes a big difference, too.
  Obviously there are limits and caveats to this kind of distortion. But I think the reality here is a bit more complicated than one in which we can directly read the derived value from people's purchasing decisions.
- klank - a day ago
  
  > This isn't some emotional or fashionable thing, they're tools, so you'd have to assume they derive $1000 of value.
  This is not born out in my personal experience at all. In my experience, both in the physical and software tool worlds, people are incredibly emotional about their tools. There are _deep_ fashion dynamics within tool culture as well. I mean, my god, editors are the prima donna of emotional fashion running roughshod over the developer community for decades.
  There was a reason it was called "Tool Time" on Home Improvement.
- a day ago

[deleted]

throwaway984393 - a day ago

[dead]

deadbabe - a day ago

I find it kind of boggling that employers spend $200/month to make employees lives easier, for no real gain.

That’s right. Productivity does go up, but most of these employees aren’t really contributing directly to revenue. There is no code to dollar pipeline. Finishing work faster means some roadmap items move quicker, but they just move quicker toward true bottlenecks that can’t really be resolved quickly with AI. So the engineers sit around doing nothing for longer periods of time waiting to be unblocked. Deadlines aren’t being estimated tighter, they are still as long as ever.

Enjoy this time while it lasts. Someday employers might realize they need to hire less and just cram more work into individual engineers schedules, because AI should supposedly make work much easier.

francisofascii - a day ago

> Someday employers might realize they need to hire less and just cram more work into individual engineers schedules
We are already past that point. The high water mark for Devs was ironically in late 2020 during Covid, before RTO when we were in high demand.
jayd16 - a day ago

There's been pretty widespread layoffs in tech for a few years now.
jajko - a day ago

Coding an actual solution is what, 5-10% of the overall project time?
I dont talk about some SV megacorps where better code can directly affect slightly revenue or valuation and thus more time is spend coding and debugging, I talk about basically all other businesses that somehow need developers.
Even if I would be 10x faster project managers would barely notice that. And I would lose a lot of creative fun that good coding tends to bring. Also debugging, 0 help there its all on you and your mind and experience.
Llms are so far banned in my banking megacorp and I aint complaining.
teiferer - a day ago

[dead]

delduca - a day ago

I just pay $20/month on ChatGPT and spend the entire day coding with its help, no need to pay for tokens, no need to integrate it on your IDE.