this post was submitted on 19 Jun 2026
Technology
My employer has been pushing AI as well, so I’ve been using Claude to help me when it makes sense. I’ve had it write Python scripts to interact with various APIs, had it analyze log files, etc. I might use it two or three times every other day for 10 to 30 minutes at a time. So while I’m not using it excessively, I’m still using it what I consider a reasonable amount.
I checked my Claude account the other day and it said I have used a whopping 1% of my quota for the past month. That really makes me wonder how all these companies are burning through their usage the way they are reporting. Do they have teams of employees now that are doing nothing but interacting with AIs for 8 hours a day? It seems like that would be the only way for me to put a dent in my monthly quota with Claude…
You can use it all day and stay well below the quota. Small context, with the right model for the job. Surgical precision.
But... at some point you shut off your brain, use the most expensive model on the highest reasoning level with your whole codebase as context, and just wait for tens of minutes while it burns all the tokens. To speed this up, you then send six agents to tackle the same problem from all angles.
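To put rough numbers on that, here is a back-of-the-envelope sketch. Every figure in it is made up for illustration (context sizes, output sizes, turn counts), and the `tokens_used` helper is hypothetical. It assumes the full context is resent on every turn and ignores prompt caching, reasoning tokens, and per-model pricing, so treat the result as an order-of-magnitude guess at best.

```python
def tokens_used(context_tokens: int, output_tokens: int, turns: int, agents: int = 1) -> int:
    """Rough total tokens: each turn resends the context, and each agent does so independently."""
    return agents * turns * (context_tokens + output_tokens)


# "Surgical" use: a small slice of context, one agent (numbers are assumptions).
surgical = tokens_used(context_tokens=4_000, output_tokens=1_000, turns=20)

# "Brain off" use: whole codebase as context, six agents in parallel (numbers are assumptions).
brute_force = tokens_used(context_tokens=400_000, output_tokens=5_000, turns=20, agents=6)

ratio = brute_force // surgical
print(f"surgical: {surgical:,} tokens")
print(f"brute force: {brute_force:,} tokens ({ratio}x)")
```

With these invented inputs the second approach burns a few hundred times more tokens for the same number of turns, which is roughly the gap between "1% of my quota" and "quota gone by Tuesday".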
I've been experimenting a bit with adding LLMs into my workflow, and even when I use them constantly for a full 8h workday, I barely use any of my quota. I'm guessing that those who burn through an excessive number of tokens are probably just letting a bunch of them run unattended and automatically allowing everything. There's just no way to verify that much of their output.
People are using agentic harness software, where the LLM streams output directly to your computer terminal. With this setup the harness recognizes structured commands from the LLM, which lets it read files on your PC using local tools. This supposedly lets the LLM figure out how to engage with the project better. I’ve consumed 500k tokens in no time using this approach.
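For anyone who hasn't seen one, the loop described above looks roughly like this. It's a toy sketch, not how any particular harness works: the JSON command format, the `read_file` tool, `fake_llm`, and `run_harness` are all hypothetical names I made up, and the "model" is a scripted stand-in so the example runs offline. It counts characters sent rather than real tokens.

```python
import json


def read_file(path: str) -> str:
    """Local tool the harness exposes to the model (hypothetical)."""
    with open(path, encoding="utf-8") as f:
        return f.read()


TOOLS = {"read_file": read_file}


def fake_llm(transcript: list[str]) -> str:
    """Scripted stand-in for a real model: request one file, then answer."""
    if not any(m.startswith("TOOL RESULT") for m in transcript):
        # Assumes the task's last word is the file path.
        return json.dumps({"tool": "read_file", "path": transcript[0].split()[-1]})
    return "DONE: summarized the file"


def run_harness(task: str, llm=fake_llm, max_steps: int = 5) -> tuple[str, int]:
    """Run the model/tool loop; return the final reply and total characters sent."""
    transcript = [task]
    chars_sent = 0
    for _ in range(max_steps):
        # The whole transcript goes back to the model on every step.
        chars_sent += sum(len(m) for m in transcript)
        reply = llm(transcript)
        try:
            command = json.loads(reply)
        except json.JSONDecodeError:
            return reply, chars_sent  # plain text means the model is done
        if not isinstance(command, dict) or command.get("tool") not in TOOLS:
            return reply, chars_sent
        result = TOOLS[command["tool"]](command["path"])
        transcript += [reply, "TOOL RESULT: " + result]
    return "gave up", chars_sent
```

The thing to notice is that every file the model reads gets appended to the transcript and resent on each later step, so usage grows with the size of what it reads times the number of steps. That's how 500k tokens disappears quickly.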
We had people using it to reply to emails. And using it for personal purposes.