Mike Kelley on Nostr: I would suspect the reason you're seeing higher token usage in Claude is because the ...
I would suspect the reason you're seeing higher token usage in Claude is that the maximum context window for GPT 5.3 is 400k tokens, whereas the maximum context window of Claude 4.6 is 1 million. Therefore, each prompt request can carry a whole lot more information. The beast is hungry and it loves to be fed!