Stop Loading Your Entire Instruction System Into Every Session
Most people talk about better prompts. Hardly anyone talks about what happens before every prompt: the instructions the assistant loads into the context before the actual work begins. Depending on the system, you pay for that in different ways: input tokens, latency, reduced available context, or si








