Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
frotaur
6 months ago
|
parent
|
context
|
favorite
| on:
Claude 4 System Card
This is correct. Caching only saves you from having to recompute self attention on the system prompt tokens, but not from the attention from subsequent tokens, which are free to attend to the prompt.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: