Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
corimaith
11 months ago
|
parent
|
context
|
favorite
| on:
Chain-of-thought can hurt performance on tasks whe...
The reasoning emerges from the long distance relations between words picked up by the parallel nature of the transformers. It's why they were so much more performant than earlier RNNs and LSTMs which were using similar tokenization.
Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: