So context size actually helps with this, relative to how LLMs are actually deployed as applications. For example, if you look at how the “continue” option in the DeepSeek web app works for code gen, what they’re likely doing is reinserting the prior messages (in some form) into a new request to prompt further completion. The more context a model has and can manage successfully, the better it will likely be at generating longer code blocks.
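A rough sketch of what that “continue” pattern might look like, assuming an OpenAI-compatible chat completions endpoint (the model name, base URL, and the exact messages the DeepSeek app reinserts are guesses on my part, not anything documented):

```python
from openai import OpenAI

# Assumption: an OpenAI-compatible endpoint; point base_url/model at whatever you use.
client = OpenAI()

history = [{"role": "user", "content": "Write a CSV parser in Python."}]
resp = client.chat.completions.create(model="deepseek-chat", messages=history)
partial = resp.choices[0].message.content  # may be cut off at the output limit

# "Continue": feed the truncated output back in as context and ask for more.
history += [
    {"role": "assistant", "content": partial},
    {"role": "user", "content": "Continue exactly where you left off."},
]
resp = client.chat.completions.create(model="deepseek-chat", messages=history)
full_code = partial + resp.choices[0].message.content
```

The whole prior exchange has to fit in the context window on that second call, which is why a bigger (well-managed) context directly translates into longer code you can stitch together this way.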
Isn't the input/output length distinction arbitrary? Under the hood, the output becomes the input for the next token at each step. OpenAI may charge you more $$ by forcing you to add the output to the input and call the API again, but running locally you don't have that issue.
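Right, that's just the decode loop. A minimal sketch with Hugging Face transformers (greedy decoding, no KV cache, gpt2 purely as a stand-in model) showing that each generated token is simply appended to the input for the next forward pass:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

ids = tok("def fib(n):", return_tensors="pt").input_ids
with torch.no_grad():
    for _ in range(20):
        logits = model(ids).logits              # forward pass over the whole sequence so far
        next_id = logits[0, -1].argmax()        # greedy pick of the next token
        ids = torch.cat([ids, next_id.view(1, 1)], dim=1)  # "output" becomes part of the input
print(tok.decode(ids[0]))
```

The only distinction the API pricing encodes is which tokens were supplied versus sampled; the model itself sees one growing sequence either way.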