Promising results from DeepSeek R1 for code

anotherpaulg · 2025-01-28T16:22:49 1738081369

> 99% of the code in this PR [for llama.cpp] is written by DeekSeek-R1

It's definitely possible for AI to do a large fraction of your coding, and for it to contribute significantly to "improving itself". As an example, aider currently writes about 70% of the new code in each of its releases.

I automatically track and share this stat as graph [0] with aider's release notes.

Before Sonnet, most releases were less than 20% AI generated code. With Sonnet, that jumped to >50%. For the last few months, about 70% of the new code in each release is written by aider. The record is 82%.

Folks often ask which models I use to code aider, so I automatically publish those stats too [1]. I've been shifting more and more of my coding from Sonnet to DeepSeek V3 in recent weeks. I've been experimenting with R1, but the recent API outages have made that difficult.

[0] https://aider.chat/HISTORY.html

[1] https://aider.chat/docs/faq.html#what-llms-do-you-use-to-bui...

joshstrange · 2025-01-28T16:45:33 1738082733

First off I want to thank you for Aider. I’ve had so much fun playing with it and using it for real work. It’s an amazing tool.

How do you determine how much was written by you vs the LLM? I assume it consists of parsing the git log and getting LoC from that or similar?

If the scripts are public could you point me at them? I’d love to run it on a recent project I did using aider.

anotherpaulg · 2025-01-28T16:52:23 1738083143

Glad to hear you’re finding aider useful!

There’s a faq entry about how these stats are computed [0]. Basically using git blame, since aider is tightly integrated with git.

The faq links to the script that computes the stats. It’s not designed to be used on any repo, but you (or aider) could adapt it.

You’re not the first to ask for these stats about your own repo, so I may generalize it at some point.

[0] https://aider.chat/docs/faq.html#how-are-the-aider-wrote-xx-...

joshstrange · 2025-01-28T18:26:29 1738088789

Thank you so much for linking me to that! I think an `aider stats`-type command would be really cool (it would be cool to calculate stats based activity since the first aider commit or all-time commits of the repo).

prometheon1 · 2025-01-29T13:33:24 1738157604

Slightly longer than `aider stats` but here you go:

  uv run --with=semver,PyYAML,tqdm https://raw.githubusercontent.com/Aider-AI/aider/refs/heads/main/scripts/blame.py

nyarlathotep_ · 2025-01-28T20:15:13 1738095313

does this mean lines/diffs otherwise untouched are considered written by Aider?

If a small change is made by an end-user to adjust an Aider result, who gets "credit"?

anotherpaulg · 2025-01-28T21:05:24 1738098324

It works like normal git blame -- it literally uses git blame.

Whoever changed a line last gets credit. Only the new or newly changed lines in each release are considered.

So no, "lines/diffs otherwise untouched" are NOT considered written by aider. That wouldn't make sense?

yoyohello13 · 2025-01-28T19:30:32 1738092632

Maybe this is answered, but I didn't see it. How does aider deal with secrets in a git repo? Like if I have passwords in a `.env`?

Edit: I think I see. It only adds files you specify.

FeepingCreature · 2025-01-28T19:34:37 1738092877

Aider has a command to add files to the prompt. For files that are not added, it uses tree-sitter to extract a high-level summary. So for a `.env`, it will mention to the LLM the fact that the file exists, but not what is in it. If the model thinks it needs to see that file, it can request it, at which point you receive a prompt asking whether it's okay to make that file available.

It's a very slick workflow.

anotherpaulg · 2025-01-28T19:40:12 1738093212

You can use an .aiderignore file to ensure aider doesn't use certain files/dirs/etc. It conforms to the .gitignore spec.

almostgotcaught · 2025-01-28T18:27:10 1738088830

> 99% of the code in this PR [for llama.cpp] is written by DeekSeek-R1

you're assuming the PR will land:

> Small thing to note here, for this q6_K_q8_K, it is very difficult to get the correct result. To make it works, I asked deepseek to invent a new approach without giving it prior examples. That's why the structure of this function is different from the rest.

This certainly wouldn't fly in my org (even with test coverage/passes).

Jimmc414 · 2025-01-28T18:49:53 1738090193

>> Small thing to note here, for this q6_K_q8_K, it is very difficult to get the correct result. To make it works, I asked deepseek to invent a new approach without giving it prior examples. That's why the structure of this function is different from the rest.

> This certainly wouldn't fly in my org (even with test coverage/passes).

To be fair, this seems expected. A distilled model might struggle more with aggressive quantization (like q6) since you're stacking two forms of quality loss: the distillation loss and the quantization loss. I think the answer would be to just use the higher cost full precision model.

Philpax · 2025-01-29T02:06:24 1738116384

llama.cpp optimises for hackability, not necessarily maintainability or cleanliness. You can look around the repository to get a feel for what I mean.

almostgotcaught · 2025-01-29T02:37:03 1738118223

i guess that means no one should use it for anything serious? good to know

Philpax · 2025-01-29T02:43:59 1738118639

To some extent, yes. I would not run production off of it, even if it can eek out performance gains on hardware at hand. I'd suggest vLLM or TGI or something similar instead.

fsndz · 2025-01-28T21:34:39 1738100079

I think the secret of DeepSeek is basically using RL to train a model that will generate high quality synthetic data. You then use the synthetic dataset to fine-tune a pretrained model and the result is just amazing: https://open.substack.com/pub/transitions/p/the-laymans-intr...

brianstrimp · 2025-01-29T07:08:40 1738134520

> It's definitely possible for AI to do a large fraction of your coding, and for it to contribute significantly to "improving itself". As an example, aider currently writes about 70% of the new code in each of its releases.

That number itself is not saying much.

Let's say I have an academic article written in Word (yeah, I hear some fields do it like that). I get feedback, change 5 sentences, save the file. Then 20k of the new file differ from the old file. But the change I did was only 30 words, so maybe 200 bytes. Does that mean that Word wrote 99% of that update? Hardly.

Or in C: I write a few functions in which my old-school IDE did the indentation and automatic insertion of closing curly braces. Would I say that the IDE wrote part of the code?

Of course the AI supplied code is more than my two examples, but claiming that some tool wrote 70% "of the code" suggests a linear utility of the code which is just not representing reality very well.

anotherpaulg · 2025-01-29T15:57:21 1738166241

Every metric has limitations, but git blame line counts seem pretty uncontroversial.

Typical aider changes are not like autocompleting braces or reformatting code. You tell aider what to do in natural language, like a pair programmer. It then modifies one or more files to accomplish that task.

Here's a recent small aider commit, for flavor.

  -# load these from aider/resources/model-settings.yml
  -# use the proper packaging way to locate that file
  -# ai!
  +import importlib.resources
  +
  +# Load model settings from package resource
  MODEL_SETTINGS = []
  +with importlib.resources.open_text("aider.resources", "model-settings.yml") as f:
  +    model_settings_list = yaml.safe_load(f)
  +    for model_settings_dict in model_settings_list:
  +        MODEL_SETTINGS.append(ModelSettings(**model_settings_dict))

https://github.com/Aider-AI/aider/commit/5095a9e1c3f82303f0b...

brianstrimp · 2025-01-29T20:05:49 1738181149

Point is that not all lines are equal. The 30% that the tool didn't make are the hard stuff. Not just in line count. Once an approach or an architecture or a design are clear then implementing is merely manual labor. Progress is not linear.

You shouldn't judge your sw eng employees by lines of code either. Those that think the hard stuff often don't have that many lines of code checked in. But it's those people that are the key to your success.

stavros · 2025-01-29T08:31:28 1738139488

That's pretty reaching though if you're comparing an AI to a formatter. Presumably 70% of a new Aider release isn't formatting.

simonw · 2025-01-29T14:18:30 1738160310

"The stats are computed by doing something like git blame on the repo, and counting up who wrote all the new lines of code in each release. Only lines in source code files are counted, not documentation or prompt files."

reitzensteinm · 2025-01-28T18:16:07 1738088167

R1 is available on both together.ai and fireworks.ai, it should be a drop in replacement using the OpenAI API.

SkyPuncher · 2025-01-28T19:03:18 1738090998

The problem is it's very expensive. More expensive than Claude.

7thpower · 2025-01-28T19:37:33 1738093053

You can use the distilled version on Groq for free for the time being. Groq is amazing but frequently has capacity issues or other random bugs.

Perhaps you could set up Groq as your primary and then fail back to fireworks, etc by using litellm or another proxy.

dzhiurgis · 2025-01-28T20:15:17 1738095317

Do you know any assistants for jetbrains that can plug into groq+deepseek?

7thpower · 2025-01-28T20:54:23 1738097663

I do not as I'm not in the ecosystem, but groq is openai compliant, so any tool that is openai compliant (99% are) and lets you put in your own baseurl should work.

For example, many tools will let you use local llms. Instead of putting in the url to the local llm, you would just plug in the groq url and key.

see: https://console.groq.com/docs/openai

manmal · 2025-01-28T22:05:19 1738101919

Continue.dev is available for Jetbrains, though the plugin is not as good as the VSCode counterpart. You can plug in any openai compatible API. Under experimental settings, you can also define an applyCode model (and others) which you could set to a faster, cheaper one (eg Sonnet).

htrp · 2025-01-28T19:03:49 1738091029

Run your deepseek R1 model on your own hardware.

girvo · 2025-01-28T21:17:50 1738099070

Only various distillations are available for most people’s hardware, and they’re quite obviously not as good as actual R1 in my testing.

sampo · 2025-01-28T22:45:46 1738104346

"$6,000 computer to run Deepseek R1 670B Q8 locally at 6-8 tokens/sec"

https://reddit.com/r/LocalLLaMA/comments/1ic8cjf/6000_comput...

maeil · 2025-01-29T06:29:52 1738132192

> I've been shifting more and more of my coding from Sonnet to DeepSeek V3 in recent weeks.

For what purpose, considering Sonnet 3.5 still outperforms V3 on your own benchmarks (which also tracks with my personal experience comparing them)?

hammock · 2025-01-28T19:26:44 1738092404

That's amazing data. How representative do you think your Aider data is of all coding done?

carpo · 2025-01-28T22:49:22 1738104562

aider looks amazing - I'm going to give it a try soon. Just had a question on API costs to see if i can afford it. Your FAQ says you used about 850k tokens for Claude, and their API pricing says output tokens are $15/MTok. Does that mean it cost you under $15 for your Claude 3.5 usage or am I totally off-base? (Sorry if this is has an obvious answer ... I don't know much about LLM API pricing.)

simonw · 2025-01-28T22:54:46 1738104886

I built a calculator for that here: https://tools.simonwillison.net/llm-prices

It says that for 850,000 Claude 3.5 output tokens the cost would be $12.75.

But... it's not 100% clear from me if the Aider FAQ numbers are for input or output tokens.

anotherpaulg · 2025-01-28T22:58:00 1738105080

It's "total" tokens, input plus output. I'd guess more than two-thirds of them are input tokens.

simonw · 2025-01-28T23:03:29 1738105409

If we guess 500,000 for input and 350,000 for output that's a grand total of $6.75. This stuff is so cheap these days!

anotherpaulg · 2025-01-28T22:57:37 1738105057

When I was mostly just using Sonnet I was spending ~$100/month on their API. That included some amount of bulk API use for benchmarking, not just my interactive AI coding.

jsnell · 2025-01-29T06:42:09 1738132929

If you're concerned about API costs, the experimental Gemini models with API keys from API studio tend to have very generous free quota. The quality of e.g. Flash 2.0 Experimental is definitely good enough to try out Aider and see if the workflow clicks. (For me, the quality has been good enough that I just stuck with it, and didn't get around to experimenting with any of the paid models yet.)

nprateem · 2025-01-28T19:17:13 1738091833

> As an example, aider currently writes about 70% of the new code in each of its releases.

Yeah but part of that is because it's physically impossible to stop it making random edits for the sake of it.

Imanari · 2025-01-28T17:49:07 1738086547

Love aider, thank you for your work! Out of curiousity, what are your future plans and ideas for aider in terms of features and workflow?

realo · 2025-01-28T17:05:09 1738083909

Hello...

Is it possible to use aider with a local model running in LMStudio (or ollama)?

From a quick glance i did not see an obvious way to do that...

Hopefully i am totally wrong!

simonw · 2025-01-28T17:07:56 1738084076

https://aider.chat/docs/llms/ollama.html

anotherpaulg · 2025-01-28T17:07:57 1738084077

Thanks for your interest in aider.

Yes, absolutely you can work with local models. Here are the docs for working with lmstudio and ollama:

https://aider.chat/docs/llms/lm-studio.html

https://aider.chat/docs/llms/ollama.html

leetharris · 2025-01-28T17:13:38 1738084418

Yes absolutely

In the left bar there's a "connecting to LLMs" section

Check out ollama as an example

m3kw9 · 2025-01-28T17:11:54 1738084314

Yes and is easy

fragmede · 2025-01-28T19:13:58 1738091638

yeah:

    aider --model ollama_chat/deepseek-r1:32b

(or whatever)

sureglymop · 2025-01-28T21:01:48 1738098108

This didn't work well for me, no changes are ever made but maybe it's because I'm just using the 14B model.

manmal · 2025-01-28T22:07:56 1738102076

In case you are on a 32+GB Mac, you could try deepseek-r1-distill-qwen-32b-mlx in LM Studio. It’s just barely usable speed-wise, but gives useful results most of the time.

rahimnathwani · 2025-01-28T21:19:08 1738099148

When a log line contains {main_model, weak_model, editor_model} does the existence of main_model mean that mean the person was using Aider in Architect/Editor mode?

Do you usually use that mode and, if so, with which architect?

Thank you!

wvlia5 · 2025-01-29T04:30:14 1738125014

Can you make a plot like HISTORY but with axis changed? X: date Y: work leverage (i.e. 50%=2x, 90%=10x, 95%=20x, leverage = 1/(1-pct) )

aledalgrande · 2025-01-28T22:50:32 1738104632

Could you share how you track AI vs human LoC?

simonw · 2025-01-28T22:52:30 1738104750

That's covered here, including a link to the script: https://aider.chat/docs/faq.html#how-are-the-aider-wrote-xx-...

simonw · 2025-01-28T15:22:51 1738077771

Given these initial results, I'm now experimenting with running DeepSeek-R1-Distill-Qwen-32B for some coding tasks on my laptop via Ollama - their version of that needs about 20GB of RAM on my M2. https://www.ollama.com/library/deepseek-r1:32b

It's impressive!

I'm finding myself running it against a few hundred lines of code mainly to read its chain of thought - it's good for things like refactoring where it will think through everything that needs to be updated.

Even if the code it writes has mistakes, the thinking helps spot bits of the code I may have otherwise forgotten to look at.

lacedeconstruct · 2025-01-28T15:39:08 1738078748

The chain of thought is incredibly useful, I almost dont care about the answer now I just follow what I think is interesting from the way it broke the problem down, I tend to get tunnel vision when working for a long time on something so its a great way to revise my work and make sure I am not misunderstanding something

rtsil · 2025-01-28T21:52:33 1738101153

Yesterday, I had it think for 194 seconds. At some point near the end, it said "This is getting frustrating!"

bronco21016 · 2025-01-28T23:24:11 1738106651

I must not be hunting the right keywords but I was trying to figure this out earlier. How do you set how much time it “thinks”? If you let it run too long does the context window fill and it’s unable to do anymore?

throwup238 · 2025-01-29T00:35:25 1738110925

It looks like their API is OpenAI compatible but their docs say that they don’t support the `reasoning_effort` parameter yet.

> max_tokens：The maximum length of the final response after the CoT output is completed, defaulting to 4K, with a maximum of 8K. Note that the CoT output can reach up to 32K tokens, and the parameter to control the CoT length (reasoning_effort) will be available soon. [1]

[1] https://api-docs.deepseek.com/guides/reasoning_model

miohtama · 2025-01-28T15:28:39 1738078119

Also even if the answer is incorrect, you can still cook the eggs on the laptop :)

lawlessone · 2025-01-28T15:35:16 1738078516

i spent a months salary on these eggs and can no longer afford to cook them :(

the_arun · 2025-01-28T15:48:02 1738079282

Hey, where are you getting the eggs? I am unable to find them in the market.

qingcharles · 2025-01-28T21:19:26 1738099166

Sold my GPU, bought chickens.

belter · 2025-01-28T16:03:58 1738080238

The eggs cost more than the laptop...

brandall10 · 2025-01-28T21:29:35 1738099775

If you have a bit more memory, use the 6 bit quant, takes up about 26gb and has been shown to be very minimally lossy as opposed to 4bit.

Also serve it as MLX from LMStudio, will speed things up 30% or so so your 6bit will have similar perf to the 4bit.

Getting about 12-13 tok/sec on my M3 Max 48gb.

thomasskis · 2025-01-29T11:27:31 1738150051

EXO is also great for running the 6bit deepseek, plus it’s super handy to serve from all your devices simultaneously. If your dev team all has M3 Max 48gb machines, sharing the compute lets you all run bigger models and your tools can point at your local API endpoint to keep configs simple.

Our enterprise internal IT has a low friction way to request a Mac Studio (192GB) for our team and it’s a wonderful central EXO endpoint. (Life saver when we’re generally GPU poor)

matwood · 2025-01-29T06:39:28 1738132768

Can you link to the model you’re talking about? I can’t find the exact one using your description. Thanks!

evrenesat · 2025-01-29T11:31:37 1738150297

https://huggingface.co/mlx-community/DeepSeek-R1-Distill-Qwe...

mike31fr · 2025-01-28T20:03:20 1738094600

Noob question (I only learned how to use ollama a few days ago): what is the easiest way to run this DeepSeek-R1-Distill-Qwen-32B model that is not listed on ollama (or any other non-listed model) on my computer ?

codingdave · 2025-01-28T20:19:24 1738095564

If you are specifically running it for coding, I'm satisfied with using it via continue.dev in VS Code. You can download a bunch of models with ollama, configure them into continue, and then there is a drop-down to switch models. I find myself swapping to smaller models for syntax reminders, and larger models for beefier questions.

I only use it for chatting about the code - while this setup also lets the AI edit your code, I don't find the code good enough to risk it. I get more value from reading the thought process, evaluating it, and the cherry picking which bits of its code I really want.

In any case, if that sounds like the experience you want and you already run ollama, you would just need to install the continue.dev VS Code extension, and then go to its settings to configure which models you want in the drop-down.

rahimnathwani · 2025-01-28T21:21:20 1738099280

This model is listed on ollama. The 20GB one is this one: https://ollama.com/library/deepseek-r1:32b-qwen-distill-q4_K...

mike31fr · 2025-01-28T22:19:58 1738102798

Ok, the "View all" option in the dropdown is what I missed! Thanks!

simonw · 2025-01-28T20:40:50 1738096850

Search for a GGUF on Hugging Face and look for a "use this model" menu, then click the Ollama option and it should give you something to copy and paste that looks like this:

  ollama run hf.co/MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF:IQ1_M

mike31fr · 2025-01-28T22:20:52 1738102852

Got it, thank you!

nyrikki · 2025-01-28T20:07:54 1738094874

   ollama run deepseek-r1:32b

They dropped the Qwen/Llama terms from the string

https://ollama.com/library/deepseek-r1

rahimnathwani · 2025-01-28T21:24:40 1738099480

Whenever they have an alias like this, they usually (always?) have a model with the same checksum but a more descriptive name, e.g. the checksum 38056bbcbb2d corresponds with both of these:

https://ollama.com/library/deepseek-r1:32b

https://ollama.com/library/deepseek-r1:32b-qwen-distill-q4_K...

I prefer to use the longer name, so I know which model I'm running. In this particular case, it's confusing that they grouped the qwen and llama fine tunes with R1, because they're not R1.

marpstar · 2025-01-28T20:07:24 1738094844

I'm using it inside of LM Studio (https://lmstudio.ai), which has a "Discovery" tab where you can download models.

blakesterz · 2025-01-28T15:25:45 1738077945

Is DeepSeek really that big of a deal that everyone else should worry?

m11a · 2025-01-28T15:30:07 1738078207

A lot of the niceness about DeepSeek-R1's usage in coding is that you can see the thought process, which (IME) has been more useful than the final answer.

It may well be that o1's chain of thought reasoning trace is also quite good. But they hide it as a trade secret and supposedly ban users for trying to access it, so it's hard to know.

m11a · 2025-01-28T15:43:20 1738079000

One example from today: I had a coding bug which I asked R1 about. The final answer wasn't correct, but adapting an idea from the CoT trace helped me fix the bug. o1's answer was also incorrect.

Interestingly though, R1 struggled in part because it needed the value of some parameters I didn't provide, and instead it made an incorrect assumption about its value. This was apparent in the CoT trace, but the model didn't mention this in its final answer. If I wasn't able to see the trace, I'd not know what was lacking in my prompt, and how to make the model do better.

I presume OpenAI kept their traces a secret to prevent their competitors from training models with it, but IMO they strategically err'd in doing so. If o1's traces were public, I think the hype around DS-R1 would be relatively less (and maybe more limited to the lower training costs and the MIT license, and not so much its performance and usefulness.)

gwd · 2025-01-29T15:06:48 1738163208

> I presume OpenAI kept their traces a secret to prevent their competitors from training models with it

At some point there was a paper they'd written about it, and IIRC the logic presented was like this:

- We (the OpenAI safety people) want to be able to have insight into what o1 is actually thinking, not a self-censored "people are watching me" version of its thinking.

- o1 knows all kinds of potentially harmful information, like how to make bombs, how to cook meth, how to manipulate someone, etc, which could "cause harm" if seen by an end-user

So the options as they saw it were:

1. RLHF both the internal thinking and the final output. In this case the thought process would avoid saying things that might "cause harm", and so could be shown to the user. But they would have a less clear picture of what the LLM was "actually" thinking, and the potential state space of exploration would be limited due to the self-censorship.

2. Only RLHF the final output. In this case, they can have a clearer picture into what the LLM is "actually" thinking (and the LLM could potentially explore the state space more fully without risking about causing harm), but thought process could internally mention things which they don't want the user to see.

OpenAI went with #2. Not sure what DeepSeek has done -- whether they have RLHF'd the CoT as well, or just not worried as much about it.

manmal · 2025-01-28T15:59:52 1738079992

Do you use Continue.dev or similar tools to load code into the context, or do you copypaste into their web chat?

d3nj4l · 2025-01-28T16:34:36 1738082076

I have a lot of fun just posting a function into R1, saying "Improve this" and reading the chain of thought. Lots of insight in there that I would usually miss or glance over.

emporas · 2025-01-28T23:31:33 1738107093

I tried a month back o1 and Qwen with chain of thought QwQ, to explain to me some chemical reactions, QwQ got it correct, and o1 got it wrong.

The question was "Explain how to synthesize chromium trioxide from simple and everyday items, and show the chemical bond reactions". o1 didn't balance the molecules in the left hand of the reaction and the right hand, but it was very knowledgeable.

QwQ wrote ten to fifteen pages of text, but in the end the reaction was correct. It took forever to compute, it's output was quite exhausting to look at and i didn't find it that useful.

Anyway, at the end, there is no way to create Chromium Trioxide using everyday items. I thought maybe i could mix some toothpaste and soap and get it.

fibers · 2025-01-28T15:48:18 1738079298

how many reported cases of banning are there? that sounds insane for asking it to print out its chain of thought

satvikpendem · 2025-01-28T19:08:06 1738091286

This is generally how I use LLMs anyway, as brainstorming tools, rather than writing code.

horsawlarway · 2025-01-28T16:13:56 1738080836

I would say worry? Yes. Panic? No.

It's... good. Even the qwen/llama distills are good. I've been running the Llama-70b-distill and it's good enough that it mostly replaces my chatgpt plus plan (not pro - plus).

I think if anything - One of my big takeaways is that OpenAI shot themselves in the foot, big time, by not exposing the COT for the O1 Pro models. I find the <think></think> section of the DeepSeek models to often be more helpful than the actual answer.

For work that's treating the AI as collaborative rather than "employee replacement" the COT output is really valuable. It was a bad move for them to completely hide it from users, especially because they make the user sit there waiting while it generates anyways.

pavitheran · 2025-01-28T15:42:59 1738078979

Deepseek is a big deal but we should be happy not worried that our tools are improving.

bbzealot · 2025-01-28T16:09:26 1738080566

Why though?

I'm worried these technologies may take my job away and make the balance between capital and labor even more uneven.

Why should I be happy?

bubbleRefuge · 2025-01-28T18:59:26 1738090766

Think the marginal cost of developing complex software goes down thereby making it affordable to a greater market. There will still be a need for skilled software engineers to understand domains, limitations of AI, and how to harness and curate AI to develop custom apps. Maybe software engineering for the masses. Local small businesses can now maybe afford to take on custom software projects that were before unthinkable.

the_af · 2025-01-28T19:08:23 1738091303

> There will still be a need for skilled software engineers to understand domains, limitations of AI, and how to harness and curate AI to develop custom apps.

But will there be a need for fewer engineers, though? That's the question. And the competition for those who remain employed would be fierce, way worse than today.

Or so I fear. I hope I'm wrong.

andrei_says_ · 2025-01-28T19:30:41 1738092641

I think it might be useful to look at this as multiple forces to play.

One force is a multiplier of a software engineer’s productivity.

Another force is the pressure of the expectation for constant, unlimited increase in profits. This pressure force the CEOs and managers to look for cheaper alternatives to expensive software engineers, ultimately to eliminate the position and expense. The lie that this is a possibility draws huge investments.

And another force is the infinite number of applications of software, especially well designed, truly useful, software.

the_af · 2025-01-28T20:12:45 1738095165

Yes, these are good considerations.

I'd be a hypocrite if I didn't admit I use AI daily in my job, and it's indeed a multiplier of my productivity. The tech is really cool and getting better.

I also understand AI is one step closer for the everyday Jane or Joe Doe to do cool and useful stuff which was out of reach before.

What worries me is the capitalist, business-side forces at play, and what they will mean for my job security. Is it selfish? You bet! But if I don't advocate for me, who will?

btilly · 2025-01-28T19:25:36 1738092336

Jevon's Paradox says that you're probably wrong. But I'm worried about the same thing. The moat around human superiority is shrinking fast. And when it's gone, we may get more software, but will we need humans involved?

svara · 2025-01-28T20:49:41 1738097381

AI doesn't have needs any desires, humans do. And no matter how hyped one might be about AI, we're far away from creating an artificial human. As long as that's true, AI is a tool to make humans more effective.

btilly · 2025-01-29T05:34:46 1738128886

AI may not have desires, but corporations do. And control more resources than humans.

Making corporations more effective is not always in the interest of humans.

svara · 2025-01-29T07:02:24 1738134144

That's fair, but the question was whether AI would destroy or create jobs.

You might speculate about a one-person megacorp where everything is done by AIs that a single person runs.

What I'm saying is that we're very far from this, because the AI is not a human that can make the CEO's needs and desires their own and execute on them independently.

Humans are good at being humans because they've learned to play a complex game, which is to pursue one's needs and desires in a partially adversarial social environment.

This is not at all what AI today is being trained for.

Maybe a different way to look at it, as a sort of intuition pump: If you were that one man company, and you had an AGI that will correctly answer any unambiguously stated question you could ask, at what point would you need to start hiring?

menaerus · 2025-01-29T09:42:37 1738143757

You're taking your opinion to extreme because I don't think anyone is talking about replacing all engineers with a single AI computer doing the work for a one-person mega-corporation.

The actual question, which is much more realistic, is if an average company of, let'say, 50 engineers will still have a need to hire those 50 engineers if AI turns out to be such an efficiency multiplier?

In that case, you will no longer need 10 people to complete 10 tasks in given time-unit but perhaps only 1 engineer + AI compute to do the same. Not all businesses can continue scaling forever, so it's pretty expected that those 9 engineers will become redundant.

svara · 2025-01-29T10:02:55 1738144975

You took me too literally there, that was intended as a thought experiment to explore the limits.

What I was getting at was the question: If we feel intuitively that this extreme isn't realistic, what exactly do we think is missing?

My argument is, what's missing is the human ability to play the game of being human, pursuing goals in an adversarial social context.

To your point more specifically: Yes, that 10-person team might be replaceable by a single person.

More likely than not however, the size of the team was not constrained by lack of ideas or ambition, but by capital and organizational effectiveness.

This is how it's played out with every single technology so far that has increased human productivity. They increase demand for labor.

Put another way: Businesses in every industry will be able to hire software engineering teams that are so good that in the past, only the big names were able to afford them. The kind of team required for the digital transformation of every old fashioned industry.

menaerus · 2025-01-29T11:04:14 1738148654

In my 10-person team example, what in your opinion would the company with the rest of the 9 people do once the AI proves its value in that team?

Your hypothesis is AFAIU is that the company will just continue to scale because there's an indefinite amount of work/ideas to be explored/done so the focus of those 9 people will just be shifted to some other topic?

Let's say I am a business owner I have a popular product with a backlog of 1000 bugs and I have a team of 10 engineers. Engineers are busy both juggling between the features and fixing the bugs at the same time. Now let's assume that we have an AI model that will relieve 9 out of 10 engineers from cleaning the bugs backlog and we will need 1 or 2 engineers reviewing the code that the AI model spits out for us.

What concrete type of work at this moment is left for the rest of the 9 engineers?

Assuming that the team, as you say, is not constrained by the lack of ideas or ambition, and the feature backlog is somewhat indefinite in that regard, I think that the real question is if there's a market for those ideas. If there's no market for those ideas then there's no business value $$$ created by those engineers.

In that case, they are becoming a plain cost so what is the business incentive to keep them then?

> Businesses in every industry will be able to hire software engineering teams that are so good that in the past, only the big names were able to afford them

Not sure I follow this example. Companies will still hire engineers but IMO at much less capacity than what it was required up until now. Your N SQL experts are now replaced by the model. Your M Python developers are now replaced by the model. Your engineer/PR-review is now replaced by the model. The heck, even your SIMD expert now seems to be replaced by the model too (https://github.com/ggerganov/llama.cpp/pull/11453/files). Those companies will no longer need M + N + ... engineers to create the business value.

svara · 2025-01-29T12:27:40 1738153660

> Your hypothesis is AFAIU is that the company will just continue to scale because there's an indefinite amount of work/ideas to be explored/done so the focus of those 9 people will just be shifted to some other topic?

Yes, that's what I'm saying, except that this would hold over an economy as a whole rather than within every single business.

Some teams may shrink. Across industry as a whole, that is unlikely to happen.

The reason I'm confident about this is that this exact discussion has happened many times before in many different industries, but the demand for labor across the economy as a whole has only grown. (1)

"This time it's different" because the productivity tech in question is AI? That gets us back to my original point about people confusing AI with an artificial human. We don't have artificial humans, we have tools to make real humans more effective.

(1) The point seems related to this https://en.wikipedia.org/wiki/Lump_of_labour_fallacy

menaerus · 2025-01-29T14:46:14 1738161974

Hypothetically you could be right and I don't know if "this time will be different" nor am I trying to predict what will happen on the global economic scale. That's out of my reach.

My question is rather of much narrower scope and much more concrete and tangible - and yet I haven't been able to find any good answer for it, or strong counter-arguments if you will. If I had to guess something about it then my prediction would be that many engineers will need to readjust their skills or even requalify for some other type of work.

btilly · 2025-01-29T21:38:41 1738186721

Automation improved life in the Industrial Revolution because it displaced people from spinning and weaving into higher value add professions.

What higher value add professions will humans be displaced into by AI?

strogonoff · 2025-01-29T07:07:51 1738134471

It should be obvious that technology exists for the sake of humans, not the other way around, but I have already seen an argument for firing humans in favour of LLMs since the latter emit less pollution.

LLMs do not have desires, but their existence alters desires of humans, including the ones in charge of businesses.

boothby · 2025-01-29T01:50:14 1738115414

> AI doesn't have needs any desires, humans do.

I fear that this won't age well. But to shamelessly riff on Marx, those who control the means of computation will control society.

svara · 2025-01-29T07:07:21 1738134441

I agree the latter part is a risk to consider, but I really think getting an AI to replace human jobs on a vast scale will take much more than just training a bit more.

You need to train on a fundamentally different task, which is to be good at the adversarial game of pursuing one's needs and desires in a social environment.

And that doesn't yet take into account that the interface to our lives is largely physical, we need bodies.

I'm seeing us on track to AGI in the sense of building a universal question answering machine, a system that will be able to answer any unambiguously stated question if given enough time and energy.

Stating questions unambiguously gets pretty difficult fast even where it's possible, often it isn't even possible, and getting those answers is just a small part of being a successful human.

PS: Needs and desires are totally orthogonal to AI/AGI. Every animal has them, but many animals don't have high intelligence. Needs and desires are a consequence of our evolutionary history, not our intelligence. AGI does not need to mean an artificial human. Whether to pursue or not pursue that research program is up to us, it's not inevitable.

boothby · 2025-01-29T15:09:14 1738163354

Honestly, I wasn't even talking about jobs with that. I worry about an intelligent IOT controlled by authoritarian governments or corporate interests. Our phones have already turned society into a panopticon, and that will can get much worse when AGI lands.

But yes, the job thing is concerning as well. AI won't scrub a toilet, but it will cheaply and inexhaustibly do every job that humans find meaningful today. It seems that we're heading inexorably towards dystopia.

svara · 2025-01-29T15:53:16 1738165996

> AI won't scrub a toilet, but it will cheaply and inexhaustibly do every job that humans find meaningful today

That's the part I really don't believe. I'm open to being wrong about this, the risk is probably large enough to warrant considering it even if the probability of this happening is low, but I do think it's quite low.

We don't actually have to build artificial humans. It's very difficult and very far away. It's a research program that is related to but not identical to the research program leading to tools that have intelligence as a feature.

We should be, and in fact we are, building tools. I'm convinced that the mental model many people here and elsewhere are applying is essentially "AGI = artificial human", simply because the human is the only kind of thing in the world that we know that appears to have general intelligence.

But that mental model is flawed. We'll be putting intelligence in all sorts of places that are not similar to a human at all, without those devices competing with us at being human.

boothby · 2025-01-29T20:58:32 1738184312

To be clear, I'm much more concerned about the rise of techo-authoritarianism than employment.

And further ahead, where I said your original take might not age well; I'm also not worried about AI making humanoid bodies. I'd be worried about a future where mines, factories, and logistics are fully automated: an AI for whom we've constructed a body which is effectively the entire planet.

And nobody needs to set out to build that. We just need to build tools. And then, one day, an AGI writes a virus and hacks the all-too-networked and all-too-insecure planet.

svara · 2025-01-30T19:16:35 1738264595

Yes, okay.

I think we're talking about different time scales - I'm talking about the next few, maybe two or three decades, essential the future of our generation specifically. I don't think what you're describing is relevant on that time scale, and possibly you don't either.

I'd add though that I feel like your dystopian scenario probably reduces to a Marxist dystopia where a big monopolist controls everything.

In other words, I'm not sure whether that Earth-spanning autonomous system really needs to be an AI or requires the development of AI or fancy new technology in general.

In practice, monopolies like that haven't emerged due to competition and regulation, and there isn't a good reason to assume it would be different with AI either.

In other words, the enemies of that autonomous system would have very fancy tech available to fight it, too.

boothby · 2025-01-30T20:17:42 1738268262

I'm not fussy about who's in control. Be it global or national; corporate or governmental; communist or fascist. But technology progresses more or less uniformly across the globe and systems are increasingly interconnected. An AGI, or even a poor simulacrum cobbled together from LLMs with internet access, can eventually hack anything that isn't airgapped. Even if it doesn't have "thoughts" or "wants" or "needs" in some philosophical sense, the result can still be an all-consuming paperclip maximizer (but GPUs, not paperclips). And every software tool and every networked automated system we make can be used by such a "mind."

And while I want to agree that we won't see this happen in the next 3 decades, networked automated cars have already been deployed on the street of several cities and people are eagerly integrating LLMs into what seems to be any project that needs funding.

svara · 2025-01-30T21:11:18 1738271478

It's tempting to speculate about what might happen in the very long run. And different from the jobs question, I don't really have strong opinions on this.

But it seems to me like you might not be sufficiently taking into account that this is an adversarial game; i.e. it's not sufficient for something just to replicate, it needs to also out-compete everything else decisively.

It's not clear at all to me why an AI controlled by humans, to the benefit of humans, would be at a disadvantage to an AI working against our benefit.

boothby · 2025-01-30T21:47:15 1738273635

Agreed on all but one detail. Not to put too fine a point on it, but I do believe that the more emergent concern is AI controlled by a small number of humans, working against the benefit of the rest of humanity.

the_af · 2025-01-30T12:50:48 1738241448

> I'm also not worried about AI making humanoid bodies. I'd be worried about a future where mines, factories, and logistics are fully automated: an AI for whom we've constructed a body which is effectively the entire planet.

I know scifi is not authoritative, and no more than human fears made into fiction, but have you read Philip K. Dick's short story "Autofac"?

It's exactly what you describe. The AI he describes isn't evil, nor does it seek our extinction. It actually wants our well-being! It's just that it's taken over all of the planet's resources and insists in producing and making everything for us, so that humans have nothing left to do. And they cannot break the cycle, because the AI is programmed to only transition power back to humans "when they can replicate Autofac output", which of course they cannot, because all the raw resources are hoarded by the AI, which is vastly more efficient!

boothby · 2025-01-30T16:43:05 1738255385

I think that science fiction plays an important role in discourse. Science fiction authors dedicate years deeply contemplating potential future consequences of technology, and packaging such into compelling stories. This gives us a shorthand for talking about positive outcomes we want to see, and negative outcomes that we want to avoid. People who argue against scifi with a dismissal that "it's just fiction" aren't participating in good faith.

On the other hand, it's important not to pay too close attention to the details of scifi. I find myself writing a novel, and I'm definitely making decisions in support of a narrative arc. Having written the comment above... that planetary factory may very well become the third faction I need for a proper space opera. I'll have to avoid that PKD story for the moment, I don't want the influence.

Though to be clear, in this case, that potentiality arose from an examination of technological progress already underway. For example, I'd be very surprised if people aren't already training LLMs on troves of viruses, metasploit, etc. today.

the_af · 2025-01-29T13:55:59 1738158959

To be clear, I'm not arguing humans will stop being involved in software engineering completely. What I fear is that the pool of employable humans (as code reviewers, prompt engineers and high-level "solution architects") will shrink, because fewer will be needed, and that this will cause ripples in our industry and affect employment.

We know this isn't far-fetched. We have strong evidence to suspect during the big layoffs of a couple of years ago, FAANG and startups all colluded to lower engineer salaries across the board, and that their excuse ("the economy is shrinking") was flimsy at best. Now AI presents them with another powerful tool to reduce salaries even more, with a side dish of reducing the size of the cost center that is programmers and engineers.

visarga · 2025-01-29T04:46:41 1738126001

In the AI age, those who own the problems stand to own the AI benefits. Utility is in the application layer, not the hosting or development of AI models.

kelipso · 2025-01-29T07:28:54 1738135734

Can’t be sure though, there used to be way more accountants decades ago.

bubbleRefuge · 2025-01-28T20:30:33 1738096233

this is a better world. we can work a few hours a week and play tennis, golf, and argue politics with our friends and family over some good cheese and wine while the bots do the deployments.

hgomersall · 2025-01-29T07:32:37 1738135957

We're already there in terms of productivity. The problem is the inordinate number of people doing nothing useful yet extracting huge amounts. Think most of finance for example.

bubbleRefuge · 2025-01-30T18:20:19 1738261219

oh. yeah. finance is to big. they've captured the government.

the_af · 2025-01-28T21:01:53 1738098113

Assuming you retain a good paying job and are not treated like a disposable commodity. That cheese and wine is not going to be free.

bubbleRefuge · 2025-01-28T21:17:10 1738099030

as long as we keep learning and our heads in the game we will be fine. I worry much more for the non-techno savy like scrum masters. yikes.

weatherlite · 2025-01-29T09:43:03 1738143783

If it's any consolation, if indeed the extra productivity happens, and kills the number of SWE jobs I don't see why this dynamic shouldn't happen in almost all white collar job across the private sector (government sectors are pretty much protected no matter what happens). There'll be a decreasing demand for lawyers, accountants, analysts, secretaries, HR personnel, designers, marketers etc etc. Even doctors might start feeling this eventually.

bubbleRefuge · 2025-01-28T20:28:42 1738096122

no I think more engineers. especially those who can be a jack-of-all-trades. if a software project that takes normally 1 year of customer development can be done in 2 months, then that project is affordable to a wide array of business who would could never fund that kind of project before.

the_af · 2025-01-28T21:04:31 1738098271

I can see more projects being deployed by smaller businesses, that would otherwise not be able to.

But how will this translate to engineering jobs? Maybe there will be AI tools to automate most of the stuff a small business needs done. "Ah," you may say, "I will build those tools!". Ok. Maybe. How many engineers do you need for that? Will the current engineering job market shrink or expand, and how many non-trash, well paid jobs will there be?

I'm not saying I know for sure how it'll go, but I'm concerned.

dwaltrip · 2025-01-28T23:49:21 1738108161

Just had a thought, perhaps software engineers will become more like car mechanics.

the_af · 2025-01-29T13:58:53 1738159133

That's not an encouraging thought.

By the way, car mechanics (especially independent ones, your average garage mechanic) understand less and less about what's going on inside modern cars. I don't want this to happen to us.

bubbleRefuge · 2025-01-28T21:15:44 1738098944

would be similar to solution engineers today. you build solutions using ai. think about all the moving parts to building a complex business app. user experience, data storage, business logic, reporting, etc. etc. the engineer can orchestrate the ai to build the solution and validate its correctness.

the_af · 2025-01-29T13:59:44 1738159184

I fear even this role will need way fewer people, meaning the employment pool will heavily shrink, and those competing for a job will need to accept lower paychecks.

bubbleRefuge · 2025-01-29T15:47:18 1738165638

like someone said above. demand is infinite. imagine a world where the local AI/Engineer tech is a ubiquitous as the uber driver. don't think it will necessarily create smaller paychecks. hard to say. But I see demand skyrocketing for customized software that can be provided at 1/10 of today's costs.

We are far away from that though. As an enterprise software/data engineer, AI has been great in answering questions and generating tactical code for me. Hours have turned into minutes. It even motivated me to work on side projects because they take less time. You will be fine. Embrace the change. Its good for you. Will lead to personal growth.

the_af · 2025-01-29T16:52:29 1738169549

I'm not at all convinced demand is infinite, nor that this demand will result in employment. This feels like begging the question. This is precisely what I fear won't happen!

Also, I don't want to be a glorified uber driver. It's not good for me and not good for the profession.

> As an enterprise software/data engineer, AI has been great in answering questions and generating tactical code for me. Hours have turned into minutes.

I don't dispute this part, and it's been this way for me too. I'm talking about the future of our profession, and our job security.

> You will be fine. Embrace the change. Its good for you. Will lead to personal growth.

We're talking at cross-purposes here. I'm concerned about job security, not personal growth. This isn't about change. I've been almost three decades in this profession, I've seen change. I'm worried about this particular thing.

bubbleRefuge · 2025-01-30T18:25:01 1738261501

3 decades. me too. since 97. maybe uber driver was a bad example. what about having a work model similar to a lawyer? whereby one can specialize in creating certain types of business or personal apps at a high hourly rate ?

llm_trw · 2025-01-29T03:30:25 1738121425

Did the introduction of assemblers lead to creating more or fewer programming jobs?

the_af · 2025-01-29T13:51:47 1738158707

I get this argument, but it feels we cannot always reason by analogy. Some jumps are qualitatively different. We cannot always claim "this didn't happen before, therefore it won't happen now".

Of course assemblers didn't create fewer programming jobs, nor did compilers or high level languages. However, with "NO CODE" solutions (remember that fad?) there was an attempt at reducing the need for programmers (though not completely taking them out of the equation)... it's just that NO CODE wasn't good enough. What if AI is good enough?

Vampiero · 2025-01-29T07:07:46 1738134466

In what AI-powered world do you think that local small software businesses will survive?

simonw · 2025-01-29T14:16:09 1738160169

One where other businesses need help figuring out how to use AI for their own businesses.

It doesn't matter how "easy" technology gets to use, there will always be a market for helping other people figure out best to apply it.

joshmarlow · 2025-01-28T16:25:22 1738081522

> make the balance between capital and labor even more uneven.

I think it's interesting to note that as opens source models evolve and proliferate, the capital required for a lot of ventures goes down - which levels the playing field.

When I can talk to one agent-with-a-CAD-integration and have it design a gadget for me and ship the design off to a 3D printer and then have another agent write the code to run on the gadget, I'll be able to build entire ventures that would require VC funding and a team now.

When intellectual capital is democratized, financial capital looses just a bit of power...

breuleux · 2025-01-28T19:45:40 1738093540

What value do you bring to the venture, though? What makes your venture more likely to succeed than anybody else's, if the barrier is that low? I mean, I'll tell you: if anyone can spend $100 to design the same new gadget, the winner is going to be whoever can spend a million in production (to get economy of scale) and marketing. Currently, financial capital needs your brain, so you can leverage that. But if they can use a brain in the cloud instead, they're going to do just that. Sure, you can use it and design anything you can imagine, but nobody is going to pay you for it unless you, yourself, bring some irreplaceable value to the table.

visarga · 2025-01-29T04:49:28 1738126168

Since everyone has AI, then it stands that humans still make the difference. That is why I don't think companies will be able to automate software dev too much, they would be cutting the one advantage they could have over their competition.

breuleux · 2025-01-29T15:58:08 1738166288

It stands that humans will make the difference if they can do things that the AI cannot. The more capable the AI gets, however, the less humans will meet that threshold, and they are the ones that will lose out. Capital, on the other hand, will always make a difference.

toth · 2025-01-28T19:07:40 1738091260

I can't understand how you reach your conclusion.

At present, if you have financial capital and need intellectual capital you need to find people willing to work for you and pay them a lot of money. With enough progress in AI you can get the intellectual capital from machines instead, for a lot less. What loses value is human intellectual capital. Financial capital just gained a lot of power, it can now substitute for intellectual capital.

Sure, you could pretend this means you'll be able to launch a startup without any employees, and so will everyone. But why wouldn't Sam Altman or whomever just start AI Ycombinator with hundreds of thousands of AI "founders"? Do you really think it would be more "democratic"?

visarga · 2025-01-29T05:05:28 1738127128

> But why wouldn't Sam Altman or whomever just start AI Ycombinator with hundreds of thousands of AI "founders"? Do you really think it would be more "democratic"?

AI is useful in the same way with Linux

- can run locally

- empowers everyone

- need to bring your own problem

- need to do some of the work yourself

The moral is you need to bring your problem to benefit. The model by itself does not generate much benefits. This means AI benefits are distributed like open source ones.

toth · 2025-01-29T10:15:48 1738145748

Those points are true of current AI models, but how sure are you they will remain true as technology evolves?

Maybe you believe that they will always stay true, that there's some ineffable human quality that will never be captured by AI and value creation will always be bottle-necked by humans. That would be nice.

But even if you still need humans in the loop, it's not clear how "democratizing" this would be. It might sound great if in a few years you and everyone else can run an AI on their laptop that is as a good as a great technical co-founder that never sleeps. But note that means that someone who owns a data-center can run the equivalent of the current entire technical staff of Google, Meta, and OpenAI combined. Doesn't sound like a very level playing field.

ghxst · 2025-01-28T20:27:45 1738096065

> I'm worried these technologies may take my job away

The way I look at this is that with the release of something like deepseek the possibility of running a model offline and locally to work _for_ you while you are sleeping, doing groceries, spending time with your kids / family is coming closer to a reality.

If AI is able to replace me one day I'll be taking advantage of that way more efficiently than any of my employee(s).

esafak · 2025-01-29T01:26:27 1738113987

Why wouldn't your employer just hire fewer people to do it since you seem to have enough spare time to do lots of things besides work?

ghxst · 2025-01-29T05:27:55 1738128475

Meant to say employer(s).

jonas21 · 2025-01-28T19:56:18 1738094178

Do you feel the same way about open source software?

llm_trw · 2025-01-29T03:29:42 1738121382

The only people who think that Ai models won't result in more demand for human labour are the ones who have never used them.

CamperBob2 · 2025-01-28T16:23:12 1738081392

You won't be happy doing a robot's job either, at least not for long.

In the ideal case, we won't be dependent on the unwilling labor of other humans at all. Would you do your current job for free? If not -- if you'd rather do something else with your productive life -- then it seems irrational to defend the status quo.

One thing's for certain: ancient Marxist tropes about labor and capital don't bring any value to the table. Abandon that thinking sooner rather than later; it won't help you navigate what's coming.

cryptopian · 2025-01-28T17:14:24 1738084464

That's not historically what's happened though, is it? We've had plenty of opportunities to reduce the human workload through increased efficiency. What usually happens is people demand more - faster deliveries, more content churn; and those of us who are quite happy with what we have are either forced to adapt or get left behind while still working the same hours.

satvikpendem · 2025-01-28T19:09:06 1738091346

Jevon's paradox really does work for everything, not just in the current way people have used it this last week in terms of GPU demand. People always demand more, and thus, there is an endless amount of work to be done.

esafak · 2025-01-29T01:27:08 1738114028

If you really have enough, you can retire early.

Capricorn2481 · 2025-01-29T21:10:06 1738185006

We don't have enough because the productivity improvements are not shared with the working class. The wealth gap increases, people work the same. This is historically what has happened and it's what will happen with AI. The next generations will never have the opportunity to retire.

hooverd · 2025-01-28T16:34:24 1738082064

Because billionaires think that you are a horse and that the best course of action is to turn you into glue while they hope AGI lets them live forever.

CamperBob2 · 2025-01-28T17:58:54 1738087134

Billionaires don't think about you at all. That's what nobody seems to get.

We enjoy many luxuries unavailable even to billionaires only a few decades ago. For this trend to continue, the same thing needs to happen in other sectors that happened in (for example) the agricultural sector over the course of the 20th century: replacement of human workers by mass automation and superior organization.

breuleux · 2025-01-28T19:55:57 1738094157

In the past, human workers were displaced. The value of their labour for certain tasks became lower than what automation could achieve, but they could still find other things to do to earn a living. What people are worrying about here is what happens when the value of human labour drops to zero, full stop. If AI becomes better to us at everything, then we will do nothing, we will earn nothing, and we will have nothing that isn't gifted to us. We will have no bargaining power, so we just have to hope the rich and powerful will like us enough to share.

CamperBob2 · 2025-01-28T20:09:21 1738094961

If anything like that had actually happened in the past, you might have a point. When it comes to what happens when the value of human labor drops to zero, my guess is every bit as good as yours.

I say it will be a Good Thing. "Work" is what you call whatever you're doing when you'd rather be doing something else.

breuleux · 2025-01-28T22:17:07 1738102627

The value of our labour is what enables us to acquire things and property, with which we can live and do stuff. If your labour is valueless because robots can do anything you can do better, how do you get any of the possessions you require in order to do that something else you'd rather be doing? Capitalism won't just give them to you. If you do not own land, physical resources or robots, and you can't work, how do you get food? Charity? I'd argue there will need to be a pretty comprehensive redistribution scheme for the people at large to benefit.

nuancebydefault · 2025-01-28T20:37:39 1738096659

What we see through history is that human labour cost goes up and machine cost goes down.

Suppose you want to have your car washed. Hiring someone to do that will most likely give the best result: less physical resources used (soap, water, wear of cloth), less wear and tear on the car surface and less pollution and optionally a better result.

Still the benefit/cost equation is clearly in favor of the machine when doing the math, even when using more resources in the process.

What is lacking in our capitalist economic system is the fact of hiring people to perform services is punished by much higher taxes compared to using a machine, which is often even tax deductible. That way, the machine brings only benefits to the user of the machine (often a more wealthy person), less much to society as a whole. If only someone could find a solution to this tragedy.

Vampiero · 2025-01-29T07:10:56 1738134656

> If only someone could find a solution to this tragedy.

Well, someone earlier in the thread said to abandon Marxist thought because it's obsolete. So I don't know how to help you!

nuancebydefault · 2025-01-29T11:46:40 1738151200

I prefer to not use -ist's and -ism's. I read that Marx wrote he was not a Marxist. Surely his studies and literature got used as a frame of reference for a rather wide set of ideologies. Maybe someone with a deeper background on the topic can chime in with ideas?

CamperBob2 · 2025-01-29T16:37:53 1738168673

If only someone could find a solution to this tragedy.

We did. Save up a few bucks, nothing out of reach, and (as you suggested yourself!) you can afford to buy your own machine. Here you go: https://xcancel.com/carrigmat/status/1884244369907278106

You'd have received no such largesse from the Marxists. You're welcome.

Capricorn2481 · 2025-01-29T21:26:27 1738185987

Forgetting the offhand implication that $6,000 is not out of reach for anyone, this will do nothing. If we're really taking this to its natural conclusion, that AI will be capable of doing most jobs, companies won't care that you have an AI. They will not assign you work that can be done with AI. They have their own AI. You will not compete with any of them, and even if you find a novel way to use it that gives you the gift of income, that won't be possible for even a small fraction of the population to replicate.

You can keep shoehorning lazy political slurs into everything you post, but the reality is going to hit the working class, not privileged programmers casually dumping 6 grand so they can build their CRUD app faster.

But you're essentially arguing for Marxism in every other post on this thread, whether you realize it or not.

CamperBob2 · 2025-01-30T01:50:16 1738201816

Yeah, there's always some reason why you can't do something, I guess... or why The Man is always keeping you down, even after putting capabilities into your hands that were previously the exclusive province of mythology.

Perhaps other sites beckon.

p_j_w · 2025-01-28T19:03:58 1738091038

>Billionaires don't think about you at all.

If that were true they wouldn't be building ultra secure bunkers to escape to when the climate shit hits the fan.

inglor_cz · 2025-01-28T19:40:22 1738093222

How many of them did that? Five out of a thousand?

Anecdotally, around two people in a hundred in my proximity are preppers as well, though obviously with smaller budgets.

It is just a specific fringe way of thinking.

hnthrow90348765 · 2025-01-29T13:33:01 1738157581

This added momentum to two things: reducing AI costs and increasing quality.

I don't know when the threshold of "replace the bottom X% of developers because AI is so good" happens for businesses based on those things, but it's definitely getting closer instead of stalling out like the bubble predictors claimed. It's not a bubble if the industry is making progress like this.

weatherlite · 2025-01-29T07:53:07 1738137187

I think it's a mixed bag but if people want to be happy I'm not going to spoil the party!

flmontpetit · 2025-01-28T16:15:34 1738080934

As far as realizing the prophecy of AI as told by its proponents and investors goes, probably not. LLMs still have not magically transcended their obvious limitations.

However this has huge implications when it comes to the feasibility and spread of the technology, and further implications with regards to economy and geopolitics now that confidence in the American AI sector has been hit and people and organizations internationally have somewhere else to look for.

edit: That being said, this is the first time I've seen a LLM do a better job than even a senior expert could do, and even if it's on small scope/in a limited context, it's becoming clear that developers are going to have to adopt this tech in order to stay competitive.

buyucu · 2025-01-28T17:28:24 1738085304

There are two things. First, deepseek v3 and r1 are both amazing models.

Second, the fact that deepseek was able to pull this off with such modest resources is an indication that there is no moat, and you might wake up tomorrow and find an even better model from a company you have never heard of.

girvo · 2025-01-28T21:28:08 1738099688

Pull this off with such modest resources, including using ChatGPT itself for its RL inputs. It’s quite smart, and doesn’t disagree with your point that there is no moat per se, but without those frontier models and their outputs there is no V3, there is no R1.

buyucu · 2025-01-29T07:39:13 1738136353

this is how science works.

simonw · 2025-01-28T15:30:55 1738078255

Yeah, it is definitely a big deal.

I expect it will be a net positive: they proved that you can both train and run inference against powerful models for way less compute than people had previously expected - and they published enough details that other AI labs are already starting to replicate their results.

I think this will mean cheaper, faster, and better models.

This FAQ about it is very good: https://stratechery.com/2025/deepseek-faq/

netdevphoenix · 2025-01-28T15:46:00 1738079160

Why did DeepSeek not kept this for themselves? Is this a Meta style scorched earth strategy?

Rzor · 2025-01-28T16:05:10 1738080310

>An Yong: But DeepSeek is a business, not a nonprofit research lab. If you innovate and open-source your breakthroughs—like the MLA architecture innovation releasing in May—won’t competitors quickly copy them? Where’s your moat?

>Liang Wenfeng: In disruptive tech, closed-source moats are fleeting. Even OpenAI’s closed-source model can’t prevent others from catching up.

>Therefore, our real moat lies in our team’s growth—accumulating know-how, fostering an innovative culture. Open-sourcing and publishing papers don’t result in significant losses. For technologists, being followed is rewarding. Open-source is cultural, not just commercial. Giving back is an honor, and it attracts talent.

https://thechinaacademy.org/interview-with-deepseek-founder-...

simonw · 2025-01-28T15:54:03 1738079643

There are a bunch of theories floating round.

Personally this looks to me like an ego thing: the DeepSeek team are really, really good and their CEO is enjoying the enormous attention they are getting, plus the pride of proving that Chinese AI labs can take the lead in a field that everyone thought the USA was unassailable in.

Maybe they are true believers in building and sharing "AGI" with the world?

Lots of people see this as a Chinese government backed conspiracy to undermine the US AI industry. I'm not sure how credible that idea is.

I saw somewhere (though I've not confirmed it with a second source) that none of the people listed on the DeepSeek papers got educated at US universities - they all went to school in China, which further emphasizes how good China's home-grown talent pool has got.

kragen · 2025-01-28T20:49:00 1738097340

> a Chinese government backed conspiracy to undermine the US AI industry

To me this sounds like describing Lockheed as a US government backed conspiracy to undermine the Tupolev Aerospace Design Bureau. It really stretches the normal connotations of words, and it presupposes that the center of the world is conveniently located very close to the speaker.

rfoo · 2025-01-28T16:08:51 1738080531

> none of the people listed on the DeepSeek papers got educated at US universities

"You have been educated at foreign universities / worked at foreign companies" is indeed an excuse they have used at least once to refuse a candidate. n=1 though so maybe that's just a convenient excuse. There's one guy who went to University of Adelaide (IIRC) on the paper.

eptcyka · 2025-01-28T16:06:41 1738080401

It makes Trump look like a chump.

dluan · 2025-01-28T16:37:58 1738082278

> Chinese government backed conspiracy

Do you understand how ginormous China is and how ridiculous this kind of made up boogeyman statement sounds?

simonw · 2025-01-28T16:47:07 1738082827

AnotherGoodName · 2025-01-28T16:12:15 1738080735

It’s a bunch of known optimisations bundled together rather than any single revolutionary change.

More open than any other model (but still a bespoke licence) and bundles together a bunch of known improvements. There’s nothing to hide here honestly and without the openness it wouldn’t be as interesting.

nuancebydefault · 2025-01-28T21:23:52 1738099432

From the faq

'So are we close to AGI? It definitely seems like it. This also explains why Softbank (and whatever investors Masayoshi Son brings together) would provide the funding for OpenAI that Microsoft will not: the belief that we are reaching a takeoff point where there will in fact be real returns towards being first.'

Interesting.

startupsfail · 2025-01-28T15:43:41 1738079021

This may mean that $3k/task on some benchmarks published by OpenAI are now at slightly lower price tag.

It is possible however that OpenAI was using similar level acceleration in the first place, they’ve just not published the details. And a few engineers left and replicated (or even bested it) in a new lab.

Overall, it’s a good boost, modern software is getting a better fit into new generation of hardware and is performing faster. Maybe we should pay more attention when NVIDIA is publishing their N-times faster ToPS numbers, and not completely dismissing it as marketing.

GaggiX · 2025-01-28T15:44:33 1738079073

DeepSeek R1 is o1 but free to use, open source, and also distilled on different models, even the ones that could run on your phone so yeah.

llm_trw · 2025-01-29T03:27:01 1738121221

End result is on par with o1 preview, which is ironically more intelligent than o1, but the intermediate tokens are actually useful. I've got it running locally last night and out of 50 questions so far I've gotten the answer in the chain of thought in more than half.

steeeeeve · 2025-01-28T19:03:57 1738091037

Today it is. Tomorrow everyone will look at it like Wish or Temu.

ryao · 2025-01-29T09:48:04 1738144084

This seems relevant:

https://finance.yahoo.com/news/deepseek-temu-ai-analysts-132...

People are already looking at it like Temu.

coliveira · 2025-01-28T15:51:15 1738079475

It depends on the problem type. If your problem requires math reasoning, deepSeek response is quite impressive and surpasses what most people can do in a single session.

csomar · 2025-01-28T15:46:30 1738079190

Everyone else should rejoice. OpenAI is probably cooked, however. Nvidia might be cooked too.

snowram · 2025-01-28T15:51:36 1738079496

Is Nvidia really cooked? If this new RF tech does scale, couldn't a bigger model be made that would require more compute power for training and inference?

swfsql · 2025-01-28T20:40:04 1738096804

I read around that DeepSeek's team managed to work-around hardware limitations, and that in theory goes against the "gatekeeping" or "frontrunning" investment expectations from nvidia. If a partial chunk of investment is a bet on those expectations, that would explain a part of the stock turbulence. I think their 25x inference price reduction vs openai is what really affected everything, besides the (uncertain) training cost reduction.

BeefWellington · 2025-01-29T06:08:08 1738130888

We all use PCs and heck even phones that have thousands of times the system memory of the first PCs.

Making something work really efficiently on older hardware doesn't necessarily imply less demand. If those lessons can be taken and applied to newer generations of hardware, it would seem to make the newer hardware all the more valuable.

cool_dude85 · 2025-01-28T19:05:16 1738091116

Imagine an s-curve relating capital expenditure on compute and "performance" as the y-axis. It's possible that this does not change the upper bound of the s-curve but just shifts the performance gains way to the left. Such a scenario would wipe out a huge amount of the value of Nvidia.

treis · 2025-01-28T21:10:36 1738098636

I don't think it matters much to Nvidia so long as they're the market leader. If AI gets cheaper to compute it just changes who buys. Goes from hyperscalers to there being an AI chip in every phone, tablet, laptop, etc. still lots and lots of money to be made.

whitehexagon · 2025-01-28T19:23:53 1738092233

Agreed, I switched from qwq now to the same model. I'm running it under ollama on a M1 Asahi Linux and it seems maybe twice the speed (not very scientific but not sure how to time the token generation), and more, dare I say smarter? than qwq, and maybe a tad less RAM. It still over ponders, but not as bad as some of the pages and pages of, 'that looks wrong, maybe I should try...' circles with qwq, but which was already so impressive.

I'm quite new to this, how are you feeding in so much text? just copy/paste? I'd love to be able to run some of my Zig code through it, but I haven't managed to get Zig running under Asahi so far.

buyucu · 2025-01-28T17:24:46 1738085086

DeepSeek-R1-Distill-Qwen-32B is my new default model on my home server. previously it was aya-32b.

xenospn · 2025-01-28T20:10:30 1738095030

What do you use it at home for?

m3kw9 · 2025-01-28T17:12:53 1738084373

What does distil qwen 32b mean? It uses qwen for what?

buyucu · 2025-01-28T17:29:34 1738085374

deepseek fine-tuned qwen32b with data generated by deepseek671b

amarcheschi · 2025-01-28T16:05:01 1738080301

For what i can understand, he asked deepseek to convert arm simd code to wasm code.

in the github issue he links he gives an example of a prompt: Your task is to convert a given C++ ARM NEON SIMD to WASM SIMD. Here is an example of another function: (follows a block example and a block with the instructions to convert)

https://gist.github.com/ngxson/307140d24d80748bd683b396ba13b...

I might be wrong of course, but asking to optimize code is something that quite helped me when i first started learning pytorch. I feel like "99% of this code blabla" is useful as in it lets you understand that it was ai written, but it shouldn't be a brag. then again i know nothing about simd instructions but i don't see why it should be different for a capable llm to do simd instructions or optimized high level code (which is much harder than just working high level code, i'm glad i can do the latter lol)