I know it isn’t your question exactly, and you probably know this, but the models for coding-assist tools are generally fine-tunes of base models, specialized for coding. Example: in OpenAI Codex they use GPT-5-codex.
I think the question is, can I throw a couple thousand bucks of GPU time at fine-tuning a model to have knowledge of our couple million lines of C++ baked into the weights instead of needing to fuck around with "Context Engineering".
Like, how feasible is it for a mid-size corporation to use a technique like LoRA, mentioned by GP, to "teach" (say, for example) Kimi K2 about a large C++ codebase, so that individual engineers don't need to learn the black art of "context engineering" and can just ask it questions?
I'm curious about it too. I think there are two bottlenecks: one is that training a relatively large LLM can be resource-intensive (so people go for RAG and other shortcuts), and the other is that fine-tuning it to your use cases might make it dumber overall.
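Since LoRA keeps coming up: the core trick is small enough to sketch in a few lines. Instead of updating a full d_out x d_in weight matrix, you train two low-rank factors B (d_out x r) and A (r x d_in) and add their scaled product as a delta. Here is a toy, dependency-free sketch; the alpha/r scaling convention follows the original LoRA paper, but all the dimensions and numbers are made up, and a real fine-tune would use a library like Hugging Face PEFT rather than anything hand-rolled:

```python
# Why LoRA is cheap: parameter count of the trainable delta vs. full fine-tuning.
d_in, d_out, r = 4096, 4096, 16      # e.g. one attention projection, rank 16

full_params = d_in * d_out           # params updated by full fine-tuning
lora_params = r * (d_in + d_out)     # params in the low-rank factors A and B
savings = full_params / lora_params  # 128x fewer trainable params here

def lora_forward(W, A, B, x, alpha=32, r=16):
    """y = (W + (alpha/r) * B @ A) @ x, written without numpy for clarity.

    W: d_out x d_in frozen base weights (list of lists)
    A: r x d_in, B: d_out x r trained low-rank factors
    """
    scale = alpha / r
    rank = len(A)
    y = []
    for i in range(len(W)):
        acc = 0.0
        for j in range(len(x)):
            # delta[i][j] = sum_k B[i][k] * A[k][j]
            delta_ij = sum(B[i][k] * A[k][j] for k in range(rank))
            acc += (W[i][j] + scale * delta_ij) * x[j]
        y.append(acc)
    return y
```

The point of the savings number: the base weights stay frozen, so the per-fine-tune cost scales with r, not with the full matrix size.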
A more technically correct way to express this feeling is:
"The computational power of the cores on the GPU was never the issue -- however, the code that I wrote created a memory bandwidth bottleneck that starved the GPU cores of data to work on. Fully understanding the bandwidth and latency characteristics of the device(s) I'm running on is firmly within my responsibilities as a programmer."
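To make the "starved of data" point concrete, here's a back-of-the-envelope roofline check. The peak numbers are rough illustrative figures, in the ballpark of a modern datacenter GPU, not a spec quote:

```python
# Roofline sketch: is a kernel compute-bound or bandwidth-bound?
peak_flops = 19.5e12   # FP32 FLOP/s (illustrative, roughly A100-class)
peak_bw    = 1.555e12  # bytes/s of HBM bandwidth (illustrative)

# Element-wise SAXPY: y = a*x + y
# 2 FLOPs per element; 12 bytes moved per element (read x, read y, write y,
# 4 bytes each in FP32).
flops_per_elem = 2
bytes_per_elem = 12
intensity = flops_per_elem / bytes_per_elem      # FLOP/byte the kernel offers

machine_balance = peak_flops / peak_bw           # FLOP/byte needed to saturate cores
bandwidth_bound = intensity < machine_balance    # True: cores will be starved

# Best case throughput for this kernel, capped by the memory system:
attainable_flops = min(peak_flops, intensity * peak_bw)
```

With these numbers the kernel's arithmetic intensity (~0.17 FLOP/byte) is far below the machine balance (~12.5 FLOP/byte), so the cores sit idle waiting on memory, exactly the situation the quote describes.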
It's perhaps noteworthy that OBBBA is not the first bill to attempt to reverse this tax law; it's simply the latest. There have been other attempts to restore Section 174 expensing.
Other attempts that come to mind:
1. Tax Relief for American Families and Workers Act of 2024 (H.R. 7024)
2. American Innovation and R&D Competitiveness Act of 2025 (H.R. 1990)
Interesting, but I can't help but question whether there's some way to leverage NeRF to estimate the keypoints directly, rather than using NeRF for supervision. The two-stage setup seems a bit redundant.
Of course they won't. The investment in the Heavy Press Program was the initial build, and just citing one example, the Alcoa 50,000 ton forging press was built in 1955, operated until 2008, and needed ~$100M to get it operational again in 2012.
The investment was made to build the press, which created significant jobs and capital investment. The press, and others like it, were subsequently operated by and then sold to a private operator, which in turn enabled the massive expansion of both military manufacturing, and commercial aviation and other manufacturing.
The Heavy Press Program was a strategic investment that paid dividends by both advancing the state of the art in manufacturing at the time it was built, and improving manufacturing capacity.
A GPU cluster might not be the correct investment, but a strategic investment in increasing, for example, the availability of training data, or interoperability of tools, or ease of use for building, training, and distributing models would probably pay big dividends.
There may, however, be a shortage of capital for open-source AI, which is the subject under consideration.
As for the why... because there's no shortage of capital for AI. It sounds like the government would like to encourage redirecting that capital to something that's good for the economy at large, rather than good for the investors of a handful of Silicon Valley firms interested only in their own short term gains.
Look at it from the perspective of an elected official:
If it succeeds, you were ahead of the curve. If it fails, you were prudent enough to fund an investigation early. Either way, bleeding edge tech gives you a W.
Yeah. There is a lot of overhyped and overfunded nonsense that comes out of NASA. Some of it is hype from the marketing and press teams; other hype comes from misinterpretation of releases.
None of that changes the fact that there have been major technical breakthroughs, and entire classes of products and services that didn't exist before those investments in NASA (see https://en.wikipedia.org/wiki/NASA_spin-off_technologies for a short list). There are 15 departments and dozens of agencies that comprise the US federal government, many of which make investments in science and technology as part of their mandates, and most of that is delivered through some structure of public-private partnerships.
What you see as over-hyped and over-funded nonsense could be the next groundbreaking technology, and that is why we need both elected leaders who (at least in theory) represent the will of the people, and appointed, skilled bureaucrats who provide the elected leaders with the skills, domain expertise, and experience that the winners of the popularity contest probably don't have.
Yep, there will be waste, but at least with public funds there is the appearance of accountability that just doesn't exist with private sector funds.
If it succeeds the idea gets sold to private corporations or the technology is made public and everyone thinks the corporation with the most popular version created it.
If it fails certain groups ensure everyone knows the government "wasted" taxpayer money.
> A GPU cluster might not be the correct investment, but a strategic investment in increasing, for example, the availability of training data, or interoperability of tools, or ease of use for building, training, and distributing models would probably pay big dividends
Would you mind expanding on these options? Universal training data sounds intriguing.
Sure. Just on the training front: building and maintaining a broad corpus of properly managed training data, with metadata that provides attribution (for example, flagging content that is known to be human-generated rather than model-generated, and recording the source of datasets such as weather data, census data, etc.), and that also captures any licensing encumbrance, so that consumers of the training data can be confident in their ability to use it without risk of legal challenge.
Much of this is already available to private sector entities, but having a publicly funded organization responsible for curating and publishing this would enable new entrants to quickly and easily get a foundation without having to scrape the internet again, especially given how rapidly model generated content is being published.
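The per-record provenance described above could be as simple as a structured metadata envelope around each document. A minimal sketch; every field name here is hypothetical, just to show the shape such a record might take:

```python
from dataclasses import dataclass, asdict

@dataclass
class TrainingRecord:
    """Hypothetical provenance envelope for one item in a public corpus."""
    content_id: str        # stable identifier for the underlying document
    source: str            # e.g. "census", "weather", "web-crawl"
    human_generated: bool  # provenance flag: known human content, not model output
    license: str           # SPDX identifier where possible, for downstream safety
    attribution: str       # who to credit, per the license terms

record = TrainingRecord(
    content_id="doc-000001",
    source="census",
    human_generated=True,
    license="CC-BY-4.0",
    attribution="US Census Bureau, 2020 decennial release",
)

# Serializable to plain dicts/JSON, so consumers can filter a corpus
# by license or provenance before training.
metadata = asdict(record)
```

The value is less in any one field than in the guarantee that every record has all of them, curated by someone accountable.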
I think the EPC (energy performance certificate) dataset in the UK is a nice example of this. Anyone can download a full dataset of EPC data from https://epc.opendatacommunities.org/
Admittedly it hasn't been cleaned all that much - you still need to put a bit of effort into that (newer certificates tend to be better quality), but it's very low friction overall. I'd love to see them do this with more datasets
If the public is going to go to all the trouble of doing something, why would that public not make it clear that there is no legal threat to using any data available?
The public is incredibly lazy, though. Don't expect them to do anything until their hand is forced, which doesn't bode well for the action to meet a desirable outcome.
> Doubtful that GPUs purchased today would be in use for a similar time scale
Totally agree. That doesn't mean it can't generate massive ROI.
> Govt investment would also drive the cost of GPUs up a great deal
Difficult to say this ex ante. On its own, yes. But it would displace some demand. And it could help boost chip production in the long run.
> Not sure why a publicly accessible GPU cluster would be a better solution than the current system of research grants
Those receiving the grants have to pay a private owner of the GPUs. That gatekeeping might be both problematic, if there is a conflict of interests, and inefficient. (Consider why the government runs its own supercomputers versus contracting everything to Oracle and IBM.)
No. But I do want to limit the amount we reward NVIDIA for calling the shots correctly to maximize the benefit to society. For instance by reducing the duration of the government granted monopolies on chip technology that is obsolete well before the default duration of 20 years is over.
That said, it strikes me that the actual limiting factor is fab capacity, not NVIDIA's designs, and we probably need to lift the monopolies preventing competition there if we want to reduce prices.
> Why do you think these private entities are willing to invest the massive capital it takes to keep the frontier advancing at that rate?
Because whether they make 100x or 200x they make a shitload of money.
> Why wouldn't NVIDIA be a solid steward of that capital given their track record?
The problem isn't who is the steward of the capital. The problem is that the economically efficient thing for a single company to do (given sufficient fab capacity, and a monopoly) is to raise prices to extract a greater share of the pie at the expense of shrinking the size of the pie. I'm not worried about who takes the profit, I'm worried about the size of the pie.
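The "shrinking the pie" point is textbook deadweight loss, and a toy numeric example makes it concrete. The demand curve and cost numbers below are invented purely for illustration:

```python
# Toy illustration of monopoly deadweight loss.
# Linear demand P = 100 - Q, constant marginal cost c = 20. All numbers invented.
c = 20

# Competitive outcome: price driven down to marginal cost.
q_comp = 100 - c                          # 80 units sold
surplus_comp = 0.5 * q_comp * (100 - c)   # total surplus = 3200 (all to consumers)

# Monopoly outcome: set marginal revenue (100 - 2Q) equal to c.
q_mono = (100 - c) / 2                    # 40 units sold
p_mono = 100 - q_mono                     # price 60
profit = (p_mono - c) * q_mono            # producer's slice: 1600
consumer_surplus = 0.5 * q_mono * (100 - p_mono)  # 800
surplus_mono = profit + consumer_surplus  # total pie shrinks to 2400

deadweight_loss = surplus_comp - surplus_mono     # 800 simply vanishes
```

The monopolist captures a bigger slice (1600 vs 0 profit), but the total pie drops from 3200 to 2400: the deadweight loss is value nobody gets.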
> Because whether they make 100x or 200x they make a shitload of money.
It's not a certainty that they 'make a shitload of money'. Reducing the right tail payoffs absolutely reduces the capital allocated to solve problems - many of which are risky bets.
Your solution absolutely decreases capital investment at the margin, this is indisputable and basic economics. Even worse when the taking is not due to some pre-existing law, so companies have to deal with the additional uncertainty of whether & when future people will decide in retrospect that they got too large a payoff and arbitrarily decide to take it from them.
You can't just look at the costs to an action, you also have to look at the benefits.
Of course, I agree I'm going to stop some marginal investments into research on patentable technologies by reducing the expected profit. But only very slightly, because I'm not shifting the expected value by much. Meanwhile I'm going to greatly increase investment into the existing technology we already have, allow many more people to try to improve upon it, and argue that the benefits greatly outweigh the costs.
Whether I'm right or wrong about the net benefit, the basic economics here is that there are both costs and benefits to my proposed action.
And yes I'm going to marginally reduce future investments because the same might happen in the future and that reduces expected value. In fact if I was in charge the same would happen in the future. And the trade-off I get for this is that society gets the benefit of the same actually happening in the future and us not being hamstrung by unbreachable monopolies.
> But I'm going to do so very slightly because I'm not shifting the expected value by very much
I think you're shifting it by a lot. If the government can post-hoc decide to invalidate patents because the holder is getting too successful, you are introducing a substantial impact on expectations and uncertainty. Your action is not taken in a vacuum.
> Meanwhile I'm going to greatly increase the investment into the existing technology we already have, and allow many more people to try to improve upon it, and I'm going to argue the benefits greatly outweigh the costs.
I think this is a much more speculative impact. Why will people even fund the improvements if the government might just decide they've gotten too large a slice of the pie later on down the road?
> the trade-off I get for this is that society gets the benefit of the same actually happening in the future and us not being hamstrung by unbreachable monopolies.
No the trade-off is that materially less is produced. These incentive effects are not small. Take for instance, drug price controls - a similar post-facto taking because we feel that the profits from R&D are too high. Introducing proposed price controls leads to hundreds of fewer drugs over the next decade [0] - and likely millions of premature deaths downstream of these incentive effects. And that's with a policy with a clear path towards short-term upside (cheaper drug prices). Discounted GPUs by invalidating nvidia's patents has a much more tenuous upside and clear downside.
You have proposed state ownership of all successful IP. That is a massive change and yet you have demonstrated zero understanding of the possible costs.
Your claim that removing a profit motivation will increase investment is flat out wrong. Everything else crumbles from there.
No, I've proposed removing or reducing IP protections, not transferring them to the state. Allowing competitors to enter the market will obviously increase investment in competitors...
This is already happening -- it's called China.
There's a reason they don't innovate in anything and are always playing catch-up, except in the art of copying (stealing) from others.
I do think there are some serious IP issues, as IP rules can be hijacked in the US, but that means you fix those problems, not blow up IP that was rightfully earned
> That said, it strikes me that the actual limiting factor is fab capacity not nvidia's designs and we probably need to lift the monopolies preventing competition there if we want to reduce prices.
Lol, it's not "monopolies" limiting fab capacity. Existing fab companies can barely manage to stand up a new fab in different cities. Fabs are impossibly complex and beyond risky to fund.
It's the kind of thing you'd put government money toward, but it's so risky that governments really don't want to spend billions and fail, so they give existing companies billions; then if those companies fail, it's not the government's fault.
So, if a private company is successful, you will nationalize its IP under some guise of maximizing the benefit to society? That form of government was tried once. It failed miserably.
Under your idea, we’ll try a badly broken economic philosophy again. And while we’re at it, we will completely stifle investment in innovation.
There is no such thing as a lump-sum transfer; this will shift expectations and incentives going forward and make future large capital projects an increasingly uphill battle.
There was a post[0] on here recently about how the US went from producing woefully insufficient numbers of aircraft to producing 300k by the end of World War II.
One of the things that the post mentioned was the meager profit margin that the companies made during this time.
But the thing is that this set the American auto and aviation industries up to rule the world for decades.
A government going to a company and saying 'we need you to produce this product for us at a lower margin than you'd like' isn't the end of the world.
I don't know if this is one of those scenarios but they exist.
They are an intellectual property company holding the rights to plans for making graphics cards -- not even a company actually making graphics cards.
The government could launch an initiative "OpenGPU" or "OpenAI Accelerator", where the government orders GPUs from TSMC directly, without the middleman.
It may require some tweaking in the law to allow exception to intellectual property for "public interest".
y'all really don't understand how these actions would seriously harm capital markets and make it difficult for private capital formation to produce innovations going forward.
If we have public capital formation, we don’t necessarily need private capital. Private innovation in weather modelling isn’t outpacing government work by leaps and bounds, for instance.
because it is extremely challenging to capture the additional value that is being produced by better weather forecasts and generally the forecasts we have right now are pretty good.
private capital is absolutely the driving force for the vast majority of innovations since the beginning of the 20th century. public capital may be involved, but it is dwarfed by private capital markets.
It’s challenging to capture the additional value and the forecasts are pretty good because of continual large-scale government investment into weather forecasting. NOAA is launching satellites! it’s a big deal!
Private nuclear research is heavily dependent on government contracts to function. Solar was subsidized to heck and back for years. Public investment does work, and does make a difference.
I would even say governmental involvement is sometimes even the deciding factor, to determine if research is worth pursuing. Some major capital investors have decided AI models cannot possibly gain enough money to pay for their training costs. So what do we do when we believe something is a net good for society, but isn’t going to be profitable?
They said remove legally-enforced monopolies on what they produce. Many of these big firms made their tech with millions to billions of taxpayer dollars at various points in time. If we’ve given them millions, shouldn’t we at least get to make independent implementations of the tech we already paid for?
> Those receiving the grants have to pay a private owner of the GPUs.
Along similar lines, I'm trying to build a developer credits program where I get whomever (AMD/Dell) to purchase credits on my super computers, that we then give away to developers to build solutions, which drives more demand for our hardware, and we commit to re-invest those credits back into more hardware. The idea is to create a win-win-win (us, them, you) developer flywheel ecosystem. It isn't a new idea at all, Nvidia and hyperscalers have been doing this for ages.
A much better investment would be to (somehow) revolutionize production of chips for AI so that it's all cheaper, more reliable, and faster to stand up new generations of software and hardware codesign. This is probably much closer to the program mentioned in the top level comment: It wasn't to produce one type of thing, but to allow better production of any large thing from lighter alloys.
> Not sure why a publicly accessible GPU cluster would be a better solution than the current system of research grants.
You mean a better solution than different teams paying AWS over and over, potentially spending 10x on rent rather than using all that cash as a down payment on actually owning hardware? I can't really speak for the total costs of depreciation/hardware maintenance but renting forever isn't usually a great alternative to buying.
In Canada, all three major AI research centers use clusters created with public money. These clusters receive regular additional hardware as new generations of GPUs become available. Considering how these institutions work, I'm pretty confident they've considered the alternatives (renting, AWS, etc). So that's one data point.
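The rent-vs-buy intuition is easy to sanity-check with arithmetic. A rough break-even sketch, where every number is invented for illustration (real pricing varies enormously by provider, utilization, and ops overhead):

```python
# Back-of-the-envelope: years until buying a GPU beats renting one.
# All figures below are assumed placeholders, not quotes from any vendor.
hourly_rent = 3.00          # $/GPU-hour from a cloud provider (assumed)
purchase_price = 25_000.0   # $ to buy one GPU outright (assumed)
hosting_per_year = 3_000.0  # $ power/cooling/ops per GPU per year (assumed)
utilization = 0.60          # fraction of hours the GPU is actually busy

hours_per_year = 8760 * utilization   # only busy hours cost you rent
rent_per_year = hourly_rent * hours_per_year

# Break-even: cumulative rent avoided must cover purchase plus hosting.
breakeven_years = purchase_price / (rent_per_year - hosting_per_year)
```

With these made-up numbers, ownership pays for itself in roughly two years, well inside a GPU's useful life; the conclusion flips if utilization is low, which is exactly why shared public clusters (pooling many teams' utilization) can make the buy side work.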
sure, I’ll hand it over after you spend your own time first to show that everything everywhere that’s owned instead of leased is a poor financial decision.
I cite how science permits plausibility based upon evidence. This is in contrast to OP's wishful dogma, where anything not presently known must be fantasy/sci-fi and implausible - which is not how science works.
I didn't say Martians can't be real. I said the evidence that I've encountered (obviously limited by the fact that I'm not a subject matter expert) is insufficient to overcome the threshold of doubt. I then theorized (uncharitably, I'll admit) that many people are predisposed to believe in Martians because they've read too many science fiction stories, which themselves are actually just metaphors for conditions of life here on Earth. I then made fun of these people, lightly I think.
They likely would not be in this position if they weren’t abusing it for an egregious fee. If they were noble, they wouldn’t have the fee. I think it’s more about holding profits captive than anything else.