Serverless-registry: A Docker registry backed by Workers and R2

jacobwg · 2024-09-05T18:34:50 1725561290

The annoying thing about trying to implement a Docker registry on Workers and R2 is that it's so close to having everything you need, but the 500MB request body limit means Workers is unable to accept pushes of layers larger than 500MB. The limit is even lower at 100MB on the Pro plan[0].

We are running a registry that does store content on R2[1], and today this is implemented as the unholy chimera of Cloudflare Workers, AWS CloudFront, Lambda@edge, regular Lambda, S3, and R2.

Pushes go first to CloudFront+Lambda@edge, content is saved in S3 first, then moved to R2 in background jobs. Once it's transited to R2, then pulls are served from R2.

I would so love for Workers + R2 to actually be able to accept pushes of large layers, unfortunately I have yet to talk to anyone at Cloudflare who believes it's possible. Especially in this era of AI/ML models, some container images can have single layers in the 10-100GB range!

[0] https://developers.cloudflare.com/workers/platform/limits/#r...

[1] https://depot.dev/docs/guides/ephemeral-registry

ljm · 2024-09-06T00:33:25 1725582805

Live by the cloud, die by the cloud, right?

It’s trivial to set up a self hosted docker registry on your own storage. I’m running the helm version of it on my own k3s cluster, the cluster itself costing 20€/mo.

Why would you need all of this fancy vendor lockin when a few low cost ARM boxes can get the job done?

rfoo · 2024-09-06T09:28:03 1725614883

Because I actually want it to serve 10GB container images to our "enthusiastic" users over the Internet.

Oh, and our "enthusiastic" users want to pull it on 100s of boxes at the same time. What are you going to do? Send a legal notice to them for "DDoS"-ing your poor low cost ARM boxes?

lstamour · 2024-09-06T09:49:46 1725616186

I should preface this by saying that I figure you know all this, but I'm laying it out anyway. :)

Theoretically (I haven't read the parent article yet) it's just GET and HEAD requests for a Docker Registry[1], so if CloudFlare or another CDN supports large binary files, you could cache the images from the low cost box.

That said, obviously I'm suggesting adding "cloud" CDN infrastructure to your tiny ARM box. In a more normal scenario, most would probably pick a free Docker Registry and just upload a mirror of the images to multiple registries to spread out the bandwidth load. E.g. Docker Hub, GitHub, etc.

For a better solution, don't serve 10GB container images. Instead, start from someone else's 10GB container image and add the layers you need on top of it. Or consider a solution where you don't need to ship 10GB of data in your application, but could perhaps side-load only the necessary data.

Another workaround: because Docker images can be pushed and pulled with identical signatures, it's also possible to encourage your end users to keep their own Docker registry of the image and refer to it from an internal or private copy of the image. This is a best practice for pretty much every kind of production deployment of a Docker container, so you don't have 100s of boxes pulling from your shared infrastructure, and leads to more reliable Docker deployments. An example is, in fact, pulling an image - because it stores a copy locally that you can refer to and run multiple containers at the same time from one local image. Pretty much every container image deployment I refer to in production, I prefer to use a privately mirrored copy instead of pulling from a source repository directly. Maybe it's just me? :)

[1]: https://ochagavia.nl/blog/using-s3-as-a-container-registry/

rfoo · 2024-09-06T10:16:25 1725617785

> For a better solution, don't serve 10GB container images. Instead, start from someone else's 10GB container image and add the layers you need on top of it.

This is assuming I'm not shipping 10GB of novel bytes, but in (2024's) reality quite some people ship model files in container image, guess how large they are :p

> it's also possible to encourage your end users to keep their own Docker registry of the image and refer to it from an internal or private copy of the image

Of course. Or I can just piggy back on Cloudflare until they decided to stop the party. Sounds much easier for my casual users.

For serious users they setup internal mirror anyway so I don't need to encourage someone over the Internet which is hard.

msq22 · 2024-09-06T19:25:24 1725650724

Honest question: why don't you just handle the upload part using a $5 VPS instead of workers?

ljm · 2024-09-06T20:15:05 1725653705

Docker gives you everything you need to cache a build and keep the final image small: multi-stage builds, layer caching, build caching (cache-from/cache-to).

It's more recent of course, so there are a lot of docker images out there that don't contain any of that.

As to my original point, for purely internal purposes you can self-host for a pitiful cost. You're not going to open that to the internet and start running a public registry from it, but it's a trivial setup, easy to integrate with (not abstracted through cloud IAM setups) and cheaper than paying SaaS for it.

djbusby · 2024-09-06T18:31:16 1725647476

Not just you. I use a vendored on-prem copy for all kinds of stuff. CPAN, PECL, NPM, bespoke binaries and docker images.

All the upstream vendor deps get caches in this corp-main type repo that we all use internally.

10000truths · 2024-09-06T17:27:45 1725643665

The specs of the box are the least of your worries at that point. You tell your enthusiastic customers to set up a pull-through cache, because with that kind of traffic, you're going to pay through the nose for egress on typical cloud providers, unless you can guarantee that your users are collocated.

jacobwg · 2024-09-06T09:50:44 1725616244

Yeah in our case we are operating a private registry on behalf of our customers, so slightly different use-case than running your own registry for your own internal use.

If you do want to run your own registry, there's some great OSS projects including https://github.com/project-zot/zot, https://goharbor.io/, and of course https://github.com/distribution/distribution.

zimbatm · 2024-09-05T19:03:23 1725563003

Does the worker need to process the body?

Sometimes, the worker can return a signed URL and have the client directly upload to R2.

znpy · 2024-09-06T09:02:29 1725613349

> Sometimes, the worker can return a signed URL and have the client directly upload to R2.

That's not what client expect though, and that would be (iaas) provider-dependent.

amenghra · 2024-09-05T18:44:16 1725561856

Can you generate a signed url to upload directly to R2? Or perform the upload in chunks?

jacobwg · 2024-09-05T19:23:15 1725564195

Uploading in chunks could definitely solve the issue, and the OCI Distribution Specification does actually have some language about an optional chunked push API[0].

Unfortunately very few of the registry clients actually support this, critically containerd does not[1], so this means your regular `docker push` and a whole lot of ecosystem tooling does not work.

This also means that the single PUT must be able to support very large pushes as a single request, possibly even larger than what R2 or S3 would allow without using multipart upload. This means you actually need a server to accept the PUT, then do its own chunked upload to object storage or otherwise stage the content before it's finally saved in object storage.

This rules out presigned URLs for push too, since the PUT request made to the presigned URL can be too large for the backing object storage to accept.

There's also other processing that ideally happens on push (like hash digest verification of the pushed layer) that mean a server somewhere needs to be involved.

[0] https://github.com/opencontainers/distribution-spec/blob/mai...

[1] https://github.com/containerd/containerd/blob/192679b05917b5...

telgareith · 2024-09-05T18:52:47 1725562367

What am I missing such that presigned Urls aren't the solution to this issue?

andreasmetsala · 2024-09-05T20:19:40 1725567580

R2 is ridiculously cheap compared to S3. The price difference was more than 40x when I last looked at it.

compootr · 2024-09-05T22:31:12 1725575472

Rediculously cheap until their sales team shows up and says otherwise!

kobalsky · 2024-09-05T22:37:04 1725575824

Mind explaining?

fragmede · 2024-09-05T22:55:13 1725576913

https://robindev.substack.com/p/cloudflare-took-down-our-web...

kirubakaran · 2024-09-06T01:11:43 1725585103

Discussion with 473 comments: https://news.ycombinator.com/item?id=40481808

Summerbud · 2024-09-06T03:54:18 1725594858

Ouch, that is a bad move even this is happening on a gamble site :((

judge2020 · 2024-09-05T23:10:07 1725577807

Once again i would like eastdakota to respond to this sales tactic. Surely this can't be a double-digit driver for revenue to where he must stay silent on it.

alemanek · 2024-09-06T00:15:33 1725581733

Yikes, looks like CF is off my list of things to use. Super scummy

stingraycharles · 2024-09-06T06:35:39 1725604539

If you read the article, the OOP was running a gambling site so it is fairly reasonable for Cloudflare not to want their IP blocks to be associated with that.

It’s a communication failure on CF’s end, but it’s not an ordinary situation.

_ugfj · 2024-09-06T12:07:28 1725624448

It is. To quote my comment from back then:

And all of that is fine when communicated properly. Even if OP is an unreliably narrator are we to believe they also left out some of CF's emails?

To me it looks like https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_pr... is entirely the wrong email to send in the situation and if you are as old as I am and come from where I come from, you will have flashbacks to "reading between the lines" of the party daily in the 1980s. The real content is at the bottom:

> As we have a very short window to report back to Trust & Safety team, please let me know if you can make time tomorrow

Big red flashing lights: the right questions are 1) why is T&S involved at all 2) What are their concerns which forces such a hurried deadline? 3) What are the consequences of missing this deadline.

The right email would start with something like this:

> Providing services to your business constitutes serious legal risk to Cloudflare. We are happy to work with you in the future if you are buying an Enterprise plan. As we need to commit significant resources to accommodate you, we need an annual commitment. Otherwise, with much regret we need to terminate our services provided to you as it is our right per Terms on date/time. ("We may at our sole discretion terminate your user account or Suspend or terminate your use or access to the Service at any time, with or without notice for any reason or no reason at all.")

> This plan would also include these features:

alemanek · 2024-09-06T16:21:10 1725639670

I did read it and understand that they are running a gambling site. CFs handling of it though where they instead took this as an opportunity for high pressure upsell is the scummy part.

Why wouldn’t they allow them to pay monthly for enterprise? This would allow them to use their own IPs, eliminating the risk to CF, and allow for an orderly migration off the platform if they wanted. Forcing an annual contract with a massive price tag is again just scummy sales tactics.

Also after reading that and searching a bit it looks like this isn’t the first time that CF has had these types of “communication failures”.

EDIT: a few words to clarify what I meant on the lack of a monthly option in this exchange.

znpy · 2024-09-06T09:05:07 1725613507

They just asked for more money, they did not ask them to leave the service.

nkmnz · 2024-09-06T09:55:18 1725616518

Thats what the casino tells a gambler, too.

compootr · 2024-09-15T04:16:53 1726373813

They did, until they mentioned they were speaking to Fastly. Then, bye bye proxying and protection!

more of a goodbye than an ask :wink:

compootr · 2024-09-15T04:14:30 1726373670

> It’s a communication failure

Ten thousand percent. When I read the article earlier they made the email something to the tune of "CF engineering says your account is seriously messing up our network. contact us to resolve this"

and it was a sales call. scummy as hell to get your kpis

koolba · 2024-09-05T20:48:26 1725569306

> We are running a registry that does store content on R2[1], and today this is implemented as the unholy chimera of Cloudflare Workers, AWS CloudFront, Lambda@edge, regular Lambda, S3, and R2.

What’s the advantage over just using ECR? Cost of storage? Cost of bandwidth to read? Hosting provider genetic diversity?

Shakahs · 2024-09-05T22:07:16 1725574036

ECR is slow. Despite being a static datastore presumably backed by S3 it will only serve container image layers at around 150mbps, when dealing with large (10GB) container images this is a problem. R2 will happily serve the same data at multi-gigabit speed.

jacobwg · 2024-09-06T09:36:35 1725615395

Data transfer costs. Our registry is for our customers, and they expect to pull their images from environments outside AWS, e.g. GitHub Actions.

If that customer has a 2GB image, they want to build the image and then pull it into 10 separate matrix jobs (think like parallel Cypress tests), and they have 1,000 commits in a month, then the AWS data transfer costs are $1,800/mo, just for that one customer to pull their images.

With R2, it acts both as a CDN and since Cloudflare does not charge for egress, reduces the cost to only storage + the one-time transfer out of AWS.

gyre007 · 2024-09-05T21:28:35 1725571715

Cost of storage and egress cost are wildly more expensive on ECR compared to CF+R2

captainjapeng · 2024-09-07T08:51:08 1725699068

I was able to push to a serverless-registry deployment a 10GB docker image, I've shared my approach here

https://github.com/cloudflare/serverless-registry/issues/42

_ugfj · 2024-09-06T12:11:28 1725624688

Why R2 instead of Backblaze B2? Isn't R2 more than double the price of B2?

jzelinskie · 2024-09-05T20:24:42 1725567882

If you are a CloudFlare employee reading this, you should get involved with the OCI Distribution group that develops the standards for the registry: https://github.com/opencontainers/distribution-spec

gyre007 · 2024-09-05T21:35:52 1725572152

OCI is demonstrably broken as a specification body as demonstrated by the referrers API. Distirbution spec as it is at the moment is just a very poorly written technical doc.

beeboobaa3 · 2024-09-05T22:27:43 1725575263

Can you explain a bit more what you mean?

re-thc · 2024-09-06T04:39:38 1725597578

> you should get involved with the OCI Distribution group

This is likely just a sample to showcase workers. Not sure it's enough reason for CloudFlare to get involved?

champtar · 2024-09-05T18:41:48 1725561708

I would love if the container pull protocol stopped using custom headers or content-type, so we could use any dumb http server.

mayli · 2024-09-05T22:33:38 1725575618

this? https://github.com/NicolasT/static-container-registry and this? https://github.com/jpetazzo/registrish

champtar · 2024-09-05T23:44:44 1725579884

Both exemples generate custom nginx config

Fire-Dragon-DoL · 2024-09-05T17:29:39 1725557379

How's the pricing with low usage? I suspect this is great. I wanted an image registry so that I can use it to deploy with Kamal, but the $5 plan is overpriced, given I push an image maybe once every 3 months. This could solve that

bayesianbot · 2024-09-05T17:38:59 1725557939

I don't use much of CloudFlare services but it seems kinda cheap, $0.015/GBmonth for storage (+10GB free), Workers are charged per request and CPU time, both of which would probably be quite low for a registry so free plan would go quite far?

I just set up the official registry on a VPS (for similar usage pattern) and it was a bit of work and probably much more expensive, this seems quite attractive unless I've misunderstood something.

Fire-Dragon-DoL · 2024-09-05T18:30:26 1725561026

Yeah it does sound great. The alternative for me is to host my own docker registry on my home server. That would cost me 0 essentially (I have good internet at home)

thangngoc89 · 2024-09-05T18:12:41 1725559961

I think this is wonderful. I’m running a Gitea instance in one of our dev machine just for private registry. Keeping the instance only had been extra workflow for us.

But 500MB limit of layer size is a dealbreaker for AI related workflow.

geek_at · 2024-09-05T18:25:10 1725560710

gitea does also bring their own registry though. If you self-host you can also use LFS for unlimited filesizes

thangngoc89 · 2024-09-05T19:47:45 1725565665

I’m self-hosting gitea just for their private docker registry. LFS is actually slow for heavy deep learning workflow with millions of small files. I’m using DVC [1] instead.

[1]: https://dvc.org

arjvik · 2024-09-05T20:57:02 1725569822

Absolutely love DVC for data version control! What storage backend are you using with DVC?

thangngoc89 · 2024-09-05T21:54:00 1725573240

Local (mounted NFS) from our internal NAS

mikeocool · 2024-09-05T18:35:17 1725561317

Have any container running tools just implemented basic S3 compatibility for pushing/pulling images? If your registry doesnt accept pushes from untrusted sources, it doesn't seem like there is a ton of value in having "smarts" in the registry server itself.

When you push, the client could just PUT a metadata file and an object for each layer in the object store, and pulling would just read the metadata file, which would tell it where to get each layer. And could use etags to skip downloaded layers that have already been downloaded.

For auth just use the standard S3 auth.

Would be compatible with S3/r2/any other S3-compatible storage.

tfolbrecht · 2024-09-06T07:15:33 1725606933

The docker registry container supports S3 as a storage backend you could use it locally

You can also `docker image save` then write the tarball to S3, then load to use.

ImJasonH · 2024-09-06T01:19:14 1725585554

I built a similar PoC using Workers+R2, before Cloudflare released theirs, in case you find it useful: https://github.com/chainguard-dev/crow-registry

We eventually built our own registry in Go running on Cloud Run, which now serves all our images on cgr.dev.

Zero egress fees is really a game changer.

qudat · 2024-09-05T22:27:49 1725575269

This is pretty nice. Does it support an API for deleting images (and having it properly garbage-collected)? It looks like maybe this does it? https://github.com/cloudflare/serverless-registry/blob/13c4e...

We have a managed docker registry and could have definitely used this project!

Slightly unrelated, but we've been experimenting with using SSH for authenticating with a docker registry if anyone is interested: https://github.com/picosh/tunkit?tab=readme-ov-file#why

yecuken · 2024-09-05T19:00:41 1725562841

I'm using this registry with regctl[0] to chunk uploads (to circumvent 100MB limit), works just fine for huge layers with models. With regctl you will also get 'mount' query parameter for upload initialization with the proper blob name so you can skip additional R2 copy when multi-part upload finalisation which speeds up the upload (and avoids crashes on larger blobs). This is not part of docker registry API, so I never got to PR that.

[0] https://github.com/regclient/regclient

miohtama · 2024-09-05T19:01:40 1725562900

When you switch to private Docker or Github registry to Cloudflare, are you effectively just trading one vendor lock in to another, or is there more into this?

vineyardmike · 2024-09-05T19:33:02 1725564782

None of these are really vendor lock in. The registry protocol is an open standard, this is just one more source that implements it. So you’re only locked in as far as your data is stored somewhere, but that data is behind an open API, so minimal risk there.

RyeCombinator · 2024-09-05T18:47:11 1725562031

Great feat.

However I am ever more confused now on what Cloudflare does and builds. They have everything from CDN, DNS to Orange Meets and this now?

yazaddaruvala · 2024-09-05T20:51:26 1725569486

My understanding is Cloudflare is a competitor of AWS, Azure, and GCP.

rozenmd · 2024-09-05T18:49:23 1725562163

There's a developer platform: https://workers.cloudflare.com/

airocker · 2024-09-05T22:12:00 1725574320

Is there a registry that would work on extremely cheap disk storage if the use case if only push and very infrequent pulls?

victorbjorklund · 2024-09-05T18:26:09 1725560769

Nice. I been seriously thinking about building exactly this (but Im glad someone smarter made it already)

ram_rattle · 2024-09-06T05:50:51 1725601851

Looks like a neat idea, does anyone know any open source version that does just this?

fswd · 2024-09-05T21:49:58 1725572998

using this same architecture, it would be cool to build a serverless-git

mdaniel · 2024-09-05T22:53:01 1725576781

I would have thought for sure someone would have already tried that, but regrettably trying to search for "serverless git" coughs up innumerable references to the framework that is hosted on GIThub

Anyway, I was curious how much work such a stunt would be and based on the git-http-backend docs <https://git.github.io/git-scm.com/docs/git-http-backend#Docu...> it seems like there are actually a manageable number of endpoints

I actually prefer their second scenario of splitting out access to the objects and pack files but since doing that would still require a function (or, ahem, a web server running) I suspect that optimization is not within the scope of what you had in mind

jedberg · 2024-09-05T23:09:33 1725577773

Isn’t GitHub already serverless git?

spikey_sanju · 2024-09-05T20:18:42 1725567522

Interesting. Just wish it handled larger image layers a bit better!

bravetraveler · 2024-09-05T22:58:22 1725577102

Interesting approach when running one of these on your 'LAN' is relatively easy

Though, to be fair, the pull-through mechanism in the reference registry been kind of goofy for years. Ask me how I know /s

Alifatisk · 2024-09-05T18:06:15 1725559575

> regitry

stpedgwdgfhgdd · 2024-09-06T07:49:13 1725608953

It is unfortunate that the cloudflare dev eco system is tight into the javascript world (e.g. npm). Wasm is not there yet to be a full replacement.