There’s honestly so much interesting stuff here, esp. the llm-related things - large concept models (operating on and predicting concepts, not tokens), dynamic byte latent transformers (byte-level alternative to standard tokenization), sparse memory layers (successfully scaling key-value memory layers without an increase in computational requirements).
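For anyone wondering what the memory-layer trick looks like mechanically, here's a rough sketch of product-key memory as I understand it (all dimensions and names are mine, not from the paper): only k value rows are touched per token, so capacity can grow massively without growing per-token FLOPs.

```python
import torch
import torch.nn as nn

class ProductKeyMemory(nn.Module):
    """Toy product-key memory layer (illustrative, not Meta's code).

    Factor N = n*n memory slots into two sets of n sub-keys, so finding
    the top-k of N slots costs O(n + k*k) scoring work instead of O(N).
    Only k value rows are read per token, so capacity scales far faster
    than compute.
    """
    def __init__(self, dim, n_sub_keys=512, k=8):
        super().__init__()
        half = dim // 2
        self.k, self.n = k, n_sub_keys
        self.sub_keys1 = nn.Parameter(torch.randn(n_sub_keys, half))
        self.sub_keys2 = nn.Parameter(torch.randn(n_sub_keys, half))
        # n_sub_keys**2 addressable slots, read sparsely via EmbeddingBag
        self.values = nn.EmbeddingBag(n_sub_keys ** 2, dim, mode="sum")

    def forward(self, x):                                  # x: (batch, dim)
        q1, q2 = x.chunk(2, dim=-1)                        # split the query
        s1, i1 = (q1 @ self.sub_keys1.T).topk(self.k, -1)  # (batch, k)
        s2, i2 = (q2 @ self.sub_keys2.T).topk(self.k, -1)
        # Cartesian product of the two top-k lists -> k*k candidate slots
        scores = (s1[:, :, None] + s2[:, None, :]).flatten(1)
        slots = (i1[:, :, None] * self.n + i2[:, None, :]).flatten(1)
        top_s, top_i = scores.topk(self.k, -1)             # overall top-k
        weights = top_s.softmax(-1)
        return self.values(slots.gather(1, top_i), per_sample_weights=weights)
```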
Here they are presented as separate things, each of which apparently improves quality / efficiency. I wonder what the quality / efficiency increase is of all those methods put together? Maybe that’s what Llama 4 will be?
It looks like a lot of innovation is happening at Meta in these areas, really cool!
I hope that Llama 4 or 5 will have a different architecture. All the released Llamas are more or less the same at inference time, just with a better training pipeline. The downside is that llama.cpp will probably not be able to run the new models, and it may be too big a rewrite, so we will need new C, C++, Go, and Rust programs.
I'd put a table-of-contents-like page up front with an exciting short description of each section, and use hyperlinks so the user can navigate to each section and back.
This is so cool! Playing around with the first demo is a lot of fun. First one to get the model to moonwalk wins. My best attempt was probably something like `(body_speed_forward < -0.3) * (head_height > 1.0) * (stay_still > 0.2) * (body_speed_vertical < 0.1) * (stay_upright > 0.9)`
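(If anyone wants to fiddle with it offline, here's roughly how I read that expression, with each clause acting as a 0/1 gate so the product only pays out when every constraint holds at once. The names are just the reward terms the demo exposes; the evaluation logic is my guess.)

```python
def moonwalk_reward(state: dict) -> float:
    # Each comparison is a hard 0/1 gate; multiplying them means the pose
    # only scores when gliding backward, standing tall, barely stepping,
    # not bouncing, and staying upright, all at the same time.
    return (
        float(state["body_speed_forward"] < -0.3)
        * float(state["head_height"] > 1.0)
        * float(state["stay_still"] > 0.2)
        * float(state["body_speed_vertical"] < 0.1)
        * float(state["stay_upright"] > 0.9)
    )

print(moonwalk_reward({
    "body_speed_forward": -0.5, "head_height": 1.4, "stay_still": 0.6,
    "body_speed_vertical": 0.0, "stay_upright": 1.0,
}))  # -> 1.0
```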
Then the "Meta Explore Theory of Mind" is even more interesting. There was a thread about a month ago in which some of us were discussing some of the concepts here like "beliefs" and updating a model of the world accordingly. https://news.ycombinator.com/item?id=42035985
I really hope Dynamic Byte Latent Transformers work out. Death to tokenizers!
Interesting that it's a hierarchical structure, but with only two levels of hierarchy. Stacking more levels seems like an obvious direction for further research.
Author here :), I do think it’s a good direction to look into! That said, aside from it being a bit too much to do at once, you’d also have to be careful about how you distributed your FLOP budget across the hierarchy. With two levels, you can make one level (bytes/local encoder) FLOP efficient and the other (patches/global encoder) FLOP intensive. You’d also need to find a way to group patches into larger units. But ya, there are many directions to go from here!
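To make the byte-to-patch step concrete: the grouping is driven by the small byte-level model's uncertainty, roughly like this (the threshold and shapes here are just for illustration, not the real config):

```python
import torch

def entropy_patch_boundaries(byte_logits, threshold=2.5):
    # byte_logits: (seq_len, 256) next-byte logits from the small local model.
    # Start a new patch wherever the byte LM is uncertain about what comes
    # next; confident stretches get merged into long, cheap patches.
    probs = byte_logits.softmax(dim=-1)
    entropy = -(probs * probs.clamp_min(1e-9).log()).sum(dim=-1)  # nats
    boundaries = entropy > threshold
    boundaries[0] = True  # the first byte always opens a patch
    return boundaries     # bool mask of patch starts, shape (seq_len,)
```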
In a way I'm kinda sad if tokenizers go the way of the dinosaurs, since asking someone to give me a Unicode character from the Private Use Area was one of the last ways you could actually distinguish a cooperative human from an LLM online.
They simply don't have those characters tokenized, so they can't output them. (But this is technically moot if the LLM has a python interpreter handy)
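For example, a model with a code tool can sidestep its vocabulary entirely with something like:

```python
# Emit Private Use Area characters directly, regardless of what the
# tokenizer covers. The PUA in the Basic Multilingual Plane spans
# U+E000 through U+F8FF.
print("\uE000")      # first PUA code point
print(chr(0xF8FF))   # last one (Apple famously uses it for its logo)
```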
When I wonder about the business behind Meta doing this, I see they have $70B in cash, so giving a bunch of AI experts hundreds of millions is pocket change.
Imagine that something fundamental shifts in the world of AI research. It could be anything: AI suddenly makes programmers much more productive, AI becomes very good at identifying vulnerabilities, AI chat becomes a new major source of entertainment, AI images become an item popularly shared on Instagram (etc)
Suppose any one of these things happened, and suddenly Facebook wished it had access to state-of-the-art models so that it could customize them for its own uses (internal developer tools, embedding in its apps).
Imagine how they would feel if the only way they could access these models were by signing 7-9 figure deals with a model dealer like OpenAI. Even worse, imagine if one of their main competitors in advertising started providing robust AI tools to help advertisers adapt their creatives to various form factors. Facebook is now way behind and possibly has to shell out millions to a company like OpenAI, all while losing ad market share worth billions per quarter (ads on Google start performing much better, so Google gets more ad spend).
If this worst-case scenario came to pass, Facebook would look foolish. If even one of these things were likely, their investments make sense. The rest (open source, making Meta a cool place to work) are a strategy credit.
“Commoditize your complement” may be a good way of framing it. Consider that if OpenAI succeeded dramatically and were the only game in town, they could extract huge rents from anyone using their service. So it's in the interest of other companies (or anyone who wants to use AI) that the AI ecosystem have lots of competition to keep prices low.
everyone that has responded so far has it wrong (naively so).
FB sells ad space on several apps. those apps need people on them in order for the ad space to be worth anything. people, in turn, need content to attract them to the apps. so it's simple: enable people/companies/whomever to generate tons of content for cheap and consequently share it on the apps. that's it.
Couldn't the same argument be made for all kinds of things companies have made open? Some examples:
• Tesla gave away its EV patents.
• Pixar and DreamWorks have both open-sourced some of their tools, including tools used to make some of their best works. For example DreamWorks' MoonRay renderer has been used on everything they have done since "How to Train Your Dragon: The Hidden World", including "Puss in Boots: The Last Wish" and "The Wild Robot", and will be used on their upcoming films.
Yes, it could. But my reply is to the person I directly responded to, who claimed these tools are for Meta product benefit but ignored that the same argument applies to competitors.
A better answer is that Meta releases them for some combination of seeing a benefit to the business and a desire to provide broad benefits to everyone. They certainly expend tremendous resources to create these models. No other company has provided this much value to such a large base of users in this space.
this is like saying that AMD making chips that intel/nvidia employees can buy and use to do their jobs is a bad strategy for AMD. lol. ok not every single strategic choice needs to both grow the top line and be anti-competitive. some can just grow the top line.
That’s a fun idea. I’ve always wondered about experimenting with u-nets and hourglass nets for text data since they’re so efficient at capturing global and local context (in vision, anyway). But I’ve never tried it.
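Just to make the shape of the idea concrete, here's the kind of block I picture (a non-causal toy with invented dims; a real text model would need causal pooling and a smarter upsample):

```python
import torch
import torch.nn as nn

class HourglassBlock(nn.Module):
    # Pool tokens into a shorter "global" sequence, process it with the
    # expensive layer, then upsample and merge back into the fine stream.
    # Assumes dim % 8 == 0 and seq divisible by `shorten`.
    def __init__(self, dim, shorten=4):
        super().__init__()
        self.down = nn.AvgPool1d(shorten)
        self.mid = nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True)
        self.up = nn.Upsample(scale_factor=shorten, mode="nearest")

    def forward(self, x):                                  # x: (batch, seq, dim)
        h = self.down(x.transpose(1, 2)).transpose(1, 2)   # (batch, seq/4, dim)
        h = self.mid(h)                                    # cheap global context
        h = self.up(h.transpose(1, 2)).transpose(1, 2)     # back to full length
        return x + h                                       # residual merge
```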
It lets those providing AI video generation services watermark all of their videos. So it isn't intended to be voluntary. You would be left with those services that don't comply with whatever the current Big Tech rules are, like people who used Grok/X.ai to generate images in support of Trump despite Grok/X.ai being inferior. https://arstechnica.com/information-technology/2024/08/musks...
I think this is the wrong / older article: when I click the link, this is Twitter's hosted Flux model making pictures of Kamala and Trump flying into the World Trade Center and Trump on a surfboard with busty cat girls. The X.ai one launched this week.
How much does it take to train a model at this point? I'd expect it to be in range of any major state or most oligarchs within the next couple of years (if it isn't already). So it's probably best if everybody understands the watermarking to be voluntary. Images and videos aren't worth the bits they're printed on at this point, as evidence of anything in particular.
Crazy stuff. Everyone’s covering how exciting all these are (especially LCM and the non-tokenizing-tokenizer), but I have to ask in case anyone’s been paying attention: why are they using the term “advanced machine intelligence”?
My initial thought is that they want to please/distract the doomers, but I’m prolly just self-centered!
It originates in Yann LeCun's paper from 2022 [1], with the term AMI being distinct from AGI. However, the A has changed over the past few years from autonomous to advanced and even augmented, depending on context.
I would guess it’s in response to the recent market studies showing that the general public views anything labeled “AI” as a likely scam and untrustworthy.
Even though Meta doesn't sell I/PaaS, Meta's fitness goes up when AI is in the hands of more players than just Google and OpenAI. Commoditize AI and you create a diverse set of businesses that will reach customers through Meta's platforms.
It's not hype when it delivers, and I'm also not seeing a ceiling yet.
Yet again, interesting progress.
Also, I like the idea of using the pose model to generate not an NPC but an avatar living in my phone, or in a glass cube as a hologram. That would be quite sci-fi futuristic.
Meta is a very large organization, and I'm willing to believe that a good chunk of Meta FAIR (the lab releasing all of this stuff) truly do care about innovations for advancing AI safety and are doing great work along these lines. I'm not disagreeing with your point about the company being led by its financial incentives as a unit, but let's also allow ourselves permission to celebrate this work by this group of people.
It is a shame that this is flagged for being denigrating or negative. A better comment would be to ask: where is the documentation for safety? How do we define it? Where are the disclosures about failures, negative results, etc.? Perhaps these things are unanswerable, but raising awareness of them is important.
Meta's "Video Seal": Because nothing says "trustworthy" like a digital chastity belt. Imperceptible, they claim, yet robust enough to survive the gauntlet of internet mangling - sounds like the perfect tool to invisibly track content, not just watermark it.
I think it's reasonable to assume that any large social media company is already tracking video similarity in reuploads/edits. The remix and reused-audio features are already baked in. Reverse image searching screen caps of TikTok/Reels pretty often returns the source/original.
I want to have a way to detect if content is AI generated. You might want to run that model on your own creations to ensure you get the credit for them and that no one can steal them.
Like all tools it can be used for good and evil. It could be installed directly in cameras to sign videos. And people with the power to turn it off could make AI fake videos that much more believable.
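As a toy illustration of why the "robust" part is the hard part (this is emphatically not Video Seal's method, which trains the embedder against simulated crops and compression), naive least-significant-bit watermarking is imperceptible but dies instantly:

```python
import numpy as np

# Hide one payload bit per pixel in the least-significant bit of a frame.
# Imperceptible, yes; robust, no: any lossy re-encode wipes it out.
rng = np.random.default_rng(0)
frame = rng.integers(0, 256, (64, 64), dtype=np.uint8)
payload = rng.integers(0, 2, (64, 64), dtype=np.uint8)

marked = (frame & 0xFE) | payload           # embed: overwrite the LSBs
assert np.array_equal(marked & 1, payload)  # detect: read the LSBs back

# Simulate mild "internet mangling": quantize like a lossy codec would.
mangled = (marked // 4) * 4
recovered = mangled & 1
print("bits surviving re-encode:", (recovered == payload).mean())  # ~0.5, i.e. chance
```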
I would make the argument that these AI safety initiatives yield messaging that muddles and confuses the public on the simple fact that they should not, under any circumstances, use a video or image as proof or assume its veracity. When I tell someone this, it is common for them to come back with something like "aren't they working on things to detect if a video is fake?"

I think this idea, that video content can still be trusted and that {COMPANY} is being responsible, is the real goal of the money pumped into these watermarking techniques. These techniques will not actually help people; images and video will continue to be used for disinformation.

The only thing that can stymie that is a broad cultural shift to default to distrust of photographs and video footage, to treat it all like you might a painting or animated cartoon depicting an event: maybe an accurate portrayal, but just as easily totally fabricated. The responsible thing for companies to do would be to spread messaging indicative of this fact, but they would rather engage in safety theater and score some points while keeping users dumb and easily fooled.
"they should not, under any circumstances, use a video or image as proof or assume its veracity"
This is just silly. Courts never assume the validity of evidence; it is actually assumed to be invalid unless it can be proved not to have been tampered with. Photos have been editable for over 100 years, but they are still used as evidence. The person who took the photo will sign an affidavit and/or testify in court that it is real. And AI videos are going to be easily detectable for a long time.
I'm talking about your average person, not the court system. I'm asserting that culturally we need to shift to acknowledging that photos are not proof, rather than pretending that some fancy counter-model or watermarking will somehow allow us to maintain an already-misplaced trust in the veracity of images.