> Important! While browsers can run input callbacks ahead of queued tasks, they cannot run input callbacks ahead of queued microtasks. And since promises and async functions run as microtasks, converting your sync code to promise-based code will not prevent it from blocking user input!
Wow, my initial reaction while reading was "just use async functions and then the code will naturally allow user input in the middle", good to know that that doesn't work.
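A quick way to see it (toy demo; the setTimeout task stands in for an input callback the browser wants to run):

setTimeout(() => console.log('queued task (stand-in for an input callback)'), 0);

(async () => {
  for (let i = 0; i < 3; i++) {
    console.log('chunk', i);
    await Promise.resolve();                      // microtask yield: the queued task still waits
    // await new Promise(r => setTimeout(r, 0));  // macrotask yield: the queued task could run here
  }
})();
// Logs all three chunks before the queued task ever runs.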
This is slowly turning into a rock in my shoe. You would think a single-threaded language would implement some sort of cooperative multitasking scheme. But promises aren't cooperative, especially in Bluebird (the queue management logic was converted to LIFO a long time ago, although, last I looked, that code still contained comments and variable names implying the opposite).
I’m tempted to call this the legacy of Brendan “design a language in a week” Eich, but that would let too many other people off the hook.
The difference between multitasking and cooperative multitasking is that you can yield the CPU in the middle of a long process. You can do that in Javascript but it involves combining multiple asynchrony APIs in complex ways. Ways you probably don’t want to invite your team to use frequently.
You cannot split a large calculation in the middle by chaining promises to allow even other promises to make progress, let alone event loop processing.
> The difference between multitasking and cooperative multitasking is that you can yield the CPU in the middle of a long process.
As long as "can yield" means "able to yield, and able to not yield". More clearly put, the difference between preemptable and cooperative multitasking is
I'm not sure what you're trying to demonstrate here. It's not the problem I'm talking about. For starters, you have no calculation; you're just running a couple of awaits and setTimeouts. Of course those are going to run in 1, 3, 2, 4 order.
The question is how would you make sure 4 happens before 2?
Here's a real world example, from Node: You need to make 3 service calls, A B & C, to build a page. Service A is the fastest call, but takes a lot of processing time. Service C is the slowest call, but requires a bit of data from Service B. Since A and B are unrelated, odds are good they're being invoked from completely separate parts of the code.
If you fire A and B, service C won't get called until service A's processing is complete. You could await both A and B, then call C before you start the processing, but you have to turn your code flow inside out to do that, so it only works for trivial applications.
Adding promise chaining to A.process() won't get B's promise to resolve before A's chain finishes resolving. setImmediate() might work in some places, and you might be able to come up with a code pattern that works for your team, but I don't believe it's guaranteed to work everywhere.
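The kind of pattern I mean looks roughly like this (a toy simulation with made-up names and timings, not real service code):

const sleep = ms => new Promise(r => setTimeout(r, ms));
const serviceA = () => sleep(10).then(() => 'a-data');          // fast call, heavy processing
const serviceB = () => sleep(50).then(() => ({ needed: 42 }));  // unrelated to A
const serviceC = x => sleep(200).then(() => `c(${x})`);         // slowest call, needs B's data

// Yield to the macrotask queue so B's continuation (and the call to C)
// can run between chunks of A's processing.
const yieldToEventLoop = () => new Promise(r => setTimeout(r, 0));

function heavyChunk() { for (let i = 0; i < 1e7; i++); }        // simulated CPU work

async function handleA() {
  const data = await serviceA();
  for (let i = 0; i < 10; i++) {
    heavyChunk();
    await yieldToEventLoop();  // without this line, C starts only after all of A's chunks
  }
}

async function handleBC() {
  const b = await serviceB();
  return serviceC(b.needed);   // can now start while A is still processing
}

handleA();
handleBC().then(console.log);

It works here, but it's exactly the kind of "combining multiple asynchrony APIs" I mentioned above, and I wouldn't count on every variation of it behaving the same across runtimes.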
My initial point was that your terms are muddled.
Your hypothetical could benefit from a preemptable (aka non-cooperative) scheduler, which can forcibly interrupt A, to allow C to start.
A cooperative scheduler (which is what JS has) is at the mercy of A to properly yield.
---
As for how to yield on the macrotask or microtask queue of your choice, both are equally easy to write:
// HTML5, Node.js
await new Promise(resolve => resolve());               // yield to the microtask queue
await new Promise(resolve => setTimeout(resolve, 0));  // yield to the macrotask queue
// Node.js
await new Promise(resolve => resolve());               // yield to the microtask queue
await new Promise(setImmediate);                       // yield to the check (macrotask) queue
You're correct that setTimeout and setImmediate are not guaranteed to work on all ES runtimes, because they are HTML5 and Node.js specific additions. (As is the entire concept of a separate macrotask queue, which you dislike so much.)
Yeah this is often misunderstood. Instead of thinking that promises make your code "run later" it's better to think about it as reorganizing still-synchronous code.
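A minimal illustration: the .then callbacks below run as microtasks, right after the current script and before the browser gets a chance to handle input, so the output order is:

console.log('sync start');
Promise.resolve()
  .then(() => console.log('then 1'))
  .then(() => console.log('then 2'));
setTimeout(() => console.log('task (input could be handled around here)'), 0);
console.log('sync end');
// Output: sync start, sync end, then 1, then 2, task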
Article author here. Yep, not hiding that fact (I could have easily used a trace with minified code, but I didn't to point this out).
Two things though:
1. I used to work on Google Analytics, and I've created a lot of open source libraries around Google Analytics, which I use on my own site because I like to test my own libraries (and feel any pain they may be causing). The way most people use Google Analytics does not block for nearly this long.
2. I've updated my Google Analytics libraries to take advantage of this strategy [1], and I'm working with some of my old teams internally to see if they can bake it into GA's core analytics.js library, because I strongly believe that analytics code should never degrade the user experience.
> Yep, not hiding that fact (I could have easily used a trace with minified code, but I didn't to point this out).
Kudos to you for your honesty here! I was a bit confused by your question "So what’s taking so long to run?" when it seemed pretty clear what was taking so long to run. If the goal were simply "speed up the page load/FID", removing browser analytics (in favor of server-side analytics, for example) would seem to be at least an _option_ to immediately achieve that end.
Right, when I said "what's taking so long to run?", in my mind I was thinking there'd be one obviously slow thing that I could just remove or refactor, but it turned out that it wasn't any one single slow function/API causing the problem.
And yes, clearly removing the analytics code would have also solved the problem for me, and in many cases, removing code is the best solution.
In this particular case I couldn't remove any code because I was refactoring an open source library that a lot of people use. I wanted to try to make it better for input responsiveness in general, so people who use the library (and maybe don't know much about performance) will benefit for free.
Also, I wanted to help educate people about how tasks run on the browser's main thread, and how certain coding styles can lead to higher than expected input latency.
It's likely not "blocked" by analytics since pretty much all analytics libraries get loaded async.
However, scripts loaded before the `load` event do delay the load event, and analytics scripts are typically loaded with the lowest priority, so they're usually last and thus the ones you notice in the bottom-left corner of your window.
But the only way they'd be "blocking" anything is if the site was waiting for the load event to initialize any critical functionality (which it shouldn't be).
Shouldn't, but this is absolutely common in the wild. Another one I get often (which makes me need to disable my ad blocker) is UI actions that first log the action in X analytics library and then execute the function. When X library is blocked, the code path never reaches the function. Off the top of my head, flight/hotel reservation websites are particularly bad about this.
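The pattern usually looks something like this (names are invented; the point is that the real action is gated on the tracking call):

// Fragile: if the tracking library was blocked, thirdPartyAnalytics is
// undefined, the tracking call throws, and submitBooking() never runs.
bookButton.addEventListener('click', () => {
  thirdPartyAnalytics.track('booking-click');
  submitBooking();
});

// Safer: never let a tracking failure block the real action.
bookButton.addEventListener('click', () => {
  try { thirdPartyAnalytics.track('booking-click'); } catch (e) { /* ignore */ }
  submitBooking();
});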
I'm not sure why you trust Google Web "Fundamentals" that 0-100ms is perceived as instant.
The upper bound is perceptibly laggy. I have no idea where Google took their numbers from.
FID of 100 ms is already bad as you have to add network latencies on top of it.
To put things into perspective, it's more than the time it takes to fully boot embedded Linux with coreboot from not-too-fast flash, or to start up a Commodore 64 with a good extension cartridge.
> I'm not sure why you trust Google Web "Fundamentals" that 0-100ms is perceived as instant.
Those numbers go back to the studies Jakob Nielsen did at the Sun Microsystems usability lab and the results posted in his oft cited AlertBox articles from 1993 and 1997...
> Those numbers go back to the studies Jakob Nielsen did at the Sun Microsystems usability lab and the results posted in his oft cited AlertBox articles from 1993 and 1997...
Actually, Nielsen indicates that had already been a consistent finding for ~30 years at that point, citing “Response time in man-computer conversational transactions” (Miller, 1968) [0] as the original source.
Getting offtopic here but arguably the best thing about the growing prevalence of VR is that it's forcing everyone to focus on maximum latency rather than frame throughput, and so that godawful stutter is finally being exorcised from our rendering stacks.
100ms is slightly arbitrary, but the real point of saying "100ms" is that it's lower than the multiple seconds typically required for page load, and higher than the 16ms required for smooth animation. An appropriate target for input response time (the "R" in RAIL) is somewhere in between those two.
If 100ms strikes you as too high, then by all means target a lower number like 50ms. But it's still not a disaster if your 95th percentile is 100ms. Also, if your A time is above 16ms or your L time is above 5 seconds, your limited development time might be better spent improving those rather than bringing R down even further.
Sure, it's incremental, but static site layouts, especially the old float-based ones, will have had their headers and sidebars loaded from the start, and the main content would not jump around.
Modern pages, with ads and widgets popping in potentially anywhere, remain unusable and unreadable because the main content keeps jumping around.
> ...a team of neuroscientists from MIT has found that the human brain can process entire images that the eye sees for as little as 13 milliseconds...That speed is far faster than the 100 milliseconds suggested by previous studies...
This is easy to internalize when you remember that humans can tell the difference between 30fps and 60fps, which is one frame every ~33ms vs. one every ~17ms.
You can tell the difference between 144Hz and 60Hz, especially if user input is involved (e.g. using the mouse to look around). The temporal resolution of human perception depends heavily on the exact scenario, as the body is complex. Even within the eye you have high receptor density in the center of vision and low density outside it, with different light sensitivities, which makes it hard to model. To further complicate matters, the brain does additional processing that can result in hyperacuity.
A good example of hyperacuity is in reading Vernier scales where you can see differences much below the angular resolution of the eye.
Humans can tell the difference between 1000fps and 2000fps too (with a test signal of a point light source flickering at >500Hz, and viewed with rapid eye movement to produce the phantom array effect. Note that temporal anti-aliasing is needed to avoid cheating with easily visible beat patterns.) This doesn't mean we can process an image in 0.5ms.
Reading this article and applying a similar technique to a different webpage could be a good exercise for advanced students in front end development. The core idea is put forward, and no implementation detail is left out; great article.
JavaScript is a minefield of unexpected behavior when you try things like this. Besides the issue with this-binding already mentioned in the thread, there are other examples of weird stuff that happens when you don't introduce an apparently redundant lambda:
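For example, the classic case (exact values here are just for illustration):

['10', '10', '10'].map(parseInt);          // [10, NaN, 2] -- map's index argument becomes the radix
['10', '10', '10'].map(x => parseInt(x));  // [10, 10, 10]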
Ok, this was a 'wat' moment for me until I realized that map passes up to 3 args to the function, including the index, and parseInt accepts a radix parameter. Still kinda weird/unintuitive; I could see myself not paying attention and reducing to the first example accidentally.
Huh, I'm the opposite. I'd gladly take a confusing combination of consistent spec'd behavior over time travel, arbitrary code execution, and nasal demons.
The 2nd style will not work if those functions internally use `this` and were not previously manually bound to their parent objects, like `drawer.init.bind(drawer)`.
That's due to JavaScript's funky notion of object methods: if you just pass the function reference itself, invoking it will execute it with `this` set to undefined.
Put differently, a property access `x = foo.bar` followed by `x()` is not the same as `foo.bar()`.
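A minimal sketch of what goes wrong (drawer here is a stand-in object, not the article's actual code):

const drawer = {
  label: 'main drawer',
  init() { console.log('initializing', this.label); }
};

drawer.init();                            // "initializing main drawer"

const init = drawer.init;
init();                                   // `this` is no longer drawer: logs undefined (or throws in strict mode)

setTimeout(drawer.init, 0);               // same problem: the bare method reference loses its `this`
setTimeout(() => drawer.init(), 0);       // ok: still invoked as a method
setTimeout(drawer.init.bind(drawer), 0);  // ok: explicitly bound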
Looking at the blog post, I'd say it was for drawer.init(), contentLoader.init(), breakpoints.init(), alerts.init(), and analytics.init(). The site works fine without javascript, so it doesn't need it - perhaps the author thinks the 50k is worth the drawer, content loading, breakpoint?, and alert features and also wants a bit of user analytics tracking.
Even so, you could backload the drawer etc., placing them at the end of the document and attaching to the potentially already-rendered page.
Reorder so that your main content loads first.
Author did part of it by deferring the analytics init.
Not sure why they used setTimeout for the initialization, though, instead of requestIdleCallback like everything else.
The page is supposed to work without JS after all.
"I mentioned above that requestIdleCallback() doesn’t come with any guarantees that the callback will ever run."
Not true - there's a timeout argument. It guarantees that the callback will be run by then.
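Something like this (the callback body is a placeholder):

// If no idle period occurs within 2000 ms, the callback is queued in a regular
// task and invoked anyway, with deadline.didTimeout === true.
requestIdleCallback((deadline) => {
  console.log(deadline.didTimeout, deadline.timeRemaining());
  doDeferrableWork();  // placeholder for whatever work you deferred
}, { timeout: 2000 });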
Thank you. That'd be like someone criticizing a coding example for a new language saying "What's the point of using OOP for implementing Tic-Tac-Toe? It's too simple to necessitate it."
I'm not criticizing the technique from the article. It's useful. I'm just expressing my confusion that the blog, simple as it looks, needs this JavaScript (and analytics are in a separate file from the one I'm referencing, it appears).
That 56K number isn't gzipped, gzipped it's only 18K (plus there's also some inline JS, some webpack boilerplate, and then analytics.js).
The reason for its size is my site is my playground. It's where I get to experiment with all the things I want to experiment with.
I also work on quite a few open source projects, which I usually test on my site before releasing them publicly just to make sure they work in production without errors.
^ exactly; blogs are mostly static. You can improve the responsiveness of e.g. a comments section with a bit of JS, but that's very low priority and doesn't need to be much.
I poked around a little and it seemed like the majority of it is related to analytics events (googleanalytics/autotrack)
I suppose the author just wants to know a little about how their blog and writing is performing in the wild
Sort of off topic, but it would be interesting if the browser handled these common cases instead and gave the user a way to opt-in/out. I suppose it sort of does by broadcasting those events to the js listeners in the first place.
A few months ago, I used a similar optimisation that does the same idle-awaiting, but with server requests instead of CPU usage.
So instead of submitting all ajax to the server directly, background tasks can be delayed until the important tasks have finished. This is useful especially when the 6 requests per origin limit gets hit often.
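A rough sketch of that idea (not the actual code, all names invented): count in-flight important requests and hold background ones until the count drops to zero.

let importantInFlight = 0;
const backgroundQueue = [];

async function importantFetch(url, opts) {
  importantInFlight++;
  try {
    return await fetch(url, opts);
  } finally {
    importantInFlight--;
    if (importantInFlight === 0) flushBackground();
  }
}

function backgroundFetch(url, opts) {
  return new Promise((resolve, reject) => {
    backgroundQueue.push(() => fetch(url, opts).then(resolve, reject));
    if (importantInFlight === 0) flushBackground();
  });
}

function flushBackground() {
  while (backgroundQueue.length) backgroundQueue.shift()();
}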
Not being a JavaScript guy, my eyes sort of glazed over after the flame graphs. Do I understand the gist of it right that a 200+ millisecond delay is normal if you just have 56KB of light blog page style JavaScript code whose loading you do not somehow optimize? Or is there something pathological in play here?
The article discusses an example where the code loads Intl.DateTimeFormat, which takes some time but is not immediately used.
So if the 56KB of code doesn't do a lot of loading of expensive components, then it may not need further optimization, although it may still have the problem of blocking user input.
The main moral of the story is that you can't assume performance based on code size; you have to measure.
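For the Intl.DateTimeFormat case, one way to avoid paying that construction cost up front is to create the formatter lazily (not necessarily what the article does; the locale and options below are just examples):

let dateFormatter;
function formatPostDate(date) {
  dateFormatter = dateFormatter ||
      new Intl.DateTimeFormat('en-US', { year: 'numeric', month: 'long', day: 'numeric' });
  return dateFormatter.format(date);
}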
If the measurement here is how long it takes to respond to the first user input, why does a 233ms main function matter? As a user, how am I expected to scan the page, locate a link, and click on it within that 233ms?
Imagine that his site was linked somewhere else. In that scenario it takes 233ms from click to display. That's why this matters, because users aren't opening a browser to that one page. They're clicking around on different pages, and if each one takes 233ms that's a very slow process.
I don't think the author is suggesting that people should put this much effort into optimizing their blog. It's a toy example that's simple enough to explain concepts that can then be applied to bigger and more complex webapps where "don't use JavaScript" isn't an option, like with the Redux example mentioned further down in the article.