That's still not capable of producing a central "work area" the angular size of a book at the reading distance with a reasonable angular resolution figure.
Speculating but what if this screen pixel density is not uniform but much more dense in the center and they use eye tracking to shift those screens mechanically similar like the shift camera sensor array for image stabilization?
>Saccades to an unexpected stimulus normally take about 200 milliseconds (ms) to initiate, and then last from about 20–200 ms, depending on their amplitude (20–30 ms is typical in language reading).
Are you sure? Seems like plenty of time - even current iphone camera since many generations have 240fps and most android even handle 720fps mode - current iphones should handle it probsbly even in software (except their camera has quite high latency last time when tested) but they handle it in hardware R1 chip anyway. Since they using probably something like truedepth IR pattern projector, with 720fps grayscale camera should be easy to have very fast motion detector just checking only regions around dots
Yeah definitely this would be the most tricky part, but someone mentioned here those screen are the size of post stamp - so if that's true they probsbly closer than lenses in standard correction glasses and the shift wouldn't have to be that big
If they alternate eyes rapidly, like active 3D glasses do, couldn't that double or nearly double the perceived resolution (if each eye is offset by 1/2 pixel from the other)?