> what can you expect from a language that uses "+", the quintessential notation for commutative operations, for the (very non-commutative) operation of concatenation?
Is this any worse than using `+` for floating-point addition, which isn't even associative?
Even if it weren't, floating-point numbers are usually regarded as approximations of real numbers, where addition is associative. String concatenation can't be construed as having even a semblance of commutativity.
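For what it's worth, the non-associativity is easy to demonstrate (Python shown here, but any language with IEEE-754 doubles gives the same result):

```python
>>> (0.1 + 0.2) + 0.3 == 0.1 + (0.2 + 0.3)
False
>>> (0.1 + 0.2) + 0.3, 0.1 + (0.2 + 0.3)
(0.6000000000000001, 0.6)
```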
So, Envoy will be embedding WebAssembly alongside (or instead of) Lua?
Are other projects moving from Lua (or other embedded scripting languages) to WebAssembly? What are the benefits of compiling an extension to WASM rather than writing it in Lua?
To me the big improvement over Lua won't be security, or necessarily even performance. It's the ability to target a WASM runtime with whatever language you like, and still get a safe environment.
Be it Rust, C, C++, or TypeScript; and I bet there will be a Lua interpreter in WASM at some point too.
It will be more flexible, and possibly more performant.
I think the future is still hazy for managed languages: the GC side of Wasm is still in the design phase, and it's not clear how widely it will be implemented. It's also been quiet for a long time; is the effort even alive? (https://github.com/WebAssembly/proposals/issues/16)
If the proposal ends up hooking into the JS engine's GC, it might never be implemented in headless Wasm runtimes at all, and so you wouldn't be able to use GC'd languages on those platforms, just C(++)/Rust. That's not very attractive compared to the easier/safer high-level languages most programmers are used to.
You’re right. I should have said AssemblyScript, not TypeScript.
Also, as someone familiar with Rust: it is a very high-level language. Its zero-overhead features make for a bit of a steep learning curve, but you rarely need to drop down to its low-level unsafe features.
That being said, incorporating it into a new WASM environment probably means doing a lot of low-level stuff.
You can use GC languages. One of them is Go. Its WASM support is pretty good, though direct DOM access or some well-tested wrappers would make it even better.
The obvious benefits (to me) are performance and security.
Performance: Other than the need for trampolines between the host and the wasm code, WebAssembly runs at essentially full speed. Lua is fast for a language built on dynamic types and dynamic dispatch - which is to say, quite slow.
Security: The reference Lua implementation was not designed for untrusted code; there have been various attacks where loading invalid bytecode could grant arbitrary execution. Using a format designed for executing untrusted code has real advantages there.
Isn’t the performance claim a bit of a misconception?
Lua with JIT is very fast even compared to statically typed, compiled languages.
The bottleneck doesn’t seem to be dynamic types or dispatch but rather relying on garbage collection. Another example of this would be Julia.
Admittedly WASM is typically targeted by languages like C and Rust, but I wouldn't put Lua in the same general performance category as, for example, Python or most other dynamically typed languages with dynamic dispatch.
The cost of interop in WASM is currently very high, as there's no way to pass non-numeric data to the host language without explicit conversion, which is very slow. WASI is trying to address this, but as far as I know it's very far from being used by WASM language implementations. Most WASM implementations, by the way, are completely experimental at this point... listing 30 languages as supporting WASM seems highly misleading to me, as someone who has played around with a few languages compiling to WASM. I would say that only C, C++, and Rust have decent support. Even the Go support is currently extremely experimental and limited when compiling to WASM, and that would be the next "best" option. Lack of GC is a huge problem here, and the current approach seems to be to allow runtimes to only optionally provide one.
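To make the interop cost concrete, here's a rough sketch (illustrative names only, not any particular engine's API) of what the host side has to do to receive a string today: the guest can only return numbers, so it hands back a (pointer, length) pair into its linear memory, and the host copies the bytes out and decodes them.

```python
# Illustrative sketch: 'memory' stands in for the module's linear memory as
# exposed to the host (engines surface it as some kind of byte buffer), and
# 'guest_make_greeting' stands in for a hypothetical exported Wasm function.

def read_guest_string(memory: bytearray, ptr: int, length: int) -> str:
    # Wasm exports can only return i32/i64/f32/f64, so strings cross the
    # boundary as (pointer, length) into linear memory; the host must copy
    # the bytes out and decode them itself.
    return bytes(memory[ptr:ptr + length]).decode("utf-8")

# ptr, length = guest_make_greeting()   # hypothetical export returning two i32s
# print(read_guest_string(memory, ptr, length))
```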
No one is using the Lua reference implementation when performance is required. I haven't used Lua(JIT) for years, but when I did, its performance approached that of compiled C in some cases. Although a tracing JIT does have its weaknesses as well.
I think all your points are more wrong than right, since you pick the weakest possible constructions to attack (the reference Lua implementation, invalid bytecode) and seem not to be aware of how good LuaJIT is performance-wise.
I think the first link is missing? But emulators are pretty unrepresentative for this use case and they're often written in low-level & unsafe languages.
The second is about benchmarking C, I suspect it doesn't give us a lot of information in this question either.
What is the state of the art for WASM VM scripting? Adding Lua to a C program requires very little effort. Is the same true of WASM? What is the best “drop in” WASM VM for C programming?
I've recently been working on WASM VM embedding with C (well, via libffi) for a personal project and uncovered a couple of WASM VM options, wasmtime & wasmer:
Not sure if Wasmer is aiming for compatibility with the official WASM C API currently.
From my experience, Wasmtime's C support is probably best described as "under-documented but functioning".
I haven't implemented anything with Wasmer.
My impression is that Wasmer may offer a higher level API than Wasmtime but not sure if it counts as "drop in" yet.
Part of the reason I ended up going with Wasmtime was... I kinda forgot Wasmer existed. :D But I do like that they're targeting what will hopefully become a standard API, though I think that results in a lower-level API than what is most "friendly" for starting out.
Been meaning to make this embedded Wasmtime C API example--created while figuring things out--public after tidying it up a bit more but... well, I just now made it public, as is (it at least has a ReadMe now :D ): https://gitlab.com/RancidBacon/wasm-embed-test :)
At least among myself and friends, we didn't care about that - we had to decide which system we wanted because our parents would only get one, and went for the bigger name/more familiar games.
(To my and my brothers' initial disappointment, my parents didn't understand why we wanted a PS2 and got us a Dreamcast instead, then refused to get a second game system. One of my brothers still has and plays that Dreamcast.)
But the PS4 never got a UHD Blu-ray player, even though some thought it was an obvious candidate for the 4K Pro model. Or perhaps we've gone the other way again and discs are dead, streaming is king.
You have to keep in mind that when the PS2 was released, DVD itself was also brand new. I remember a standalone DVD player by itself was ~$200 at the time of the PS2 launch. So part of the argument for the $299 price was that it was a bargain: you got the console and didn't have to buy a DVD player on top of it.
The Xbox required the IR remote controller to play a DVD (so you had to pay $300 plus that price), and the GameCube could not play DVDs at all (but it was $150 at release).
Isn't code generation during parsing still common today? In particular, bytecode generation in interpreters (and JIT compilers) for scripting languages, e.g. Lua?
It's sometimes a good idea to do it that way in practice, but it's still a conflation of two conceptually distinct processes. I think it is a bad approach when teaching compiler implementation, as it means you avoid the extremely core concept of an abstract syntax tree.
But it isn't a core concept if you do not need it. And an AST builder can be "injected" between the parser and the codegen at a later point in time, if needed. You do not even need to do it in one go: if your compiler has something like a `ParseExpression` function (assuming recursive descent parsing that spits out code as it parses), you can start by building a partial AST just for expressions and leave everything else (declarations, control structures, assignments - assuming those aren't part of an expression - etc.) as-is.
This is useful for both practical and teaching purposes: practical because it keeps things simple when the additional complexity isn't needed (e.g. scripting languages), and for teaching because the student learns both approaches (both of which are used on real-world problems) while also learning why one might be preferable to the other. And if you do the partial-AST bit, you introduce the idea of an AST gradually, building on the knowledge and experience the student has already acquired.
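To make the two styles concrete, here's a toy sketch (my own illustration, not taken from any particular compiler) of the "emit code as you parse" approach for a trivial `NUMBER ('+' NUMBER)*` grammar:

```python
# Recursive-descent parsing that emits a simple stack-machine bytecode
# directly, with no AST in between.

def parse_expression(tokens, emit):
    emit(("PUSH", int(next(tokens))))      # first operand
    for tok in tokens:
        if tok != "+":
            break
        emit(("PUSH", int(next(tokens))))  # next operand
        emit(("ADD",))                     # combine the top two stack slots

code = []
parse_expression(iter("1 + 2 + 3".split()), code.append)
print(code)  # [('PUSH', 1), ('PUSH', 2), ('ADD',), ('PUSH', 3), ('ADD',)]
```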
Yes, it is. It also doesn't really make much difference per se, as e.g. Wirth-style compilers still maintain careful separation of the code generation and parsing.
And if you want to/need to later, you can trivially introduce an AST in those compilers by replacing the calls to the code-generator with calls to a tree builder, and then write a visitor-style driver for the code generation.
Yes, it's work of course, but it's quite mechanical work that requires little thought.
Instead of calling the code emitter, you call an AST builder.
Then you build a tree walker that calls the code emitter.
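Continuing the toy sketch from above (again, purely illustrative): the same parser with the emit calls swapped for AST construction, plus a trivial tree walker that drives the emitter afterwards.

```python
# The parser now builds ('num', n) / ('add', left, right) tuples instead of
# emitting, and a separate walker performs the code generation.

def parse_expression(tokens):
    node = ("num", int(next(tokens)))
    for tok in tokens:
        if tok != "+":
            break
        node = ("add", node, ("num", int(next(tokens))))
    return node

def emit_code(node, emit):
    if node[0] == "num":
        emit(("PUSH", node[1]))
    else:                        # ("add", left, right)
        emit_code(node[1], emit)
        emit_code(node[2], emit)
        emit(("ADD",))

code = []
emit_code(parse_expression(iter("1 + 2 + 3".split())), code.append)
print(code)  # same bytecode as the direct-emit version
```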
The Wirth Oberon compiler, for one, was retrofitted with an AST by at least one student in Wirth's group as part of experiments with optimization passes.
If all we do is write Pythonic code (especially now that "Pythonic" seems to include type hints), what's the benefit of the highly dynamic CPython virtual machine?
Surely a faster VM, or even an ahead-of-time compiler, would be possible if we give up on some dynamism? Is that a direction the community should take?
(I think Guido's answer would be no, based on his apparent dislike of existing "Python compiler" projects such as Nuitka.)
The Wren scripting language supports this kind of "overloading by arity" [0].
Wren therefore allows overloads such as `range(stop)` and `range(start, stop)`. This is more intuitive than Python's `range(start=0, stop)`, which might be the only function in the language that has an optional parameter before a required one.
That's the thing - it's documented as overloaded, because that's the most intuitive explanation. Wren would allow it to actually be implemented as an overloaded function.
Python doesn't support overloading, and it doesn't support optional arguments before required ones, so the actual implementation in Python is a bit messy - something like:
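(A rough illustration of the idea; the real `range` is implemented in C, so this is not its actual code.)

```python
def my_range(*args):
    # Work out (start, stop, step) from however many arguments were passed.
    if len(args) == 1:
        start, stop, step = 0, args[0], 1
    elif len(args) == 2:
        start, stop = args
        step = 1
    elif len(args) == 3:
        start, stop, step = args
    else:
        raise TypeError(f"range expected 1 to 3 arguments, got {len(args)}")
    # ...then validate the types and build the sequence from the triple.
    return start, stop, step

print(my_range(5))        # (0, 5, 1)
print(my_range(1, 5))     # (1, 5, 1)
print(my_range(1, 5, 2))  # (1, 5, 2)
```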
There is no optional positional argument in the implementation. `range` is overloaded in the raw sense of the term: the implementation checks the number of arguments and their types and does the right thing.
It could have been implemented in pure Python as well by taking `*args, **kwargs`.
`range(stop)` and `range(1, stop)` are both supported, but without overloading, the implementation of `range` is messy as it has to work out the meaning of each argument manually.
Why is that a problem? I want the standard library to contain all messy stuff so my code doesn't have to.
From the call site there's no difference between Python's optional-first-argument range() function and a hypothetical overloaded one. Any perceived complexity in usage, therefore, can be fixed with better documentation.
Indeed, you said "Distances longer than a few kilometers are measured in miles", which made it sound almost like someone starts a journey in km and then once it gets past say 10km switches to miles.
If you're pointing out the inconsistency of the UK using metric units for e.g. weight and then not for travel distances, I agree, it's a bit of a schism.
"Distances longer than a few kilometers are measured in miles" was a direct reference to the OP's "yards to measure distances shorter than a few kilometers".
Exactly. And that's why the UK is especially stupid with units. At least in the US they are consistently idiotic with Fahrenheit and Miles and BTUs and so on. In the UK, they understand what a kilogram is, but measure weight in stones anyway. Fucking stones! And then this thing with yards and miles.
I don't see anything idiotic about Fahrenheit. With distances I can see why powers of ten make a difference, but we don't vary temperatures by orders of magnitude in regular life.
Nor do I spend much time around freezing or boiling water. And a Fahrenheit degree is 5/9 the size of a Celsius degree, so the scale has 9/5 the resolution.
Is the point that it's different than the rest of the world? I can see that point, but am I missing anything particularly bad about the Fahrenheit scale?
> Is the point that it's different than the rest of the world? I can see that point, but am I missing anything particularly bad about the Fahrenheit scale?
Mainly that it doesn't make any sense. Why was 32F made the magic number for the freezing point of water? The "well known" temperatures, like the freezing and boiling points of water, are based on observations made after the scale was invented. The secrets of the F scale died with Fahrenheit, and today nobody knows for sure what 0F actually means.
I wonder if they're keeping this fix in reserve as a backup plan for a rainy day? Say one day the UK's GDP drops harder than they'd like, so in order to burn some money and boost it back without making it look obvious, they announce the country has made up its mind and is switching to full and proper metric starting next year. Cue the economy going into overdrive, as everything and the kitchen sink has to be relabeled or replaced...
(And if that doesn't help for long, they can stimulate the economy further by changing the driving side to the right one.)
It’ll go the other way: the next time they need to leave something, they can have a referendum to leave the metric system, then go through a few governments to get it done.
Actually, I'm Indian, and we have our steering wheel on the right side, just like the Brits. It's one of the less fortunate things we picked up from them.
The UK went a little crazy sometime after the American colonies split.
I mean, the US screwed up their fluid ounce / weight ounce so that a US fluid oz of water doesn't quite weigh a US oz, but the UK redefined the hundredweight as 112 lbs to make it an even number of stones, and even though they kept their ounces correct, they redefined the pint as 20 ounces, so now there's nowhere in the world where a pint's a pound.
Good point. The biggest factor that determines an engineer's salary is location, followed by years of experience. Not whether they work on compilers/embedded/web/whatever.
If you are suggesting that kdb+ salaries are extremely high, like I often read here, I disagree. I did some research in this area and, despite the rumors, found lots of kdb+ jobs posted with very mediocre salaries. It seems unlikely that such jobs would have been filled if salaries were as high as people have suggested on HN.
For example, here's a senior kdb+ dev job in Manhattan (high cost of living) that pays $165,000-185,000/year. I make almost that much as a senior dev in a low-cost-of-living area. I'd expect to make at least $250,000/year in NYC or California.
So would you please give me an idea of what the market rate is for a senior dev working with kdb+ in NYC? Roughly? If it's well above $180,000, what is it? Thank you.
And I don't mean quants. I mean software developers.
And yes, it's true that every job category will have outliers (high and low), but after spending some time several years ago looking at jobs and speaking with recruiters, I'm skeptical. I've enjoyed learning about array languages, and had heard the rumors of high kdb+ salaries, which is why I was curious.
And I get that kdb+ is just a technology, and much more depends on individual qualifications. So let's say an experienced, highly qualified senior dev who has taught themselves kdb+, just for argument's sake.
Seems to me that MIR operates on a (much) lower level, basically abstracting away the physical machine and its finite register set. As such, it would replace the LLVM (or GCC) middle and back ends. The goal is much faster compilation without sacrificing more than ~20% of performance.
Dynamic types and garbage collection would then be implemented on/for the abstract MIR machine.
It's probably like how V8 works. Even though JS is dynamically typed, V8 will keep internal static type definitions around based on what it sees at runtime, and bail out internally to a slow path when the types don't match its expectations, rather than throwing type errors.
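Conceptually, something like this toy Python sketch (my own illustration of the inline-cache idea, not V8's actual machinery): each call site caches the type it last saw, runs a guarded fast path while the guess holds, and falls back to fully generic dispatch when it doesn't.

```python
class InlineCache:
    """Monomorphic cache for a single '+' call site."""
    def __init__(self):
        self.expected_type = None
        self.fast_op = None

    def add(self, a, b):
        t = type(a)
        if t is self.expected_type and type(b) is t:
            return self.fast_op(a, b)    # fast path: the guess still holds
        # Slow path ("bail out"): generic dispatch, then re-record the type.
        result = a + b
        self.expected_type = t
        self.fast_op = t.__add__
        return result

site = InlineCache()
print(site.add(1, 2))      # slow path, records int
print(site.add(3, 4))      # fast path
print(site.add("a", "b"))  # types changed: slow path again, re-records str
```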