Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There is experimental support for distributed query execution with spill-to-disk between stages to support larger than memory datasets. This is implemented in the Ballista crate, which extends DataFusion.

https://github.com/apache/arrow-datafusion/tree/master/balli...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: