Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

We already have this, it is R with tidyverse. What we need is a fully baked transpiler from R/tidyverse to sql.


Yep. Seriously. R w/tidyverse is a ridiculously powerful data wrangling tool especially when dealing with text files.

I tend use Notepad++ when starting out on a data-wrangling adventure. It has an uncanny ability, unlike any other editor, to open hundreds of files at the same time and to perform regex operations on all of them without dropping dead. I uses Notepad++ for initial manual exploration to get the lay of the problem, and then switch to R for the actual analysis.


The irony, of course, is that txr predates tidyverse.


>I tend use Notepad++

I assume, then, that your file sizes are not so big. N++ is not good with big (>25% of your ram) file sizes, refusing to open them.

Is R/tidyverse also limited on the size of the file it can handle? In my job i routinely work with up to 100GB files.


I guess it depends on what your definition of "big" is, I've never had to deal with 100GB files!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: