Train a set of sklearn models, one per random partition of the data (with the partitioning computed in a distributed fashion), then combine those models by averaging and evaluate the result against an even larger dataset. How do you do that in SQL?
Sharding the table can help scale the problem across many machines, and as I mentioned earlier, you can use the PL/R or PL/Python language extensions to lift all sorts of ML functions into SQL functions.
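A minimal sketch of the per-partition training step using PL/Python, assuming PostgreSQL with the `plpython3u` extension installed and scikit-learn available to the server's Python. The table and column names (`samples`, `model_store`, `partition_id`, `x1`, `x2`, `label`) are hypothetical:

```sql
-- One training function; it reads a single partition and returns a
-- pickled scikit-learn model as bytea.
CREATE EXTENSION IF NOT EXISTS plpython3u;

CREATE OR REPLACE FUNCTION train_partition(part_id integer)
RETURNS bytea AS $$
    import pickle
    from sklearn.linear_model import SGDClassifier

    # plpy.execute returns the rows as a list of dicts
    rows = plpy.execute(
        "SELECT x1, x2, label FROM samples WHERE partition_id = %d" % part_id)
    X = [[r["x1"], r["x2"]] for r in rows]
    y = [r["label"] for r in rows]

    model = SGDClassifier(loss="log_loss").fit(X, y)
    return pickle.dumps(model)
$$ LANGUAGE plpython3u;

-- Kick off one training job per partition. On a sharded setup the
-- planner can push each call down to the node that owns that shard.
INSERT INTO model_store (partition_id, model)
SELECT p.partition_id, train_partition(p.partition_id)
FROM (SELECT DISTINCT partition_id FROM samples) AS p;
```

Storing the pickled model as `bytea` keeps everything inside the database, so the later averaging and evaluation steps can also be plain SQL queries over `model_store`.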
ML models: I already mentioned how to lift R and Python functions into SQL functions. Even if you are not using PostgreSQL, many other databases support lifting and interfacing with existing ML libraries through an FFI.
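For the "combine by averaging" part of the question, a minimal Python sketch under the assumption that each partition produced a linear model, so a model is just a coefficient vector plus an intercept and averaging is element-wise. The function names are illustrative, not from any library:

```python
# Model averaging for linear models: each partition yields a
# (coefficients, intercept) pair; the combined model is their mean.
# This only makes sense for models whose parameters live in a vector
# space (linear/logistic regression, SGD-trained linear classifiers).

def average_models(models):
    """models: list of (coef_list, intercept) pairs, one per partition."""
    n = len(models)
    dim = len(models[0][0])
    avg_coef = [sum(m[0][i] for m in models) / n for i in range(dim)]
    avg_intercept = sum(m[1] for m in models) / n
    return avg_coef, avg_intercept

def predict(coef, intercept, x):
    """Threshold the averaged linear score at zero for a 0/1 label."""
    score = sum(c * xi for c, xi in zip(coef, x)) + intercept
    return 1 if score >= 0 else 0

# Two toy per-partition models averaged into one:
coef, b = average_models([([1.0, 2.0], 0.5), ([3.0, 0.0], -0.5)])
# coef == [2.0, 1.0], b == 0.0
```

The same logic could run inside the database as another PL/Python function that unpickles each stored model, averages the parameter vectors, and evaluates the combined model with a `SELECT` over the larger evaluation table.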