I've seen two H2O demos including Sri's keynote at H2O World 2017, and they have mentioned a Python version of the R data.table package. However, I have been unable to find any additional info regarding this.
Is this available as a stand alone python module (or planning on becoming one)?
Your understanding is correct. H2O is currently developing Python datatable package, with the goal to make it as close to R's data.table as possible, both in syntax and in speed. The package will be released open source once it has a certain minimal number of features. Currently there is no set date for the release, but we hope to do it before the end of March.
UPDATE: datatable is now publicly open.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With