Experimental Plotting in H2O FLOW (limited support)

H2O FLOW comes with experimental plot option which can be used as below:

What you need is :

  • X – Column name
  • Y – Column name
  • Data: Data Frame name

Here is what experimental script look like:

plot (g) -> g(
   g.position "X_Column", "Y_Column"
 g.from inspect "data", getFrameData "DATA_SET_KEY"

You can launch the plot configuration as below:

plot inspect 'data', getFrame "<dataframe>"

Here is the screen shot:

Screen Shot 2017-06-19 at 4.50.31 PM

Here is the example script:

plot (g) -> g(
      g.position "FIRST_PAYMENT_DATE", "CHANNEL"
    g.from inspect "data", getFrameData "FM_Joined"


Note: This experimentation script only selects first 20 columns and 1000 rows and this setting can not be configured.

Thats it, enjoy!

Setting H2O FLOW directory path in Sparkling Water

Sometimes you may want to back up H2O FLOW files to some source code repo or to a backup location. For that reason you may want to change the default FLOW directory.

In H2O flag -flow_dir is used to set the local folder for FLOW files.

Note: You can always specify any H2O property by using system properties on Spark driver/executors.

So to change H2O FLOW directory to save you can append to your command line with the Sparkling Water commandline:

--conf spark.driver.extraJavaOptions="-Dai.h2o.flow_dir=/your/backup/location"


Thats it, thanks.