Glue

BigData Workflow Engine for Hadoop, Hbase, Netezza, Pig, Hive, Cascalog ...

Pig Module API

TOC

Class: PigModule

Jars are registered automatically and specified in the module configuration, this allows pig scripts to be free of jar register statements.

Its advised to write pig scripts in the workflow but externaly pig files can be called by replacing the script:String with the file name.

Exceptions are thrown if the script pig job fails

Method Description Example
run(jobName:String, script:String) run the string as a pig script ctx.pig.run('mypigjob', " a = load 'myfile.txt'; g = group a all; r = foreach g generate COUNT(\$1); dump r;")
run(script:String, localMode:boolean) run a script with the option of specifying if its local or remote