hadoop - How to fork actions in Oozie -
i have many sequence files , workflow of actions execute on each file. workflow same file , number of input file may vary. i'd execute workflow on bunch of input files (let's 10 files) in parallel using fork mechanism in oozie. if number of input files fixed, knew how many workflow should execute , write fork, may vary not know how should write fork. thoughts on that?
you can write java oozie client accept number of files parameter , have many number of workflows invoked parallely , return on success of workflow executions. otherwise, might have programmatically generate workflow desired number of fork branches.
Comments
Post a Comment