H6.4 - matchIT Hub for Spark - Running on a local cluster

Download and install Apache Spark from

Download and install Java from

Native Libraries

The native library files should be placed in a folder (e.g. /opt/matchithub.lib or /usr/local/lib64) that can be synced with all the processing nodes. Execute the following steps:

cd matchithub-spark/lib
sudo cp lib*.so /usr/local/lib64

Use rsync to sync the /usr/local/lib64 folder with all processing nodes in the cluster. Ensure that contains these lines:
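
As a rough sketch of the rsync step only, the loop below copies the library folder to each node in turn. The node names (worker1, worker2) are hypothetical placeholders, and it assumes passwordless SSH plus write access to /usr/local/lib64 on every node; adjust both to match your cluster. Setting DRY_RUN=1 prints the commands instead of running them.

```shell
#!/bin/sh
# Folder holding the native libraries (same path assumed on every node).
LIB_DIR="${LIB_DIR:-/usr/local/lib64}"

# sync_libs NODE... : push LIB_DIR to each named node with rsync.
# The node names passed in are placeholders chosen by the caller.
sync_libs() {
    for node in "$@"; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Dry run: show what would be executed.
            echo "rsync -av $LIB_DIR/ $node:$LIB_DIR/"
        else
            # -a preserves permissions and timestamps, -v lists copied files.
            rsync -av "$LIB_DIR/" "$node:$LIB_DIR/" || return 1
        fi
    done
}

# Example (hypothetical host names):
#   DRY_RUN=1 sync_libs worker1 worker2
```

Nothing is deleted on the target nodes because --delete is not passed, so existing files in /usr/local/lib64 are left alone.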


Running Jobs

Each of the sample application folders contains an example script showing a spark-submit command.
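
For orientation, a spark-submit invocation against a standalone cluster might look roughly like the sketch below. This is not the script shipped with the samples: the master URL, class name, and jar name are hypothetical placeholders, and pointing the driver and executors at /usr/local/lib64 is an assumption based on where the native libraries were copied above. Prefer the script in the sample folder where the two differ.

```shell
#!/bin/sh
# run CMD... : execute the command, or just print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$@"
    else
        "$@"
    fi
}

# submit_sample MAIN_CLASS APP_JAR
# MAIN_CLASS and APP_JAR are supplied by the caller; the defaults for
# SPARK_MASTER and LIB_DIR are illustrative assumptions.
submit_sample() {
    run spark-submit \
        --master "${SPARK_MASTER:-spark://localhost:7077}" \
        --driver-library-path "${LIB_DIR:-/usr/local/lib64}" \
        --conf "spark.executor.extraLibraryPath=${LIB_DIR:-/usr/local/lib64}" \
        --class "$1" \
        "$2"
}

# Example (hypothetical class and jar names):
#   DRY_RUN=1 submit_sample com.example.SampleApp sample-app.jar
```

Running with DRY_RUN=1 first is a cheap way to check the assembled command line before submitting a job to the cluster.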

