magpie

This project contains a number of scripts for running Big Data software in HPC environments. Thus far, Hadoop, Hbase, Pig, Spark, Storm, and Zookeeper are supported. It currently supports running over the schedulers/resource managers of Slurm and Moab. It currently supports running over the parallel file system Lustre and running over any generic network filesytem.

Basic Idea

The basic idea behind these scripts are to:

Allocate nodes on a cluster using your HPC scheduler/resource manager. Slurm and Moab are currently supported.
Scripts will setup configuration files so the rank 0 node is the "master". All compute nodes will have configuration files created that point to the node designated as the master server.

The configuration files will be populated with values for your filesystem choice and the hardware that exists in your cluster. Reasonable attempts are made to determine optimal values (they are almost certainly better than the default values).
Launch daemons on all nodes. The rank 0 node will run master daemons, such as the Hadoop Namenode or the Hbase Master. All remaining nodes will run appropriate slave daemons, such as the Hadoop Datanodes or Hbase RegionServers.
Now you have a mini big data cluster to do whatever you want.

Additional details can be found in the project README file

Name		Name	Last commit message	Last commit date
Latest commit History 411 Commits
bin		bin
conf		conf
doc		doc
examples		examples
patches		patches
script-msub-slurm		script-msub-slurm
script-sbatch		script-sbatch
script-templates		script-templates
scripts		scripts
COPYING		COPYING
DISCLAIMER		DISCLAIMER
NEWS		NEWS
README		README
README.md		README.md
TODO		TODO
magpie-check-inputs		magpie-check-inputs
magpie-common-exports		magpie-common-exports
magpie-common-functions		magpie-common-functions
magpie-post-run		magpie-post-run
magpie-pre-run		magpie-pre-run
magpie-run		magpie-run
magpie-run-hadoop-terasort		magpie-run-hadoop-terasort
magpie-run-hadoop-upgradehdfs		magpie-run-hadoop-upgradehdfs
magpie-run-hbase-performanceeval		magpie-run-hbase-performanceeval
magpie-run-magpie-testall		magpie-run-magpie-testall
magpie-run-pig-testpig		magpie-run-pig-testpig
magpie-run-spark-sparkpi		magpie-run-spark-sparkpi
magpie-run-storm-stormwordcount		magpie-run-storm-stormwordcount
magpie-run-zookeeper-zookeeperruok		magpie-run-zookeeper-zookeeperruok
magpie-setup-core		magpie-setup-core
magpie-setup-hadoop		magpie-setup-hadoop
magpie-setup-hbase		magpie-setup-hbase
magpie-setup-pig		magpie-setup-pig
magpie-setup-spark		magpie-setup-spark
magpie-setup-storm		magpie-setup-storm
magpie-setup-zookeeper		magpie-setup-zookeeper
magpie-submission-convert		magpie-submission-convert

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

magpie

Basic Idea

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

magpie

Basic Idea

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages