This tutorial shows how to run a one-click Hadoop WordCount job on the Eucalyptus platform of FutureGrid.
1. FutureGrid HPC account: please apply via FutureGrid portal and request a HPC account. 2. FutureGrid Eucalyptus account: please see FutureGrid Eucalyptus Tutorial for detailed instructions. 3. FutureGrid Eucalyptus credentials zip file (euca2-[username]-x509.zip) stored under user’s home directory 4. Key pair created and added for use with Eucalyptus virtual machines
The following sections assume a user has created both an HPC account and a Eucalpytus account under the username of gaoxm.
$ ssh -i .ssh/id\_rsa\_fg
india.futuregrid.org
Enter passphrase for key '.ssh/id\_rsa\_fg':
Last login: Sat May 5 02:17:33 2012 from
c-71-194-153-252.hsd1.in.comcast.net
...
torque/2.5.5 version 2.5.5 loaded
moab version 5.4.0 loaded
euca2ools version 1.2 loaded
$ cd eucalyptus/
$ ls
cloud-cert.pem euca2-gaoxm-d108375b-pk.pem
eucarc hosts nodes
euca2-gaoxm-d108375b-cert.pem euca2-gaoxm-x509.zip
gaoxm.private jssecacerts tmp.out
$ wget
`http://mypage.iu.edu/~gao4/data/hadoopOneClick.zip <http://mypage.iu.edu/~gao4/data/hadoopOneClick.zip>`__
...
$ ls
hadoopOneClick.zip
$ unzip hadoopOneClick.zip
$ cd hadoopOneClick
[gaoxm@i136 hadoopOneClick]$ ls
deploy-hadoop.sh instanceIds.txt publicIps.txt
stop-hadoop.sh
hadoop-one-click.sh ipHosts.txt run-hadoop-wordcount.sh
terminate-instances.sh
hosts nodes.txt start-instances.sh
[gaoxm@i136 hadoopOneClick]$ chmod +x \*.sh
[gaoxm@i136 hadoopOneClick]$ ./hadoop-one-click.sh -n 2 -t m1.small
-i emi-D778156D -k gaoxm -p ~/eucalyptus/gaoxm.private -l
`http://mypage.iu.edu/~gao4/data/grexp10.txt <http://mypage.iu.edu/~gao4/data/grexp10.txt>`__
-s
http://salsahpc.indiana.edu/tutorial/apps/hadoop-0.20.203.0-for-EucaVm.tar.gz
This will run a MapReduce word-count job on a dynamically created virtual Hadoop cluster on FutureGrid. The user needs to replace the –k and –p parameter values with his/her key-pair name and private key path. For detailed usage information, try:
$ ./hadoop-one-click.sh -h
$ ls outputs_logs part-r-00000 _SUCCESS
$ vim outputs/part-r-00000
To run other MapReduce jobs, replace run-hadoop-wordcount.sh with new scripts, and change hadoop-one-click.sh to call the corresponding scripts.
This tutorial shows how to run a one-click Twister K-means job on the Eucalyptus platform of FutureGrid.
The following sections assume a user has created both HPC account and Eucalpytus account under the username of “gaoxm”.
$ ssh -i .ssh/id\_rsa\_fg
india.futuregrid.org
Enter passphrase for key '.ssh/id\_rsa\_fg':
Last login: Sat May 5 02:17:33 2012 from
c-71-194-153-252.hsd1.in.comcast.net
...
torque/2.5.5 version 2.5.5 loaded
moab version 5.4.0 loaded
euca2ools version 1.2 loaded
$ cd eucalyptus/
$ ls
cloud-cert.pem euca2-gaoxm-d108375b-pk.pem
eucarc hosts nodes
euca2-gaoxm-d108375b-cert.pem euca2-gaoxm-x509.zip
gaoxm.private jssecacerts tmp.out
$ wget http://mypage.iu.edu/~gao4/data/twisterOneClick.zip
...
$ ls
hadoopOneClick hadoopOneClick.zip twisterOneClick.zip
$ unzip twisterOneClick.zip
$ cd twisterOneClick
$ ls
deploy-twister.sh instanceIds.txt publicIps.txt
stop-twister.sh
hostnames.txt ipHosts.txt run-twister-kmeans.sh
terminate-instances.sh
hosts nodes.txt start-instances.sh
twister-one-click.sh
$ chmod +x \*.sh
$ ./twister-one-click.sh -n 2 -t m1.small
-i emi-D778156D -k gaoxm -p ~/eucalyptus/gaoxm.private
-l http://salsahpc.indiana.edu/tutorial/apps/Twister-0.9.tar.gz -a
http://www.iterativemapreduce.org/apache-activemq-5.4.2-bin.tar.gz
This will run a MapReduce K-means job on a dynamically created virtual Twister cluster on FutureGrid. The user needs to replace the “–k” and “–p” parameter values with his/her key-pair name and private key path. For detailed usage information, try:
$ ./twister-one-click.sh -h
Calling run\_kmeans.sh on 149.165.159.140...
JobID: kmeans-map-reduce9ec9eaa2-9731-11e1-80d7-156f25bd362a
May 6, 2012 4:11:57 AM
org.apache.activemq.transport.failover.FailoverTransport doReconnect
INFO: Successfully connected
to \ `tcp://master:61616 <https://master:61616/>`__
0 [main] INFO cgl.imr.client.TwisterDriver - Configure Mappers
through the partition file, please wait....
1975 [main] INFO cgl.imr.client.TwisterDriver - Configuring Mappers
through the partition file is completed.
250.77056136584878 , 125.15021341387315 , 249.21561041359857 ,
246.74715176402833 , 375.350251646343 , 249.17570173022511 ,
Total Time for kemeans : 6.808
Total loop count : 15
6260 [main] INFO cgl.imr.client.TwisterDriver - MapReduce
computation termintated gracefully.
------------------------------------------------------
Kmeans clustering took 6.841 seconds.
------------------------------------------------------
To run other iterative MapReduce jobs, replace run-twister-kmeans.sh with new scripts, and change twister-one-click.sh to call the corresponding scripts.
Virtual appliances are virtual machine images encapsulating pre-installed, pre-configured software that can be easily deployed on cloud resources. Users of FutureGrid can use public appliance images posted by other users, as well as contribute to the repository of images.
This page provides a summary of community-provided virtual appliances that are available for use on FutureGrid. If you have created a virtual appliance and you would like to advertise its availability and features with the community, feel free to edit this page and include information about your appliance.
Any FutureGrid user with Eucalyptus or Nimbus accounts can create and register an appliance. The approach is similar in both cases: you can upload an existing image into FutureGrid (e.g. from another Eucalyptus or Nimbus cloud, or an image you create on your own computer), or you can customize an instance interactively in FutureGrid and save it in a FutureGrid resource.
You can upload a “common” image so it becomes available to others in a FutureGrid cloud resource - to do this, add the flag –common to the cloud_client.sh command line when you transfer an image (ensure you are using cloud client version 020 or above). You can also save a Nimbus instance that you are using interactively (through ssh) as an image. Please refer to the manual for cloud_client.sh and the Nimbus tutorial for instructions.
Please refer to FutureGrid tutorial GA7.
Todo
errors in portal page reported, seems not to work
There are several appliances already available on FutureGrid. The table below summarizes a list of appliances, where they are available, and their image names. If you have an appliance that you would like to add to this list, please add it to the table.
Todo
no portal names bt real names
Appliance name | Appliance description | FutureGrid system(s) available | Appliance ID | Installed |
Grid Appliance | Self-configures Condor/MPI/Hadoop virtual clusters for training/education | alamo, india | grid-appliance-2.05.03.gz (alamo), emi-E4ED1880 (india) | panoat |
OpenStack Appliance | Deploys a single-node OpenStack compute virtual cloud | alamo | openstack-ubuntu-10.10-amd64.img | menghan |
ViNe Appliance | Deploys the ViNe virtual network overlay to connect private-address VMs in sierra and foxtrot | sierra, foxtrot | centos-5.5-x64-vine.gz | menghan |
Twister Appliance | Deploys a virtual private cluster running the Twister iterative MapReduce system | india | emi-F0B8194D | jemitche |