.. _s-twister: ********************************************************************** Using Twister in FutureGrid ********************************************************************** .. sidebar:: Page Contents .. contents:: :local: What is Twister? ---------------- MapReduce programming model has simplified the implementations of many data parallel applications. The simplicity of the programming model and the quality of services provided by many implementations of MapReduce attract a lot of enthusiasm among parallel computing communities. From the years of experience in applying MapReduce programming model to various scientific applications, we identified a set of extensions to the programming model and improvements to its architecture that will expand the applicability of MapReduce to more classes of applications. `Twister `_ is a lightweight MapReduce runtime we have developed by incorporating these enhancements. `Twister `_ provides the following features to support MapReduce computations. (Twister is developed as part of Jaliya Ekanayake's Ph.D. research and is supported by the `Salsa `_ Team @ `IU `_) * Distinction on static and variable data * Configurable long running (cacheable) map/reduce tasks * Pub/sub messaging based communication/data transfers * Efficient support for Iterative MapReduce computations (much faster than `Hadoop `_ or `Dryad/DryadLINQ `_) * Combine phase to collect all reduce outputs * Data access via local disks * Lightweight (~5600 lines of Java code) * Support for typical MapReduce computations * Tools to manage data |image19| Iterative MapReduce programming model using Twister Running Twister on FutureGrid ----------------------------- Twister can be run in various modes within FG either in FutureGrid HPC or FutureGrid Cloud environment: - `Running Twister on FutureGrid HPC `_ - `Twister with FutureGrid Cloud OpenStack `_ Run Twister Applications ------------------------ We provide Kmeans and Blast run on Twister as examples. - `Twister Kmeans `_ - `Twister Blast `_ Papers and Presentations ------------------------ * Jaliya Ekanayake, Hui Li, Bingjing Zhang, Thilina Gunarathne, Seung-Hee Bae, Judy Qiu, Geoffrey Fox, `Twister: A Runtime for Iterative MapReduce `_," The First International Workshop on MapReduce and its Applications (MAPREDUCE'10) - HPDC2010 * Jaliya Ekanayake, (Advisor: Geoffrey Fox) `Architecture and Performance of Runtime Environments for Data Intensive Scalable Computing `_, Doctoral Showcase, SuperComputing2009. (`Presentation `_) * Jaliya Ekanayake, Atilla Soner Balkir, Thilina Gunarathne, Geoffrey Fox, Christophe Poulain, Nelson Araujo, Roger Barga, `DryadLINQ for Scientific Analyses `_, Fifth IEEE International Conference on e-Science (eScience2009), Oxford, UK. * Jaliya Ekanayake, Geoffrey Fox, `High Performance Parallel Computing with Clouds and Cloud Technologies `_, First International Conference on Cloud Computing (CloudComp09) Munich, Germany, 2009. * Geoffrey Fox, Seung-Hee Bae, Jaliya Ekanayake, Xiaohong Qiu, and Huapeng Yuan,`Parallel Data Mining from Multicore to Cloudy Grids ``_, High Performance Computing and Grids workshop, 2008. * Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox `MapReduce for Data Intensive Scientific Analysis `_, Fourth IEEE International Conference on eScience, 2008, pp.277-284. .. |Home| image:: /sites/all/themes/fgtheme/logo.png .. |image19| image:: http://www.iterativemapreduce.org/images/imrmodel.png .. |image32| image:: /sites/default/files/images/nsf-logo.png .. |image33| image:: /sites/default/files/u876/xsede-logo.png