DataSys: Data-Intensive Distributed Systems LaboratoryData-Intensive Distributed Systems Laboratory

Illinois Institute of Technology
Department of Computer Science

CFP (TXT) | News | Topics | Dates | Submission | Organization | Keynote | Program

The Fourth International Workshop on Data Intensive Computing in the Clouds (DataCloud) 2013

Co-located with Supercomputing/SC 2013
Denver Colorado -- November 17th, 2013

Location: 507

Time: 9AM - 5:30PM

The workshop features two keynote talks by Dr. Robert Grossman and Dr. Milind BhandarkarThe workshop includes 7 presentations.

Opening remarks:  8:50-9:00

Session 1: Keynote by Dr.Robert Grossman

What is So Special About Science Clouds and Why Does It Matter? ppt

Time: 9:00-10:00

Session Chair: Ziming Zheng

Coffee Break 10:00-10:30

Session 2:   Big data and storage system

Time: 10:30-12:00

Session Chair: Yong Chen

·         Chen Jin, Md. Mostofa Ali Patwary, Ankit Agrawal, William Hendrix, Wei-keng Liao, Alok Choudhary, "DiSC: A Distributed Single-Linkage Hierarchical Clustering Algorithm using MapReduce". Northwestern University 

·         Jianwu Wang, Daniel Crawl, Ilkay Altintas, Kostas Tzoumas, Volker Markl,"Comparison of Distributed Data-Parallelization Patterns for Big Data Analysis: A Bioinformatics Case Study". San Diego Supercomputer Center, University of California, San Diego ppt

·          Lavanya Ramakrishnan, Pradeep K. Mantha, Yushu Yao, Richard S. Canon, "Evaluation of NoSQL and Array Databases for Scientific Applications". Lawrence Berkeley National Lab ppt  

 

Lunch Break 12:00-1:30

 

Session 3: Keynote by Dr. Milind Bhandarkar

Time:  1:30-2:30

Session Chair: Judy Qiu

 

Session 4:  Cloud Computing

Time: 2:30-3:30

Session Chair: Wei Tang

·         Esma Yildirim,"A Flexible GridFTP Client for Implementation of Intelligent Cloud Data Scheduling Services". Fatih University, Turkey ppt

Coffee Break 3:00-3:30

·         Seetharami Seelam and Paolo Dettori,"Towards Enabling Data Intensive Enterprise Applications in Cloud". IBM Research, New York ppt

Session 5: Programming models and tools

Time: 4:00-5:00

Session Chair: Lavanya Ramakrishnan

·         Hao Lin, Shuo Yang, and Samuel P. Midkiff,"A Parallel R Framework for Processing Large Dataset on Distributed Systems". Purdue University ppt

·         Ketan Maheshwari,Alex Rodriguez,David Kelly,Ravi Madduri,Justin M. Wozniak,Michael Wilde,Ian Foster, "Extending the Galaxy portal with parallel and distributed execution capability". Argonne National Laboratory ppt

Closing Remarks and Open Discussions: 5:00 - 5:30