DataSys: Data-Intensive Distributed Systems LaboratoryData-Intensive Distributed Systems Laboratory

Illinois Institute of Technology
Department of Computer Science

CFP (TXT) | News | Topics | Dates | Submission | Organization | Program | Sponsors

4th Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS) 2011

Co-located with Supercomputing/SC 2011
Seattle Washington -- November 14th, 2011
Grand Hyatt Theater -- 10AM - 5:30PM

Panel: Many-Task Computing meets Exascales

Grand Hyatt Theater
Monday, November 14th, 2011
11AM - 12PM

Slides   

Panel Abstract   

Exascale computers will enable the unraveling of significant scientific mysteries. There are many domains (e.g. weather modeling, national security, drug discovery) that will achieve revolutionary advancements due to exascale computing. Predictions are that supercomputers will reach exascales by 2019, and will be composed of millions of compute nodes aggregating billions of threads of execution. The exascales computing era will bring new fundamental challenges in how we build, manage, and program computing systems. Our decades-old approaches (e.g. parallel file systems, MPI) will have to be radically changed to support the coming wave of extreme-scale general purpose parallel computing.

Many-Task Computing (MTC), which aims to bridge the gap between high-performance computing (HPC) and high-throughput computing (HTC), could have some interesting properties making it a viable paradigm to address many of the HPC shortcomings at extreme scales (e.g. reliability, programmability) for a large class of applications. The DOE ASCR program have confirmed that “fulfilling the science potential of emerging multi-core computing systems and other novel 'extreme-scale' computing architectures, which will require significant modifications to today’s tools and techniques”. This panel will attempt to address how many-task computing can address “advanced hardware and software architectures for exascale computing systems, scientific data management and analysis at scale, and scalable and fault tolerant operating and runtime”. 

Panelists

Dr. Dan Reed

Corporate Vice President of Technology Policy and Strategy, Microsoft Research

Leader of the eXtreme Computing Group (XCG), Microsoft Research

Dr. Jack Dongarra

University Distinguished Professor of EECS, University of Tennessee

Distinguished Research Staff of CSM, Oak Ridge National Laboratory

Director, Innovative Computing Laboratory, University of Tennessee

Director, Center for Information Technology Research, University of Tennessee

Dr. Dan S. Katz

Director of Science, TeraGrid GIG

Area Co-director for Applications, Open Grid Forum

Senior Fellow in the Computation Institute, University of Chicago & Argonne National Lab.

Dr. David Abramson

Professor of CS, Monash University, Australia

Director, Monash eScience and Grid Engineering Lab, Monash University, Australia

ARC Professorial Fellow, Faculty of Information Technology, Monash University, Australia

Moderator

Dr. Ioan Raicu

Assistant Professor, Computer Science, Illinois Institute of Technology

Guest Research Faculty, Math and Computer Science, Argonne National Laboratory

Panelists Biographies

Dan ReedDan Reed is Microsoft’s Corporate Vice President for Technology Strategy and Policy and Extreme Computing. Previously, he was the Chancellor’s Eminent Professor at UNC Chapel Hill, as well as the Director of the Renaissance Computing Institute (RENCI) and the Chancellor’s Senior Advisor for Strategy and Innovation for UNC Chapel Hill. Dr. Reed has served as a member of the U.S. President’s Council of Advisors on Science and Technology (PCAST) and as a member of the President’s Information Technology Advisory Committee (PITAC). As chair of PITAC’s computational science subcommittee, he was lead author of the report “Computational Science: Ensuring America’s Competitiveness.” On PCAST, he co-chaired the Networking and Information Technology subcommittee (with George Scalise of the Semiconductor Industry Association) and co-authored a report on the National Coordination Office’s Networking and Information Technology Research and Development (NITRD) program called “Leadership Under Challenge: Information Technology R&D in Competitive World.” In June 2009, he completed two terms of service as chair of the board of directors of the Computing Research Association, which represents the research interests of Ph.D. granting university departments, industrial research groups and national laboratories. He was previously Head of the Department of Computer Science at the University of Illinois at Urbana-Champaign (UIUC), where the held the Edward William and Jane Marr Gutgsell Professorship. He has also been Director of the National Center for Supercomputing Applications (NCSA) at UIUC, where he also led National Computational Science Alliance, a fifty institution partnership devoted to creating the next generation of computational science tools. He was also one of the principal investigators and chief architect for the NSF TeraGrid. He received his B.S. from Missouri University of Science and Technology and his M.S. and Ph.D. in computer science in 1983 from Purdue University. He is a Fellow of the ACM, the IEEE and the AAAS.

Jack DongarraJack Dongarra received a Bachelor of Science in Mathematics from Chicago State University in 1972 and a Master of Science in Computer Science from the Illinois Institute of Technology in 1973. He received his Ph.D. in Applied Mathematics from the University of New Mexico in 1980. He worked at the Argonne National Laboratory until 1989, becoming a senior scientist. He now holds an appointment as University Distinguished Professor of Computer Science in the Computer Science Department at the University of Tennessee and holds the title of Distinguished Research Staff in the Computer Science and Mathematics Division at Oak Ridge National Laboratory (ORNL), Turing Fellow at Manchester University, and an Adjunct Professor in the Computer Science Department at Rice University. He is the director of the Innovative Computing Laboratory at the University of Tennessee. He is also the director of the Center for Information Technology Research at the University of Tennessee which coordinates and facilitates IT research efforts at the University. He specializes in numerical algorithms in linear algebra, parallel computing, the use of advanced-computer architectures, programming methodology, and tools for parallel computers. His research includes the development, testing and documentation of high quality mathematical software. He has contributed to the design and implementation of the following open source software packages and systems: EISPACK, LINPACK, the BLAS, LAPACK, ScaLAPACK, Netlib, PVM, MPI, NetSolve, Top500, ATLAS, and PAPI. He has published approximately 200 articles, papers, reports and technical memoranda and he is coauthor of several books. He was awarded the IEEE Sid Fernbach Award in 2004 for his contributions in the application of high performance computers using innovative approaches; in 2008 he was the recipient of the first IEEE Medal of Excellence in Scalable Computing; in 2010 he was the first recipient of the SIAM Special Interest Group on Supercomputing's award for Career Achievement; and in 2011 he was the recipient of the IEEE IPDPS 2011 Charles Babbage Award. He is a Fellow of the AAAS, ACM, IEEE, and SIAM and a member of the National Academy of Engineering.

Dan S. KatzDaniel S. Katz is the TeraGrid GIG Director of Science, Open Grid Forum Area Co-director for Applications, and a Senior Fellow in the Computation Institute (CI) at the University of Chicago and Argonne National Laboratory. He is also an affiliate faculty member at the Center for Computation and Technology (CCT), Louisiana State University (LSU), where he was previously Director for Cyberinfrastructure Development from 2006 to 2009, and Adjunct Associate Professor in the Department of Electrical and Computer Engineering at LSU. He was at JPL from 1996 to 2006, in a variety of roles, including: Principal Member of the Information Systems and Computer Science Staff, Supervisor of the Parallel Applications Technologies group, Area Program Manager of High End Computing in the Space Mission Information Technology Office, Applications Project Element Manager for the Remote Exploration and Experimentation (REE) Project, and Team Leader for MOD Tool (a tool for the integrated design of microwave and millimeter-wave instruments). From 1993 to 1996 he was employed by Cray Research (and later by Silicon Graphics) as a Computational Scientist on-site at JPL and Caltech, specializing in parallel implementation of computational electromagnetic algorithms. His research interests include: numerical methods, algorithms, and programming applied to supercomputing, parallel computing, cluster computing, distributed computing, and embedded computing; and fault-tolerant computing. He received his B.S., M.S., and Ph.D degrees in Electrical Engineering from Northwestern University, Evanston, Illinois, in 1988, 1990, and 1994, respectively. His work is documented in numerous book chapters, journal and conference publications, and NASA Tech Briefs. He is a senior member of the IEEE, chairs the steering committee for the IEEE Cluster conference series, designed and maintained (until 2001) the original website for the IEEE Antenna and Propagation Society, and serves on the IEEE Technical Committee on Parallel Processing's Executive Committee and the steering committee for IEEE Grid conference series.

David AbramsonProfessor David Abramson has been involved in computer architecture and high performance computing research since 1979. Previous to joining Monash University in 1997, he has held appointments at Griffith University, CSIRO, and RMIT. At CSIRO he was the program leader of the Division of Information Technology High Performance Computing Program, and was also an adjunct Associate Professor at RMIT in Melbourne. He served as a program manager and chief investigator in the Co-operative Research Centre for Intelligent Decisions Systems and the Co-operative Research Centre for Enterprise Distributed Systems. Abramson is currently an ARC Professorial Fellow; Professor of Computer Science in the Faculty of Information Technology at Monash University, Australia, and science director of the Monash e-Research Centre. He is a fellow of the Association for Computing Machinery (ACM) and the Academy of Science and Technological Engineering (ATSE), and a member of the IEEE. Abramson has served on committees for many  conferences and workshops, and has published over 200 papers and technical documents. He has given seminars and received awards around Australia and internationally and has received over $8 million in research funding. He also has a keen interest in R&D commercialization and consults for Axceleon Inc, who produce an industry strength version of Nimrod, and Guardsoft, a company focused on commercialising the Guard relative debugger. Abramson’s current interests are in high performance computer systems design and software engineering tools for programming parallel, distributed supercomputers  and stained glass windows.

Ioan RaicuDr. Ioan Raicu is an assistant professor in the Department of Computer Science (CS) at Illinois Institute of Technology (IIT), as well as a guest research faculty in the Math and Computer Science Division (MCS) at Argonne National Laboratory (ANL). He is also the founder and director of the Data-Intensive Distributed Systems Laboratory (DataSys) at IIT. He received the prestigious NSF CAREER award (2011 - 2015) for his innovative work on distributed file systems for exascale computing. He was a NSF/CRA Computation Innovation Fellow at Northwestern University in 2009 - 2010, and obtained his Ph.D. in Computer Science from University of Chicago under the guidance of Dr. Ian Foster in 2009. He is a 3-year award winner of the GSRP Fellowship from NASA Ames Research Center. His research work and interests are in the general area of distributed systems. His work focuses on a relatively new paradigm of Many-Task Computing (MTC), which aims to bridge the gap between two predominant paradigms from distributed systems, High-Throughput Computing (HTC) and High-Performance Computing (HPC). His work has focused on defining and exploring both the theory and practical aspects of realizing MTC across a wide range of large-scale distributed systems. He is particularly interested in resource management in large scale distributed systems with a focus on many-task computing, data intensive computing, cloud computing, grid computing, and many-core computing. His work has been funded by the NASA Ames Research Center, DOE Office of Advanced Scientific Computing Research, the NSF/CRA CIFellows program, and the NSF CAREER program. He is a member of the ACM and IEEE.