A Dynamic Load Balancing Strategy for Parallel Datacube Computation.
Seigo Muto, Masaru Kitsuregawa:
A Dynamic Load Balancing Strategy for Parallel Datacube Computation.
DOLAP 1999: 67-72
@inproceedings{DBLP:conf/dolap/MutoK99,
author = {Seigo Muto and
Masaru Kitsuregawa},
title = {A Dynamic Load Balancing Strategy for Parallel Datacube Computation},
booktitle = {DOLAP '99, ACM Second International Workshop on Data Warehousing
and OLAP, November 6, 1999, Kansas City, Missouri, USA, Proceedings},
publisher = {ACM},
year = {1999},
pages = {67-72},
ee = {db/conf/dolap/MutoK99.html, http://doi.acm.org/10.1145/319757.319793},
crossref = {DBLP:conf/dolap/99},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
In recent years, OLAP technologies have become an important application area in the database industry.
In particular, the datacube operation proposed in [5] has received strong attention among researchers as a fundamental research topic in OLAP technologies.
The datacube operation requires computing aggregations over all possible combinations of the dimension attributes.
As the number of dimensions increases, datacubes become very expensive to compute, because the required computation cost grows exponentially with the number of dimensions.
Parallelization is a very important factor for fast datacube computation.
However, even a parallelized computation cannot achieve sufficient performance gain in the presence of data skew.
In this paper, we present a dynamic load balancing strategy that enables us to fully exploit the benefits of parallelizing datacube computation.
We perform experiments based on simulations and show that our strategy performs well.
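To make the cost argument in the abstract concrete, the following is a minimal sketch of a naive datacube computation: one group-by per subset of the dimension attributes, so d dimensions yield 2^d group-bys. It is not the algorithm or the load balancing strategy of the paper; the function name `datacube`, the sample rows, and the column names are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def datacube(rows, dims, measure):
    """Naive datacube: SUM(measure) for every subset of dims.

    Returns {group_by_dims: {key_tuple: total}}. This is an illustrative
    sketch, not the method described in the paper.
    """
    cube = {}
    for k in range(len(dims) + 1):
        # Enumerate every combination of k dimension attributes.
        for group in combinations(dims, k):
            totals = defaultdict(int)
            for row in rows:
                key = tuple(row[d] for d in group)
                totals[key] += row[measure]
            cube[group] = dict(totals)
    return cube


# Hypothetical fact table with three dimensions and one measure.
rows = [
    {"product": "pen", "region": "east", "year": 1998, "sales": 10},
    {"product": "pen", "region": "west", "year": 1999, "sales": 5},
    {"product": "ink", "region": "east", "year": 1999, "sales": 7},
]
cube = datacube(rows, ["product", "region", "year"], "sales")

print(len(cube))            # number of group-bys: 2**3 = 8
print(cube[()])             # grand total: {(): 22}
print(cube[("product",)])   # {('pen',): 15, ('ink',): 7}
```

Each added dimension doubles the number of group-bys, which is the exponential growth the abstract refers to. In a parallel setting, group-bys or partitions of unequal size are one plausible source of the imbalance that data skew causes.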
Copyright © 1999 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.
Printed Edition
DOLAP '99, ACM Second International Workshop on Data Warehousing and OLAP, November 6, 1999, Kansas City, Missouri, USA, Proceedings.
ACM 1999
References
- [1]
- Sameet Agarwal, Rakesh Agrawal, Prasad Deshpande, Ashish Gupta, Jeffrey F. Naughton, Raghu Ramakrishnan, Sunita Sarawagi:
On the Computation of Multidimensional Aggregates.
VLDB 1996: 506-521

- [2]
- Kevin S. Beyer, Raghu Ramakrishnan:
Bottom-Up Computation of Sparse and Iceberg CUBEs.
SIGMOD Conference 1999: 359-370

- [3]
- ...
- [4]
- David J. DeWitt, Jeffrey F. Naughton, Donovan A. Schneider, S. Seshadri:
Practical Skew Handling in Parallel Joins.
VLDB 1992: 27-40

- [5]
- Jim Gray, Adam Bosworth, Andrew Layman, Hamid Pirahesh:
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total.
ICDE 1996: 152-159

- [6]
- Sanjay Goil, Alok N. Choudhary:
High Performance OLAP and Data Mining on Parallel Computers.
Data Min. Knowl. Discov. 1(4): 391-417 (1997)

- [7]
- Kien A. Hua, Chiang Lee:
Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning.
VLDB 1991: 525-535

- [8]
- Venky Harinarayan, Anand Rajaraman, Jeffrey D. Ullman:
Implementing Data Cubes Efficiently.
SIGMOD Conference 1996: 205-216

- [9]
- Masaru Kitsuregawa, Yasushi Ogawa:
Bucket Spreading Parallel Hash: A New, Robust, Parallel Hash Join Method for Data Skew in the Super Database Computer (SDC).
VLDB 1990: 210-221

- [10]
- Kenneth A. Ross, Divesh Srivastava:
Fast Computation of Sparse Datacubes.
VLDB 1997: 116-125

- [11]
- ...
- [12]
- Ambuj Shatdal, Jeffrey F. Naughton:
Adaptive Parallel Aggregation Algorithms.
SIGMOD Conference 1995: 104-114

- [13]
- Amit Shukla, Prasad Deshpande, Jeffrey F. Naughton, Karthikeyan Ramasamy:
Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies.
VLDB 1996: 522-531

- [14]
- Christopher B. Walton, Alfred G. Dale, Roy M. Jenevein:
A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins.
VLDB 1991: 537-548

- [15]
- Yihong Zhao, Prasad Deshpande, Jeffrey F. Naughton:
An Array-Based Algorithm for Simultaneous Multidimensional Aggregates.
SIGMOD Conference 1997: 159-170
