Publish Date: 10/26/2018   Journal Name: Springer / Journal of Big Data   Pages: 35
Chabok: a Map-Reduce based method to solve data warehouse problems

Abstract

Currently, immense quantities of data cannot be managed by traditional database management systems; instead, they must be handled by big data solutions built on shared-nothing architectures. Data warehouses are systems that manage very large amounts of information. The most prominent data warehouse model is the star schema, which consists of a fact table and a number of dimension tables. Executing queries on the data warehouse requires joining the fact table with the dimension tables. In a shared-nothing architecture, the required information does not all reside on a single node, so data must be retrieved from other nodes, which causes network congestion and slow query execution. To avoid this problem and achieve maximum parallelism, dimensions can be replicated across nodes if they are not too large. However, if a single dimension exceeds the capacity of a node, or if the combined volume of the dimensions exceeds node capacity, query execution faces serious problems. In big data problems the volume of data is immense, so replicating it cannot be considered an appropriate approach. In this paper, we propose a method called Chabok, which uses a two-phase Map-Reduce to solve the data warehouse problem. In this method, aggregation is performed entirely on the Mappers, and only intermediate results are sent to the Reducer. Chabok omits joins without needing data replication. The proposed method was implemented on Hadoop, and TPC-DS queries were executed for benchmarking. Chabok's query execution times outperformed those of prominent big data products for data warehousing.
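The abstract's central idea, performing aggregation entirely on the Mappers so that only small partial results travel to the Reducer, resembles the classic in-mapper combining pattern in Hadoop MapReduce. The sketch below illustrates that general pattern, not Chabok itself: the class names, the `dimensionKey,measure` input layout, and the sum aggregate are illustrative assumptions, since the abstract does not give these details.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class MapperSideAggregation {

    // Mapper: accumulates partial sums in memory (in-mapper combining),
    // so each mapper emits one record per key instead of one per input row.
    public static class AggMapper extends Mapper<LongWritable, Text, Text, LongWritable> {
        private final Map<String, Long> partialSums = new HashMap<>();

        @Override
        protected void map(LongWritable offset, Text line, Context ctx) {
            // Hypothetical input layout: one "dimensionKey,measure" pair per line.
            String[] fields = line.toString().split(",");
            if (fields.length == 2) {
                partialSums.merge(fields[0], Long.parseLong(fields[1].trim()), Long::sum);
            }
        }

        @Override
        protected void cleanup(Context ctx) throws IOException, InterruptedException {
            // Only the fully aggregated partial results leave the mapper.
            for (Map.Entry<String, Long> e : partialSums.entrySet()) {
                ctx.write(new Text(e.getKey()), new LongWritable(e.getValue()));
            }
        }
    }

    // Reducer: merges the (already small) per-mapper partial aggregates into final totals.
    public static class AggReducer extends Reducer<Text, LongWritable, Text, LongWritable> {
        @Override
        protected void reduce(Text key, Iterable<LongWritable> partials, Context ctx)
                throws IOException, InterruptedException {
            long total = 0;
            for (LongWritable p : partials) {
                total += p.get();
            }
            ctx.write(key, new LongWritable(total));
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "mapper-side aggregation sketch");
        job.setJarByClass(MapperSideAggregation.class);
        job.setMapperClass(AggMapper.class);
        job.setReducerClass(AggReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(LongWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Because each mapper pre-aggregates locally, it sends at most one record per distinct key rather than one per input row, which is what makes it feasible to push the bulk of the aggregation work to the map side and keep the shuffle traffic small.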


Authors: Mohammadhossein Barkhordari, Mahdi Niamanesh