تاریخ امروز: 1405/4/13 (English)

Chabok: a Map-Reduce based met...

خانه گروه های پژوهشی جزئیات گروه پژوهشی مقالات ژورنال جزئیات Chabok: a Map-Reduce based met...

تاریخ انتشار : 1397/8/4 نام نشریه : Springer/ Journal of Big Data شمارهء صفخه در نشریه : 35

Chabok: a Map-Reduce based method to solve data warehouse problems

چکیده مقاله

Currently, immense quantities of data cannot be managed by traditional database management systems. Instead, they must be managed by big data solutions using shared nothing architectures. Data warehouse systems are systems that address very large amounts of information. The most prominent data warehouse model is star schema, which consists of a fact table and some number of dimension tables. It is necessary to join the facts and dimensions for query executions on the data warehouse. In shared nothing architecture, all of the required information is not placed on a single node so it is necessary to retrieve information from other nodes, which causes network congestion and low speeds of query execution. To avoid this problem and achieve maximum parallelism, dimensions can be replicated over nodes if they are not too large. However, if there are dimensions with data volumes greater than the capacity of a node or dimensions where the data volume summation exceeds node capacity, the query execution is confronted with serious problems. In big data problems, the amount of data is immense, and thus replicating immense data cannot be considered an appropriate method. In this paper, we propose a method called Chabok, which uses two-phased Map-Reduce to solve the data warehouse problem. In this method, aggregation is performed completely on Mappers, and intermediate results are sent to the Reducer. Chabok does not need data replication for join omission. The proposed method was implemented on Hadoop, and TPC-DS queries were executed for benchmarking. The query execution time on Chabok surpassed prominent big data products for data warehousing.

نویسندگان : محمدحسین برخورداری، مهدی نیامنش

جهاد دانشگاهی مولود مبارک انقلاب است
حضرت آیت الله خامنه ای / معرفی گوینده...

درباره پژوهشکده

اين پژوهشكده يكي از زيرمجموعه‌هاي جهاد دانشگاهي بوده كه هدف از تأسيس آن دستيابي به دانش فني و كاربردي در رشته‌هاي تخصصي ICT از طريق طرح‌هاي مطالعاتي و تحقيقاتي و تلاش در جهت بررسي، شناسايي و كمك به رفع نيازهاي تحقيقاتي بخش‌هاي توليدي، خدماتي و اجرايي در زمينه‌هاي مذكور است.
جزئیات بیشتر...

پیوندهای مفید

اطلاعات تماس

تهران، خیابان انقلاب، چهار راه کالج، کوچه سعیدی، پلاک 5
02188930150
02188930157
info@ictrc.ac.ir

No.5 Saeedi Alley, Hafez Junction, Enghelab Avenue, Tehran, IRAN
+982188930150
+982188930157
info@ictrc.ac.ir

شبکه های اجتماعی

تمای حقوق این وب سایت برای پژوهشکده فناوری اطلاعات جهاد دانشگاهی محفوظ است.

درباره ما | ساختار پژوهشکده | نقشه سایت | اهداف و چشم انداز |

Scroll