Results 1 to 10 of about 14,382 (202)

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

open access: yesMathematics, 2022
Access plan recommendation is a query optimization approach that executes new queries using prior created query execution plans (QEPs). The query optimizer divides the query space into clusters in the mentioned method.
Elham Azhir   +3 more
doaj   +5 more sources

A distributed data processing scheme based on Hadoop for synchrotron radiation experiments [PDF]

open access: yesJournal of Synchrotron Radiation
With the development of synchrotron radiation sources and high-frame-rate detectors, the amount of experimental data collected at synchrotron radiation beamlines has increased exponentially.
Ding Zhang   +6 more
doaj   +2 more sources

Efficient Group K Nearest-Neighbor Spatial Query Processing in Apache Spark

open access: yesISPRS International Journal of Geo-Information, 2021
Aiming at the problem of spatial query processing in distributed computing systems, the design and implementation of new distributed spatial query algorithms is a current challenge.
Panagiotis Moutafis   +3 more
doaj   +1 more source

Evaluation of the Temporal Efficiency of Big Data Storage Formats in the Dynamics of Data Growth

open access: yesСовременные информационные технологии и IT-образование, 2021
When developing a data lake on platforms such as Apache Hadoop, the choice of data storage format becomes an important issue. This choice should be based on a number of different criteria, one of which is the time it takes to run different queries on ...
Vladimir Belov, Evgeny Nikulchev
doaj   +1 more source

Analysis of hadoop MapReduce scheduling in heterogeneous environment

open access: yesAin Shams Engineering Journal, 2021
Over the last decade, several advancements have happened in distributed and parallel computing. A lot of data is generated daily from various sources, and this speedy data proliferation led to the development of many more frameworks that are efficient to
Khushboo Kalia, Neeraj Gupta
doaj   +1 more source

Fedora Commons With Apache Hadoop: A Research Study

open access: yesCode4Lib Journal, 2013
The Digital Collections digital repository at the University of Maryland Libraries is growing and in need of a new backend storage system to replace the current filesystem storage.
Abdul Rasheed, Mohamed Mohideen
doaj   +1 more source

Evolution of the Hadoop Platform and Ecosystem for High Energy Physics [PDF]

open access: yesEPJ Web of Conferences, 2019
The interest in using scalable data processing solutions based on Apache Hadoop ecosystem is constantly growing in the High Energy Physics (HEP) community.
Baranowski Zbigniew   +9 more
doaj   +1 more source

Sandbox security model for Hadoop file system

open access: yesJournal of Big Data, 2020
Extensive usage of Internet based applications in day to day life has led to generation of huge amounts of data every minute. Apart from humans, data is generated by machines like sensors, satellite, CCTV etc.
Gousiya Begum   +2 more
doaj   +1 more source

Apache Hadoop Architecture, Applications, and Hadoop Distributed File System

open access: yesSemiconductor Science and Information Devices, 2022
The data and internet are highly growing which causes problems in management of the big-data. For these kinds of problems, there are many software frameworks used to increase the performance of the distributed system. This software is used for the availability of large data storage. One of the most beneficial software frameworks used to utilize data in 
Pratit Raj Giri, Gajendra Sharma
openaire   +1 more source

A Parallel Apriori Algorithm and FP- Growth Based on SPARK [PDF]

open access: yesITM Web of Conferences, 2021
Frequent Itemset Mining is an important data mining task in real-world applications. Distributed parallel Apriori and FP-Growth algorithm is the most important algorithm that works on data mining for finding the frequent itemsets.
Gupta Priyanka, Sawant Vinaya
doaj   +1 more source

Home - About - Disclaimer - Privacy