Distributed Singular Value Decomposition Method for Fast Data Processing in Recommendation Systems
Analyzing large amounts of user data to determine user preferences and, based on those data, recommending new products is an important problem.
Krzysztof Przystupa +6 more
doaj +1 more source
Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments [PDF]
The Hadoop MapReduce framework is an important distributed processing model for large-scale, data-intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous ...
Avishan Sharafi, Ali Rezaee
doaj
Large-Scale Encryption in the Hadoop Environment: Challenges and Solutions
Data is growing at an enormous rate in the present world. One of the finest and most popular technologies available for handling and processing that enormous amount of data is the Hadoop ecosystem.
Raj R. Parmar +4 more
doaj +1 more source
Data provenance is an effective approach for data security supervision. In the distributed, multi-user, and multi-layer big data system, only the provenance generation method, which leverages the information logged at both application and operating ...
Yuanzhao Gao +3 more
doaj +1 more source
Sentiment Analysis on Hadoop with Hadoop Streaming
People's ideas and opinions are influenced by the opinions of others. A lot of research is under way on analyzing the reviews people write. Sentiment analysis is the major computational technique for calculating or observing the sentiment in people’s thoughts.
Girdhar Gopal +2 more
openaire +1 more source
Observations on Factors Affecting Performance of MapReduce based Apriori on Hadoop Cluster
Designing fast and scalable algorithms for mining frequent itemsets has always been a prominent and promising problem in data mining. Apriori is one of the most widely used and popular algorithms for frequent itemset mining.
Garg, Rakhi +2 more
core +1 more source
Beyond Batch Processing: Towards Real-Time and Streaming Big Data
Today, big data are generated from many sources, and there is a huge demand for storing, managing, processing, and querying big data. The MapReduce model and its open-source implementation, Hadoop, have proven themselves as the de facto solution
Saeed Shahrivari
doaj +1 more source
Only Aggressive Elephants are Fast Elephants
Yellow elephants are slow. A major reason is that they consume their inputs entirely before responding to an elephant rider's orders. Some clever riders have trained their yellow elephants to only consume parts of the inputs before responding.
Dittrich, Jens +5 more
core +1 more source
Neural Network Models for Solar Irradiance Forecasting in Polluted Areas: A Comparative Study
A pollution-aware hybrid ensemble model is proposed to forecast solar irradiance across eight diverse cities. The model integrates MLP, RNN, and NARX to handle varying atmospheric pollution levels. It outperforms state-of-the-art methods, with enhanced accuracy and interpretability on a standard solar irradiance data set.
Mujtaba Ali +6 more
wiley +1 more source
Hadoop Cluster Deployment: A Methodological Approach
For a long time, data has been treated as a general problem because it just represents fractions of an event without any relevant purpose. However, the last decade has been just about information and how to get it.
Ronaldo Celso Messias Correia +5 more
doaj +1 more source

