Results 81 to 90 of about 9,135 (230)
Security and Privacy Aspects in MapReduce on Clouds: A Survey [PDF]
MapReduce is a programming system for distributed processing large-scale data in an efficient and fault tolerant manner on a private, public, or hybrid cloud. MapReduce is extensively used daily around the world as an efficient distributed computation tool for a large class of problems, e.g., search, clustering, log analysis, different types of join ...
arxiv
Survey of research on confidential computing
This paper provides a comprehensive overview of the development process of confidential computing, summarizing its current research status and issues, which focuses on the security requirements for data security and privacy protection. Abstract As the global data strategy deepens and data elements accelerate integrating and flowing more rapidly, the ...
Dengguo Feng+5 more
wiley +1 more source
Experimenting sensitivity-based anonymization framework in apache spark
One of the biggest concerns of big data and analytics is privacy. We believe the forthcoming frameworks and theories will establish several solutions for the privacy protection.
Mohammed Al-Zobbi+2 more
doaj +1 more source
Enhancing MapReduce Fault Recovery Through Binocular Speculation [PDF]
MapReduce speculation plays an important role in finding potential task stragglers and failures. But a tacit dichotomy exists in MapReduce due to its inherent two-phase (map and reduce) management scheme in which map tasks and reduce tasks have distinctly different execution behaviors, yet reduce tasks are dependent on the results of map tasks.
arxiv
Spatial hotspot detection using polygon propagation
Spatial scan statistics is one of the most important models in order to detect high activity or hotspots in real world applications such as epidemiology, public health, astronomy and criminology applications on geographic data. Traditional scan statistic
Satya Katragadda+2 more
doaj +1 more source
WBANs are applied in E‐Healthcare systems, extract the physiological parameters through the implanted (in‐body) and/or wearable(on‐body) sensors applied to the human body, and route them to the remote server in a systematized and well‐grounded manner.
Bhawna Narwal+5 more
wiley +1 more source
Submodular Optimization in the MapReduce Model [PDF]
Submodular optimization has received significant attention in both practice and theory, as a wide array of problems in machine learning, auction theory, and combinatorial optimization have submodular structure. In practice, these problems often involve large amounts of data, and must be solved in a distributed way.
arxiv
Big data: modern approaches to storage and analysis
Big data challenged traditional storage and analysis systems in several new ways. In this paper we try to figure out how to overcome this challenges, why it's not possible to make it efficiently and describe three modern approaches to big data handling ...
Pavel Klemenkov, Sergey Kuznetsov
doaj +1 more source
ReStore: Reusing Results of MapReduce Jobs [PDF]
Analyzing large scale data has emerged as an important activity for many organizations in the past few years. This large scale data analysis is facilitated by the MapReduce programming and execution model and its implementations, most notably Hadoop. Users of MapReduce often have analysis tasks that are too complex to express as individual MapReduce ...
arxiv
A Conditional Lower Bound on Graph Connectivity in MapReduce [PDF]
MapReduce (and its open source implementation Hadoop) has become the de facto platform for processing large data sets. MapReduce offers a streamlined computational framework by interleaving sequential and parallel computation while hiding underlying system issues from the programmer.
arxiv