A Comparison of Big Data Frameworks on a Layered Dataflow Model [PDF]
In the world of Big Data analytics, there is a series of tools aimed at simplifying the programming of applications to be executed on clusters. Although each tool claims to provide better programming, data and execution models, for which only informal (and ...
Aldinucci, Marco +3 more
core +2 more sources
Resource Configuration Tuning for Stream Data Processing Systems via Bayesian Optimization
Stream data processing systems are becoming increasingly popular in the big data era. Systems such as Apache Flink typically provide a number (e.g., 30) of configuration parameters to flexibly specify the amount of resources (e.g., CPU cores and memory ...
Shixin Huang +6 more
doaj +1 more source
Multi-tenant Pub/Sub processing for real-time data streams [PDF]
Devices and sensors generate streams of data across a diversity of locations and protocols. That data usually reaches a central platform that is used to store and process the streams.
Carrera Pérez, David +1 more
core +1 more source
DPASF: a flink library for streaming data preprocessing
Background: Data preprocessing techniques are devoted to correcting or alleviating errors in data. Discretization and feature selection are two of the most widespread data preprocessing techniques.
Alejandro Alcalde-Barros +3 more
doaj +1 more source
Impairment and a substantial decline in the mobility, independence, and quality of life of an elderly person. In this regard, the current work suggests a novel IoT-based system that makes use of low-power wireless sensing networks, big data ...
Pravin Kulurkar +5 more
doaj +1 more source
Apache Mahout’s k-Means vs. fuzzy k-Means performance evaluation [PDF]
Barolli, Leonard +3 more
core +1 more source
Approximate Stream Analytics in Apache Flink and Apache Spark Streaming
Approximate computing aims for efficient execution of workflows where an approximate output is sufficient instead of the exact output. The idea behind approximate computing is to compute over a representative sample instead of the entire input dataset.
Do Le Quoc +5 more
openaire +2 more sources
FML-kNN: scalable machine learning on Big Data using k-nearest neighbor joins
Efficient management and analysis of large volumes of data is a demanding task of increasing scientific and industrial importance, as the ubiquitous generation of information governs more and more aspects of human life.
Georgios Chatzigeorgakidis +3 more
doaj +1 more source
The real-time analysis of Big Data streams is a terrific resource for transforming data into value. For this, Big Data technologies for smart processing of massive data streams are available, but the facilities they offer are often too raw to be ...
Ilaria Bartolini, Marco Patella
doaj +1 more source
DLCD-CCE: A Local Community Detection Algorithm for Complex IoT Networks [PDF]
Internet of Things (IoT) refers to the complex systems generated by the interconnections among widely available objects. Such interactions generate large networks, whose complexity needs to be addressed to provide suitable computationally efficient ...
Hu, Nan +5 more
core +1 more source

