MF-Storm: a maximum flow-based job scheduler for stream processing engines on computational clusters to increase throughput. [PDF]
Muhammad A, Abdul Qadir M.
europepmc +1 more source
Enhancing performance of E-Government information systems with SSD-based Hadoop mapreduce. [PDF]
Ishengoma F.
europepmc +1 more source
A Distributed Stream Processing Middleware Framework for Real-Time Analysis of Heterogeneous Data on Big Data Platform: Case of Environmental Monitoring. [PDF]
Akanbi A, Masinde M.
europepmc +1 more source
Migration state among jobs in Apache Flink
Περίληψη: Καθημερινά, τόσο και περισσότερα δεδομένα παράγονται κάνοντας την επεξεργασία αυτών μείζονος σημασίας ουτοσώστε να εξάγουμε την χρήσιμη πληροφορία. Ειδικότερα η επεξεργασία ροών δεδομένων είναι ζωτικής σημασίας και απαιτεί υψηλών επιδόσεων υπολογιστικούς πόρους ώστε να μπορέσουμε να κάνουμε επερωτήσεις σε πραγματικό χρόνο σε αυτά.
Μπαικουσης Ιωαννης http://users.isc.tuc.gr/~ibaikousis +1 more
openaire +1 more source
Data pipeline for real-time energy consumption data management and prediction. [PDF]
Im J, Lee J, Lee S, Kwon HY.
europepmc +1 more source
When we talk about Big Data, What do we really mean? Toward a more precise definition of Big Data. [PDF]
Han X, Gstrein OJ, Andrikopoulos V.
europepmc +1 more source
Efficient Data Stream Sampling on Apache Flink
Sampling is considered to be a core component of data analysis making it possibleto provide a synopsis of possibly large amounts of data by maintainingonly subsets or multisubsets of it. In the context of data streaming, an emergingprocessing paradigm where data is assumed to be unbounded, samplingoffers great potential since it can establish a ...
openaire +1 more source
Process-Driven and Flow-Based Processing of Industrial Sensor Data. [PDF]
Kammerer K +4 more
europepmc +1 more source
Clustering Big Data Streams in Apache Flink
Summarization: We live in the era of Big Data where massive amounts of information are generated continuously from numerous types of sources. Today’s goal is to apply techniques that take into consideration the volume, the variety and the velocity of the data, in order to gain insight that couldn’t be revealed with traditional data processing ...
Μπιτσακης Θεοδωρος http://users.isc.tuc.gr/~tbitsakis +1 more
openaire +1 more source
Towards a Distributed Digital Twin Framework for Predictive Maintenance in Industrial Internet of Things (IIoT). [PDF]
Abdullahi I, Longo S, Samie M.
europepmc +1 more source

