Results 21 to 30 of about 2,924 (186)

In-transit molecular dynamics analysis with Apache flink [PDF]

open access: yesProceedings of the Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, 2018
In this paper, an on-line parallel analytics framework is proposed to process and store in transit all the data being generated by a Molecular Dynamics (MD) simulation run using staging nodes in the same cluster executing the simulation. The implementation and deployment of such a parallel workflow with standard HPC tools, managing problems such as ...
Zanúz, Henrique   +3 more
openaire   +2 more sources

Big Data Analytics, Processing Models, Taxonomy of Tools, V’s, and Challenges: State‐of‐Art Review and Future Implications

open access: yesWireless Communications and Mobile Computing, Volume 2023, Issue 1, 2023., 2023
In the current digital era, data is budding tremendously from various sources like banks, businesses, education, entertainment, etc. Due to its significant consequence, it became a prominent proceeding for numerous research areas like the semantic web, machine learning, computational intelligence, and data mining.
Sandeep Dasari   +2 more
wiley   +1 more source

A Performance Analysis of Fault Recovery in Stream Processing Frameworks

open access: yesIEEE Access, 2021
Distributed stream processing frameworks have gained widespread adoption in the last decade because they abstract away the complexity of parallel processing. One of their key features is built-in fault tolerance.
Giselle van Dongen, Dirk Van Den Poel
doaj   +1 more source

Data mining in predictive maintenance systems: A taxonomy and systematic review

open access: yesWIREs Data Mining and Knowledge Discovery, Volume 12, Issue 5, September/October 2022., 2022
Predictive Maintenance from a Data Mining perspective: this review analyzes the most significant predictive maintenance (PdM) contributions in recent years from Data Mining (DM) perspective. An exhaustive study is carried out to determine the most used DM techniques for solving each specific PdM problem.
Aurora Esteban   +2 more
wiley   +1 more source

Towards autoscaling of Apache Flink jobs [PDF]

open access: yesActa Universitatis Sapientiae, Informatica, 2021
Abstract Data stream processing has been gaining attention in the past decade. Apache Flink is an open-source distributed stream processing engine that is able to process a large amount of data in real time with low latency. Computations are distributed among a cluster of nodes.
Varga Balázs   +2 more
openaire   +2 more sources

Machine learning‐based prognostic and metastasis models of kidney cancer

open access: yesCancer Innovation, Volume 1, Issue 2, Page 124-134, August 2022., 2022
We used the data of 12,394 kidney cancer patients in the SEER (surveillance, epidemiology, and final results) database to construct a research cohort, combine with statistical relevance and clinical experience to screen for factors related to kidney cancer survival and prognosis.
Yuxiang Zhang   +13 more
wiley   +1 more source

A NOVEL TRUE REAL-TIME SPATIOTEMPORAL DATA STREAM PROCESSING FRAMEWORK

open access: yesJordanian Journal of Computers and Information Technology, 2022
The ability to interpret spatiotemporal data streams in real-time is critical for a range of systems. However, processing vast amounts of spatiotemporal data out of several sources, such as online traffic, social platforms, sensor networks, and other ...
ATURE ANGBERA, HUAH YONG CHAN
doaj   +1 more source

Self‐adaptation on parallel stream processing: A systematic review

open access: yesConcurrency and Computation: Practice and Experience, Volume 34, Issue 6, 10 March 2022., 2022
Summary A recurrent challenge in real‐world applications is autonomous management of the executions at run‐time. In this vein, stream processing is a class of applications that compute data flowing in the form of streams (e.g., video feeds, images, and data analytics), where parallel computing can help accelerate the executions. On the one hand, stream
Adriano Vogel   +3 more
wiley   +1 more source

SPOT: Testing Stream Processing Programs with Symbolic Execution and Stream Synthesizing

open access: yesApplied Sciences, 2021
Adoption of distributed stream processing (DSP) systems such as Apache Flink in real-time big data processing is increasing. However, DSP programs are prone to be buggy, especially when one programmer neglects some DSP features (e.g., source data ...
Qian Ye, Minyan Lu
doaj   +1 more source

[Retracted] Low‐Carbon Awareness Information Technology of Enterprise Executives Based on Big Data and Multimodal Information Fusion

open access: yesMobile Information Systems, Volume 2022, Issue 1, 2022., 2022
The so‐called multimodal information refers to the information from different information sources on different or the same side of the same description target. These pieces of information are different in terms of storage structure, representation, semantic connotation, credibility, and emphasis, but there is a certain inevitable connection between ...
Guimei Yang, Fusheng Zhu
wiley   +1 more source

Home - About - Disclaimer - Privacy