Results 51 to 60 of about 2,924 (186)

BigData Analysis in Healthcare: Apache Hadoop, Apache Spark and Apache Flink

open access: yes, Frontiers in Health Informatics, 2019
Introduction: Health care data is increasing. The correct analysis of such data will improve the quality of care and reduce costs. This kind of data has certain features such as high volume, variety, and high-speed production, which make it impossible to analyze with ordinary hardware and software platforms. Choosing the right platform for managing this
Elham Nazari   +2 more
openaire   +2 more sources

Software analysis of scientific texts: comparative study of distributed computing frameworks

open access: yes, Радіоелектронні і комп'ютерні системи
The relevance of this study is related to the need for efficient analysis of scientific texts in the context of the growing amount of information.
Serik Altynbek   +3 more
doaj   +1 more source

Benchmarking Distributed Stream Data Processing Systems

open access: yes, 2019
The need for scalable and efficient stream analysis has led to the development of many open-source streaming data processing systems (SDPSs) with highly diverging capabilities and performance characteristics.
Heiskanen, Henri   +5 more
core   +2 more sources

Machine learning in big data: A performance benchmarking study of Flink-ML and Spark MLlib

open access: yes, Applied Computer Science
Machine learning (ML) in big data frameworks plays a critical role in real-time analytics, decision making, and predictive modeling. Among the most prominent ML libraries for large-scale data processing are Flink-ML, the machine learning extension of ...
Messaoud MEZATI, Ines AOURIA
doaj   +1 more source

Distributed Holistic Clustering on Linked Data

open access: yes, 2017
Link discovery is an active field of research to support data integration in the Web of Data. Due to the huge size and number of available data sources, efficient and effective link discovery is a very challenging task.
A Saeedi   +6 more
core   +1 more source

Rumble: Data Independence for Large Messy Data Sets

open access: yes, 2020
This paper introduces Rumble, an engine that executes JSONiq queries on large, heterogeneous and nested collections of JSON objects, leveraging the parallel capabilities of Spark so as to provide a high degree of data independence. The design is based on
Alonso, Gustavo   +4 more
core   +1 more source

Apache Spark Streaming, Kafka and HarmonicIO: A Performance Benchmark and Architecture Comparison for Enterprise and Scientific Computing

open access: yes, 2019
This paper presents a benchmark of stream processing throughput comparing Apache Spark Streaming (under file-, TCP socket- and Kafka-based stream integration), with a prototype P2P stream processing framework, HarmonicIO.
Blamey, Ben   +2 more
core   +1 more source

Reproducible Experiments for Comparing Apache Flink and Apache Spark on Public Clouds

open access: yes, CoRR, 2016
Big data processing is a hot topic in today's computer science world. There is a significant demand for analysing big data to satisfy many requirements of many industries. Emergence of the Kappa architecture created a strong requirement for a highly capable and efficient data processing engine. Therefore data processing engines such as Apache Flink and
Shelan Perera   +2 more
openaire   +2 more sources

Video2flink: Real-Time Video Partitioning in Apache Flink and the Cloud

open access: yes, SSRN Electronic Journal, 2022
Video2Flink is a distributed highly scalable video processing system for bounded (i.e., stored) or unbounded (i.e., continuous) and real-time video streams with the same efficiency. It shows how complicated video processing tasks can be expressed and executed as pipelined data flows on Apache Flink, an open-source stream processing platform ...
Dimitrios Kastrinakis   +1 more
openaire   +1 more source

A Next‐Generation Approach to Airline Reservations: Integrating Cloud Microservices With AI and Blockchain for Enhanced Operational Performance

open access: yes, IET Blockchain, Volume 5, Issue 1, January/December 2025.
This research presents a next‐generation airline reservation system that integrates cloud microservices, distributed artificial intelligence (AI) modules, and blockchain technology to improve system efficiency, security, and customer satisfaction. The proposed architecture enhances scalability, transaction speed, and fraud prevention, while increasing ...
Biman Barua, M. Shamim Kaiser
wiley   +1 more source
