Apache spark - Open Access .click

Results 1 to 10 of about 22,068 (223)

Large-scale digital forensic investigation for Windows registry on Apache Spark [PDF]

PLoS ONE, 2022
In this study, we investigate large-scale digital forensic investigation on Apache Spark using a Windows registry. Because the Windows registry depends on the system on which it operates, the existing forensic methods on the Windows registry have been ...
Jun-Ha Lee, Hyuk-Yoon Kwon
doaj +4 more sources

Framing Apache Spark in life sciences [PDF]

Heliyon, 2023
Advances in high-throughput and digital technologies have required the adoption of big data for handling complex tasks in life sciences. However, the drift to big data led researchers to face technical and infrastructural challenges for storing, sharing,
Andrea Manconi +4 more
doaj +5 more sources

A Parallel Multiobjective PSO Weighted Average Clustering Algorithm Based on Apache Spark [PDF]

Entropy, 2023
Multiobjective clustering algorithm using particle swarm optimization has been applied successfully in some applications. However, existing algorithms are implemented on a single machine and cannot be directly parallelized on a cluster, which makes it ...
Huidong Ling +5 more
doaj +4 more sources

HRV-Spark: Computing Heart Rate Variability Measures Using Apache Spark. [PDF]

Proceedings (IEEE Int Conf Bioinformatics Biomed), 2020
Heart rate variability (HRV) analysis has been serving as a significant promising marker in clinical research over the last few decades. The rapidly growing heart rate data generated from various devices, particularly the electrocardiograph (ECG), need to be stored properly and processed timely.
Qu X, Wu Y, Liu J, Cui L.
europepmc +6 more sources

Apache Spark ile Makine Öğrenmesi Destekli Diyabet Rahatsızlığı Tahmini

Düzce Üniversitesi Bilim ve Teknoloji Dergisi, 2022
Diyabet rahatsızlığı, insan vücudunun organlarını etkileyen kritik sağlık sorunlarından biridir. Bu nedenle, diyabet, 21. yüzyılda küresel bir sağlık sorunu olarak kabul edilmektedir.
Emre Yıldırım, Ali Çalhan
doaj +2 more sources

Implementing Apache Spark jobs execution and Apache Spark cluster creation for Openstack Sahara[1] [PDF]

Труды Института системного программирования РАН, 2018
In this paper the problem of creating virtual clusters in clouds for big data analysis with Apache Hadoop and Apache Spark is discussed. Existing methods for Apache Spark clusters creation are described in this work.
A. . Aleksiyants +4 more
doaj +4 more sources

Big Data in metagenomics: Apache Spark vs MPI. [PDF]

PLoS ONE, 2020
The progress of next-generation sequencing has lead to the availability of massive data sets used by a wide range of applications in biology and medicine.
José M Abuín +4 more
doaj +2 more sources

Bioinformatics applications on Apache Spark. [PDF]

Gigascience, 2018
With the rapid development of next-generation sequencing technology, ever-increasing quantities of genomic data pose a tremendous challenge to data processing. Therefore, there is an urgent need for highly scalable and powerful computational systems. Among the state-of-the-art parallel computing platforms, Apache Spark is a fast, general-purpose, in ...
Guo R, Zhao Y, Zou Q, Fang X, Peng S.
europepmc +4 more sources

Pengukuran Performa Apache Spark dengan Library H2O Menggunakan Benchmark Hibench Berbasis Cloud Computing [PDF]

Jurnal Teknologi Informasi dan Ilmu Komputer, 2019
Apache Spark merupakan platform yang dapat digunakan untuk memproses data dengan ukuran data yang relatif besar (big data) dengan kemampuan untuk membagi data tersebut ke masing-masing cluster yang telah ditentukan konsep ini disebut dengan parallel ...
Aminudin Aminudin, Eko Budi Cahyono
doaj +2 more sources

Experimenting sensitivity-based anonymization framework in apache spark [PDF]

Journal of Big Data, 2018
One of the biggest concerns of big data and analytics is privacy. We believe the forthcoming frameworks and theories will establish several solutions for the privacy protection.
Mohammed Al-Zobbi, Seyed Shahrestani, Chun Ruan +2 more
doaj +2 more sources

data mining
artificial intelligence
operating system

machine learning
database
physics

parallel computing
algorithm
data science