Large-scale digital forensic investigation for Windows registry on Apache Spark [PDF]
In this study, we investigate large-scale digital forensic investigation on Apache Spark using a Windows registry. Because the Windows registry depends on the system on which it operates, the existing forensic methods on the Windows registry have been ...
Jun-Ha Lee, Hyuk-Yoon Kwon
doaj +4 more sources
Framing Apache Spark in life sciences [PDF]
Advances in high-throughput and digital technologies have required the adoption of big data for handling complex tasks in life sciences. However, the drift to big data led researchers to face technical and infrastructural challenges for storing, sharing,
Andrea Manconi +4 more
doaj +5 more sources
A Parallel Multiobjective PSO Weighted Average Clustering Algorithm Based on Apache Spark [PDF]
Multiobjective clustering algorithm using particle swarm optimization has been applied successfully in some applications. However, existing algorithms are implemented on a single machine and cannot be directly parallelized on a cluster, which makes it ...
Huidong Ling +5 more
doaj +4 more sources
HRV-Spark: Computing Heart Rate Variability Measures Using Apache Spark. [PDF]
Heart rate variability (HRV) analysis has been serving as a significant promising marker in clinical research over the last few decades. The rapidly growing heart rate data generated from various devices, particularly the electrocardiograph (ECG), need to be stored properly and processed timely.
Qu X, Wu Y, Liu J, Cui L.
europepmc +6 more sources
Apache Spark ile Makine Öğrenmesi Destekli Diyabet Rahatsızlığı Tahmini
Diyabet rahatsızlığı, insan vücudunun organlarını etkileyen kritik sağlık sorunlarından biridir. Bu nedenle, diyabet, 21. yüzyılda küresel bir sağlık sorunu olarak kabul edilmektedir.
Emre Yıldırım, Ali Çalhan
doaj +2 more sources
Implementing Apache Spark jobs execution and Apache Spark cluster creation for Openstack Sahara[1] [PDF]
In this paper the problem of creating virtual clusters in clouds for big data analysis with Apache Hadoop and Apache Spark is discussed. Existing methods for Apache Spark clusters creation are described in this work.
A. . Aleksiyants +4 more
doaj +4 more sources
Big Data in metagenomics: Apache Spark vs MPI. [PDF]
The progress of next-generation sequencing has lead to the availability of massive data sets used by a wide range of applications in biology and medicine.
José M Abuín +4 more
doaj +2 more sources
Bioinformatics applications on Apache Spark. [PDF]
With the rapid development of next-generation sequencing technology, ever-increasing quantities of genomic data pose a tremendous challenge to data processing. Therefore, there is an urgent need for highly scalable and powerful computational systems. Among the state-of-the-art parallel computing platforms, Apache Spark is a fast, general-purpose, in ...
Guo R, Zhao Y, Zou Q, Fang X, Peng S.
europepmc +4 more sources
Pengukuran Performa Apache Spark dengan Library H2O Menggunakan Benchmark Hibench Berbasis Cloud Computing [PDF]
Apache Spark merupakan platform yang dapat digunakan untuk memproses data dengan ukuran data yang relatif besar (big data) dengan kemampuan untuk membagi data tersebut ke masing-masing cluster yang telah ditentukan konsep ini disebut dengan parallel ...
Aminudin Aminudin, Eko Budi Cahyono
doaj +2 more sources
Experimenting sensitivity-based anonymization framework in apache spark [PDF]
One of the biggest concerns of big data and analytics is privacy. We believe the forthcoming frameworks and theories will establish several solutions for the privacy protection.
Mohammed Al-Zobbi +2 more
doaj +2 more sources

