Results 191 to 200 of about 22,068 (223)

Apache Spark

Communications of the ACM, 2016
This open source computing framework unifies streaming, batch, and interactive big data workloads to unlock new applications.
Matei Zaharia   +13 more
openaire   +1 more source

Apache Spark and Apache Ignite Performance Analysis

2019 22nd International Conference on Control Systems and Computer Science (CSCS), 2019
Big Data represents an actual research topic. More and more it becomes part of people life's through different applications that are used daily, such as stock exchange, news, social media, health-care. All these applications make use of Big Data technologies for storing and processing information.
Cristiana-Stefania Stan   +4 more
openaire   +1 more source

Understanding Apache Spark

2021
Apache Spark is a data analytics platform that has made big data accessible and brings large-scale data processing into the reach of every developer. With Apache Spark, it is as easy to read from a single CSV file on your local machine as it is to read from a million CSV files in a data lake.
openaire   +1 more source

Performance comparison of Apache Hadoop and Apache Spark

Proceedings of the Third International Conference on Advanced Informatics for Computing Research, 2019
The term 'Big Data' is a broad term used for the data sets, which is enormous and traditional data processing applications find it hard to process. Both Apache Spark and Apache Hadoop are one of the significant parts of the big data family. Some of the researchers view both frameworks as the rivals but it is not that easy to compare these two as they ...
Amritpal Singh   +2 more
openaire   +1 more source

Partitioning in Apache Spark

2019
Apache Spark performs in-memory computation. The data structure used is Resilient Distributed Datasets (RDDs). These RDDs are partitioned using inbuilt Hash and Range Partitioning. We propose a partition scheme which uses modular division on keys of elements with numbers from 2 to 10.
H. S. Sreeyuktha, J. Geetha Reddy
openaire   +1 more source

The Engine: Apache Spark

2016
If our stack were a vehicle, now we have reached the engine. As an engine, we will disarm it, analyze it, master it, improve it, and run it to the limit.
Raul Estrada, Isaac Ruiz
openaire   +1 more source

Balanced Graph Partitioning with Apache Spark

2014
A significant part of the data produced every day by online services is structured as a graph. Therefore, there is the need for efficient processing and analysis solutions for large scale graphs. Among the others, the balanced graph partitioning is a well known NP-complete problem with a wide range of applications.
Carlini, Emanuele   +4 more
openaire   +2 more sources

Home - About - Disclaimer - Privacy