Results 1 to 10 of about 9,555 (241)
Hadoop in Banking: Event-Driven Performance Evaluation. [PDF]
In today’s data‐intensive atmosphere, performance evaluation in the banking industry depends on timely and accurate insights, leading to better decision making and operational efficiency. Traditional methods for assessing bank performance often need to be improved to handle the volume, velocity, and variety of data generated in real time.
Panda M+5 more
europepmc +2 more sources
Beyond Batch Processing: Towards Real-Time and Streaming Big Data [PDF]
Today, big data are generated from many sources, and there is a huge demand for storing, managing, processing, and querying on big data. The MapReduce model and its counterpart open source implementation Hadoop, has proven itself as the de facto solution
Saeed Shahrivari
doaj +2 more sources
Hadoop on HPC: Integrating Hadoop and Pilot-based Dynamic Resource Management [PDF]
High-performance computing platforms such as supercomputers have traditionally been designed to meet the compute demands of scientific applications. Consequently, they have been architected as producers and not consumers of data. The Apache Hadoop ecosystem has evolved to meet the requirements of data processing applications and has addressed many of ...
Ioannis Paraskevakos+3 more
arxiv +5 more sources
Failure Analysis of Hadoop Schedulers using an Integration of Model Checking and Simulation [PDF]
The Hadoop scheduler is a centerpiece of Hadoop, the leading processing framework for data-intensive applications in the cloud. Given the impact of failures on the performance of applications running on Hadoop, testing and verifying the performance of the Hadoop scheduler is critical.
arxiv +1 more source
Sentiment Analysis on Hadoop with Hadoop Streaming [PDF]
Ideas and opinions of peoples are influenced by the opinions of other peoples. Lot of research is going on analysis of reviews given by peoples. Sentiment analysis is the major computational technique to calculate or observe sentiments of people’s thoughts.
Piyush Gupta+2 more
openaire +1 more source
Unravelling the JPMorgan spoofing case using particle physics visualization methods
Abstract On 29 September 2020, JPMorgan was ordered to pay a settlement of $920.2 million for spoofing the metals and Treasury futures markets from 2008 to 2016. We examine these cases using a visualization method developed in particle physics (CERN) and the messages that the exchange receives about market activity rather than time‐based snapshots ...
Philippe Debie+8 more
wiley +1 more source
Embedding GPU Computations in Hadoop [PDF]
As the size of high performance applications increases, four major challenges including heterogeneity, programmability, fault resilience, and energy efficiency have arisen in the underlying distributed systems.
Jie Zhu+5 more
doaj +1 more source
Lustre, hadoop, accumulo [PDF]
Data processing systems impose multiple views on data as it is processed by the system. These views include spreadsheets, databases, matrices, and graphs. There are a wide variety of technologies that can be used to store and process data through these different steps.
Andrew Prout+13 more
openaire +2 more sources
The research of social processes at the university using big data [PDF]
The volume of information in the 21st century is growing at a rapid pace. Big data technologies are used to process modern information. This article discusses the use of big data technologies to implement monitoring of social processes.
Hacimahmud Abdullayev Vugar+2 more
doaj +1 more source
A COMPARATIVE ANALYSIS OF CONVENTIONAL HADOOP WITH PROPOSED CLOUD ENABLED HADOOP FRAMEWORK FOR SPATIAL BIG DATA PROCESSING [PDF]
The emergence of new tools and technologies to gather the information generate the problem of processing spatial big data. The solution of this problem requires new research, techniques, innovation and development.
A. K. Tripathi, S. Agrawal, R. D. Gupta
doaj +1 more source