Apache hadoop - Open Access .click

Results 161 to 170 of about 14,382 (202)

Some of the next articles are maybe not open access.

2013 IEEE 14th International Conference on Information Reuse & Integration (IRI), 2013
The paradigm of processing huge datasets has been shifted from centralized architecture to distributed architecture. As the enterprises faced issues of gathering large chunks of data they found that the data cannot be processed using any of the existing centralized architecture solutions.
Jyoti Nandimath +5 more
openaire +1 more source

Big Data Analysis Using Apache Hadoop

2014 International Conference on IT Convergence and Security (ICITCS), 2014
We live in on-demand, on-command Digital universe with data prolifering by Institutions, Individuals and Machines at a very high rate. This data is categories as "Big Data" due to its sheer Volume, Variety and Velocity. Most of this data is unstructured, quasi structured or semi structured and it is heterogeneous in nature.
Shankar Ganesh Manikandan, Siddarth Ravi
openaire +1 more source

MapReduce programming with apache Hadoop

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010
Apache Hadoop has become the platform of choice for developing large-scale dataintensive applications. In this tutorial, we will discuss design philosophy of Hadoop, describe how to design and develop Hadoop applications and higher-level application frameworks to crunch several terabytes of data, using anywhere from four to 4,000 computers.
openaire +1 more source

Extensible Video Processing Framework in Apache Hadoop

2013 IEEE 5th International Conference on Cloud Computing Technology and Science, 2013
Digital video is prominent big data spread all over the Internet. It is large not only in size but also in required processing power to extract useful information. Fast processing of excessive video reels is essential on criminal investigations, such as terrorism.
Chungmo Ryu +4 more
openaire +1 more source

Processing LIDAR Data with Apache Hadoop

2016
The paper is focused on research in the area of processing LIDAR data with Apache Hadoop. Our team is managing an information system that is able to calculate probability of existence of different objects in space and time. The system works with a lot of different data sources, including large datasets.
Jan Růžička +3 more
openaire +1 more source

Comparative Study of Apache Pig & Apache Cassandra in Hadoop Distributed Environment

2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 2020
Big data analytics is the one which acquire, organise and analyse the huge volume of data with high velocity to find some patterns and useful information. The data sets are so large that it can’t be handled by traditional databases to manage and process the structure and unstructured data. Hence, big data tools i.e.
Yogesh Kumar Gupta, Tanusha Mittal
openaire +1 more source

A modern data architecture with apache Hadoop

2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015
This paper represents the analysis of the existing architecture framework used across domains. It also emphasizes on the modern architecture in integration with apache Hadoop. The existing data architecture is under pressure from new data and machine generated data for the upcoming years that is due to emergence of new data types there has been ...
Tripty Singh, null Darshan V S
openaire +1 more source

Analyzing Performance of Apache Pig and Apache Hive with Hadoop

2018
Big Data is the term used for huge datasets which are very complex in nature and difficult to be processed using traditional devices. The current requirement is for a new technology for analyzing these huge datasets. One of the best options is Apache Hadoop as it consists of various components which work simultaneously to provide an efficient and ...
Krati Bansal, Priyanka Chawla, Pratik Kurle +2 more
openaire +1 more source

Survey of Data Locality in Apache Hadoop

2019 IEEE International Conference on Big Data, Cloud Computing, Data Science & Engineering (BCD), 2019
One of the key challenges in big data technology is the velocity at which the data is processed. Hadoop, an open-source software framework, is the dominant technology to support big data analytics. So, the researcher has tried to increase the performance of the Hadoop system. One of the Hadoop performance research is data locality.
Sungchul Lee, Ju-Yeon Jo, Yoohwan Kim
openaire +1 more source

Apache Hadoop jako analytická platforma

2017
Diploma Thesis focuses on integrating Hadoop platform into current data warehouse architecture. In theoretical part, properties of Big Data are described together with their methods and processing models. Hadoop framework, its components and distributions are discussed.
openaire +2 more sources

big data
hadoop
apache spark

mapreduce
hdfs
spark

apache hive
hive