Results 161 to 170 of about 14,382 (202)
Some of the next articles are maybe not open access.
Big data analysis using Apache Hadoop
2013 IEEE 14th International Conference on Information Reuse & Integration (IRI), 2013The paradigm of processing huge datasets has been shifted from centralized architecture to distributed architecture. As the enterprises faced issues of gathering large chunks of data they found that the data cannot be processed using any of the existing centralized architecture solutions.
Jyoti Nandimath +5 more
openaire +1 more source
Big Data Analysis Using Apache Hadoop
2014 International Conference on IT Convergence and Security (ICITCS), 2014We live in on-demand, on-command Digital universe with data prolifering by Institutions, Individuals and Machines at a very high rate. This data is categories as "Big Data" due to its sheer Volume, Variety and Velocity. Most of this data is unstructured, quasi structured or semi structured and it is heterogeneous in nature.
Shankar Ganesh Manikandan, Siddarth Ravi
openaire +1 more source
MapReduce programming with apache Hadoop
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010Apache Hadoop has become the platform of choice for developing large-scale dataintensive applications. In this tutorial, we will discuss design philosophy of Hadoop, describe how to design and develop Hadoop applications and higher-level application frameworks to crunch several terabytes of data, using anywhere from four to 4,000 computers.
openaire +1 more source
Extensible Video Processing Framework in Apache Hadoop
2013 IEEE 5th International Conference on Cloud Computing Technology and Science, 2013Digital video is prominent big data spread all over the Internet. It is large not only in size but also in required processing power to extract useful information. Fast processing of excessive video reels is essential on criminal investigations, such as terrorism.
Chungmo Ryu +4 more
openaire +1 more source
Processing LIDAR Data with Apache Hadoop
2016The paper is focused on research in the area of processing LIDAR data with Apache Hadoop. Our team is managing an information system that is able to calculate probability of existence of different objects in space and time. The system works with a lot of different data sources, including large datasets.
Jan Růžička +3 more
openaire +1 more source
Comparative Study of Apache Pig & Apache Cassandra in Hadoop Distributed Environment
2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 2020Big data analytics is the one which acquire, organise and analyse the huge volume of data with high velocity to find some patterns and useful information. The data sets are so large that it can’t be handled by traditional databases to manage and process the structure and unstructured data. Hence, big data tools i.e.
Yogesh Kumar Gupta, Tanusha Mittal
openaire +1 more source
A modern data architecture with apache Hadoop
2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015This paper represents the analysis of the existing architecture framework used across domains. It also emphasizes on the modern architecture in integration with apache Hadoop. The existing data architecture is under pressure from new data and machine generated data for the upcoming years that is due to emergence of new data types there has been ...
Tripty Singh, null Darshan V S
openaire +1 more source
Analyzing Performance of Apache Pig and Apache Hive with Hadoop
2018Big Data is the term used for huge datasets which are very complex in nature and difficult to be processed using traditional devices. The current requirement is for a new technology for analyzing these huge datasets. One of the best options is Apache Hadoop as it consists of various components which work simultaneously to provide an efficient and ...
Krati Bansal +2 more
openaire +1 more source
Survey of Data Locality in Apache Hadoop
2019 IEEE International Conference on Big Data, Cloud Computing, Data Science & Engineering (BCD), 2019One of the key challenges in big data technology is the velocity at which the data is processed. Hadoop, an open-source software framework, is the dominant technology to support big data analytics. So, the researcher has tried to increase the performance of the Hadoop system. One of the Hadoop performance research is data locality.
Sungchul Lee, Ju-Yeon Jo, Yoohwan Kim
openaire +1 more source
Apache Hadoop jako analytická platforma
2017Diploma Thesis focuses on integrating Hadoop platform into current data warehouse architecture. In theoretical part, properties of Big Data are described together with their methods and processing models. Hadoop framework, its components and distributions are discussed.
openaire +2 more sources

