Results 51 to 60 of about 9,555 (241)
Internet jako pramen výzkumu: Přístup k archivovaným webovým zdrojům a možnosti jejich zpracování
Internet se stal přirozenou komunikační platformou soudobé společnosti. Webové archivy, které začaly vznikat v 90. letech 20. století s cílem zachytit a uchovat proměnlivý webový obsah, se tak staly klíčovými prameny pro výzkum nedávné minulosti ...
Zdenko Vozár+2 more
doaj +1 more source
Performance Tuning of Hadoop MapReduce: A Noisy Gradient Approach [PDF]
Hadoop MapReduce is a framework for distributed storage and processing of large datasets that is quite popular in big data analytics. It has various configuration parameters (knobs) which play an important role in deciding the performance i.e., the execution time of a given big data processing job.
arxiv +1 more source
Grid computing is an emerging technology that enabled the heterogeneous collection of data and provisioning of services to the users. Due to the high amount of incoming heterogeneous request, grid computing needs an efficient scheduling to reduce execution time and satisfy service level agreement (SLA) and quality of service (QoS) requirements.
Gangasandra Mahadevaiah Kiran+1 more
wiley +1 more source
Enabling Data Driven Projects for a Modern Enterprise
With the growing volume and demand for data a major concern for an Organization trying to implement Data Driven projects, is not only how to technically collect, cleanse, integrate, access, but even more so, how and why to use it.
Artyom Topchyan
doaj +1 more source
Experimental assessment of containers running on top of virtual machines
In this paper, the performance of containers running on top of virtual machines is experimentally compared with standalone virtual machines and containers based on different hardware resources, including the processor, main memory, disk, and network in a real testbed by running the most commonly used benchmarks.
Hossein Aqasizade+2 more
wiley +1 more source
LINEAR REGRESSION WITH R AND HADOOP [PDF]
In this paper we present a way to solve the linear regression model with R and Hadoop using the Rhadoop library. We show how the linear regression model can be solved even for very large models that require special technologies.
Bogdan OANCEA
doaj
Detection of Sensitive Data to Counter Global Terrorism
Global terrorism has created challenges to the criminal justice system due to its abnormal activities, which lead to financial loss, cyberwar, and cyber-crime.
Binod Kumar Adhikari+4 more
doaj +1 more source
While analyzing health data is important for improving health outcomes, class imbalance in datasets poses major challenges to machine learning classification models. This work, therefore, considers the class imbalance problem in stroke prediction using models such as K‐nearest neighbors, support vector machine, logistic regression, random forest, and ...
Edmund Fosu Agyemang+7 more
wiley +1 more source
The surge in big data and analytics has catalysed the proliferation of cybercrime, largely driven by organisations’ intensified focus on gathering and processing personal data for profit while often overlooking security considerations.
Cephas Mpungu+2 more
doaj +1 more source
Processing of raw astronomical data of large volume by MapReduce model
Exponential grow of volume, increased quality of data in current and incoming sky surveys open new horizons for astrophysics but require new approaches to data processing especially big data technologies and cloud computing.
S. . Gerasimov+4 more
doaj +1 more source