Results 1 to 10 of about 13,588,304 (249)
DataSHIELD: taking the analysis to the data, not the data to the analysis [PDF]
Research in modern biomedicine and social science requires sample sizes so large that they can often only be achieved through a pooled co-analysis of data from several studies. But the pooling of information from individuals in a central database that may be queried by researchers raises important ethico-legal questions and can be controversial. In the
Gaye, Amadou +48 more
openaire +9 more sources
Stock data analysis using sympbolic data analysis
in French ...
Philippe Caillou, Edwin Diday
openaire +2 more sources
Probabilistic Active Learning for Active Class Selection. [PDF]
In machine learning, active class selection (ACS) algorithms aim to actively select a class and ask the oracle to provide an instance for that class to optimize a classifier's performance while minimizing the number of requests. In this paper, we propose
Sabsch, Tim +10 more
core +1 more source
Data Analysis of a Google Data Center [PDF]
Data collected from an operational Google data center during 29 days represent a very rich and very useful source of information for understanding the main features of a data center. In this paper, we highlight the strong heterogeneity of jobs. The distribution of job execution duration shows a high disparity, as well as the job waiting time before ...
Minet, Pascale +3 more
openaire +2 more sources
Learning from incomplete data in Bayesian networks with qualitative influences [PDF]
Domain experts can often quite reliably specify the sign of influences between variables in a Bayesian network. If we exploit this prior knowledge in estimating the probabilities of the network, it is more likely to be accepted by its users and may in ...
Sub Algorithmic Data Analysis +6 more
core +1 more source
Data Preprocessing and Intelligent Data Analysis
This paper first provides an overview of data preprocessing, focusing on problems of real world data. These are primarily problems that have to be carefully understood and solved before any data analysis process can start. The paper discusses in detail two main reasons for performing data preprocessing: (i) problems with the data and (ii) preparation ...
Fazel Famili +3 more
openaire +3 more sources
Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns [PDF]
We study how to obtain concise descriptions of discrete multivariate sequential data. In particular, how to do so in terms of rich multivariate sequential patterns that can capture potentially highly interesting (cor)relations between sequences.
Bertens, R. +11 more
core +1 more source
Analysis of neonatal mortality data for year 2016 [PDF]
This report presents an analysis of neonatal mortality in infants born in 2016 at a gestational age (GA) less that 32 weeks, and who were admitted to neonatal units that form the UK Neonatal Collaborative in England, Wales and Scotland (part).
Longford, Nicholas, Modi, Neena
core +1 more source
The objective of this report is to highlight opportunities for enhancing global research data infrastructures from the point of view of data analysis. We discuss various directions and data-analysis functionalities for supporting such infrastructures.
openaire +3 more sources
Temporal density extrapolation using a dynamic basis approach
Density estimation is a versatile technique underlying many data mining tasks and techniques, ranging from exploration and presentation of static data, to probabilistic classification, or identifying changes or irregularities in streaming data.
Sub Algorithmic Data Analysis +4 more
core +1 more source

