Results 51 to 60 of about 1,816,247 (166)
Structure learning: some testing problems
The work is based on data about the prevalence of congenital anomalies among newborns in Lithuania. The log-linear model is used to assess dependence structure of a subset of categorical variables. It is shown that fitting the log-linear model with just
Marijus Radavičius +1 more
doaj +1 more source
Clustering is a main task of data mining, and its purpose is to identify natural structures in a dataset. The results of cluster analysis are not only related to the nature of the data itself but also to some priori conditions, such as clustering ...
FU Li-wei, WU Sen
doaj +1 more source
Constrained Inference When the Sampled and Target Populations Differ
In the analysis of contingency tables, often one faces two difficult criteria: sampled and target populations are not identical and prior information translates to the presence of general linear inequality restrictions. Under these situations, we present
Huijun Yi, Bhaskar Bhattacharya
doaj +1 more source
Inconsistency of chi2 test for sparse categorical data under multinomial sampling
Simple conditions for the inconsistency of Pearson’s chi2 test in case of very sparse categorical data are given. The conditions illustrate the phenomenon of “reversed consistency”: the greater deviation from the null hypothesis the less power of the ...
Pavel Samusenko
doaj +1 more source
Characterisation of Serbian durum wheat genotypes based on UPOV-defined characteristics [PDF]
Estimation of the level of durum wheat germplasm genetic diversity is important for its classification and efficient use in breeding programmes. The aim of this study was to assess genetic diversity of durum wheat genotypes developed at the Institute of ...
Takač Verica +6 more
doaj
Information-Theoretic Modeling of Categorical Spatiotemporal GIS Data
An information-theoretic data mining method is employed to analyze categorical spatiotemporal Geographic Information System land use data. Reconstructability Analysis (RA) is a maximum-entropy-based data modeling methodology that works exclusively with ...
David Percy, Martin Zwick
doaj +1 more source
The spectral analysis of nonstationary categorical time series using local spectral envelope [PDF]
Most classical methods for the spectral analysis are based on the assumption that the time series is stationary. However, many time series in practical problems shows nonstationary behaviors.
Jeong, Hyewook, Jeong, Hyewook
core
Benchmarking imputation methods for categorical biological data
Trait datasets are at the basis of a large share of ecology and evolutionary research, being used to infer ancestral morphologies, quantify species extinction risks, or evaluate the functional diversity of biological communities. These datasets, however,
Matthieu Gendre +3 more
doaj +1 more source
Understanding and Enhancement of Internal Clustering Validation Indexes for Categorical Data
Clustering is one of the main tasks of machine learning. Internal clustering validation indexes (CVIs) are used to measure the quality of several clustered partitions to determine the local optimal clustering results in an unsupervised manner, and can ...
Xuedong Gao, Minghan Yang
doaj +1 more source
Partitioned conditional generalized linear models for categorical data [PDF]
In categorical data analysis, several regression models have been proposed for hierarchically-structured response variables, e.g. the nested logit model. But they have been formally defined for only two or three levels in the hierarchy.
Guédon, Yann +2 more
core +4 more sources

