Results 21 to 30 of about 4,692,865 (188)
In Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems.
Yi Liu, Hsueh-Ping Lu, Ching-Hao Lai
doaj +1 more source
Model Based Clustering for Mixed Data: clustMD [PDF]
A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.
Gormley, Isobel Claire +1 more
core +3 more sources
Diabetes is the most common disease and a major threat to human health. Type 2 diabetes (T2D) makes up about 90% of all cases. With the development of high-throughput sequencing technologies, more and more fundamental pathogenesis of T2D at genetic and ...
Zhandong Li, Xiaoyong Pan, Yu-Dong Cai
doaj +1 more source
Autonomous clustering using rough set theory [PDF]
This paper proposes a clustering technique that minimises the need for subjective human intervention and is based on elements of rough set theory. The proposed algorithm is unified in its approach to clustering and makes use of both local and global ...
A. K. Jain +33 more
core +1 more source
Causal Inference on Multivariate and Mixed-Type Data [PDF]
Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$. In particular, we consider the general case where both $X$ and $Y$ may be univariate or multivariate, and of the same or mixed data types. We take an information theoretic approach, based
Marx, A., Vreeken, J.
openaire +3 more sources
Penelitian ini bertujuan untuk membandingkan metode one-hot-encoding, Gower distance yang dikombinasikan dengan algoritma k-means, DBSCAN, dan OPTICS, serta k-prototype untuk pengelompokan data bertipe campuran.
Zahra Rizky Fadilah +1 more
doaj +1 more source
A Two-stage Method for Inverse Medium Scattering [PDF]
We present a novel numerical method to the time-harmonic inverse medium scattering problem of recovering the refractive index from near-field scattered data.
Bakushinsky +30 more
core +1 more source
Continuous Diffusion for Mixed-Type Tabular Data
Score-based generative models, commonly referred to as diffusion models, have proven to be successful at generating text and image data. However, their adaptation to mixed-type tabular data remains underexplored. In this work, we propose CDTD, a Continuous Diffusion model for mixed-type Tabular Data.
Mueller, Markus +2 more
openaire +2 more sources
Clustering mixed-type data using a probabilistic distance algorithm
Cluster analysis is a broadly used unsupervised data analysis technique for finding groups of homogeneous units in a data set. Probabilistic distance clustering adjusted for cluster size (PDQ), discussed in this contribution, falls within the broad category of clustering methods initially developed to deal with continuous data; it has the advantage of ...
Cristina Tortora, Francesco Palumbo
openaire +2 more sources
Cluster Validation for Mixed-Type Data
For cluster analysis based on mixed-type data (i.e. data consisting of numerical and categorical variables), comparatively few clustering methods are available. One popular approach to deal with this kind of problems is an extension of the k-means algorithm (Huang, 1998), the so-called k-prototype algorithm, which is implemented in the R package ...
Aschenbruck, Rabea, Szepannek, Gero
openaire +2 more sources

