Results 21 to 30 of about 4,692,865 (188)

A Novel Attention-Based Multi-Modal Modeling Technique on Mixed Type Data for Improving TFT-LCD Repair Process

open access: yesIEEE Access, 2022
In Thin-Film Transistor Liquid-Crystal Display (TFT-LCD) manufacturing, conducting a machine learning based system with multiple data types has become actively desired to solve complicated problems.
Yi Liu, Hsueh-Ping Lu, Ching-Hao Lai
doaj   +1 more source

Model Based Clustering for Mixed Data: clustMD [PDF]

open access: yes, 2015
A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.
Gormley, Isobel Claire   +1 more
core   +3 more sources

Identification of Type 2 Diabetes Biomarkers From Mixed Single-Cell Sequencing Data With Feature Selection Methods

open access: yesFrontiers in Bioengineering and Biotechnology, 2022
Diabetes is the most common disease and a major threat to human health. Type 2 diabetes (T2D) makes up about 90% of all cases. With the development of high-throughput sequencing technologies, more and more fundamental pathogenesis of T2D at genetic and ...
Zhandong Li, Xiaoyong Pan, Yu-Dong Cai
doaj   +1 more source

Autonomous clustering using rough set theory [PDF]

open access: yes, 2008
This paper proposes a clustering technique that minimises the need for subjective human intervention and is based on elements of rough set theory. The proposed algorithm is unified in its approach to clustering and makes use of both local and global ...
A. K. Jain   +33 more
core   +1 more source

Causal Inference on Multivariate and Mixed-Type Data [PDF]

open access: yes, 2019
Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$. In particular, we consider the general case where both $X$ and $Y$ may be univariate or multivariate, and of the same or mixed data types. We take an information theoretic approach, based
Marx, A., Vreeken, J.
openaire   +3 more sources

Perbandingan Metode Klasterisasi Data Bertipe Campuran: One-Hot-Encoding, Gower Distance, dan K-Prototype Berdasarkan Akurasi (Studi Kasus: Chronic Kidney Disease Dataset)

open access: yesJournal of Applied Informatics and Computing, 2023
Penelitian ini bertujuan untuk membandingkan metode one-hot-encoding, Gower distance yang dikombinasikan dengan algoritma k-means, DBSCAN, dan OPTICS, serta k-prototype untuk pengelompokan data bertipe campuran.
Zahra Rizky Fadilah   +1 more
doaj   +1 more source

A Two-stage Method for Inverse Medium Scattering [PDF]

open access: yes, 2012
We present a novel numerical method to the time-harmonic inverse medium scattering problem of recovering the refractive index from near-field scattered data.
Bakushinsky   +30 more
core   +1 more source

Continuous Diffusion for Mixed-Type Tabular Data

open access: yes, 2023
Score-based generative models, commonly referred to as diffusion models, have proven to be successful at generating text and image data. However, their adaptation to mixed-type tabular data remains underexplored. In this work, we propose CDTD, a Continuous Diffusion model for mixed-type Tabular Data.
Mueller, Markus   +2 more
openaire   +2 more sources

Clustering mixed-type data using a probabilistic distance algorithm

open access: yesApplied Soft Computing, 2022
Cluster analysis is a broadly used unsupervised data analysis technique for finding groups of homogeneous units in a data set. Probabilistic distance clustering adjusted for cluster size (PDQ), discussed in this contribution, falls within the broad category of clustering methods initially developed to deal with continuous data; it has the advantage of ...
Cristina Tortora, Francesco Palumbo
openaire   +2 more sources

Cluster Validation for Mixed-Type Data

open access: yes, 2020
For cluster analysis based on mixed-type data (i.e. data consisting of numerical and categorical variables), comparatively few clustering methods are available. One popular approach to deal with this kind of problems is an extension of the k-means algorithm (Huang, 1998), the so-called k-prototype algorithm, which is implemented in the R package ...
Aschenbruck, Rabea, Szepannek, Gero
openaire   +2 more sources

Home - About - Disclaimer - Privacy