
Redescription Model Mining [PDF]

open access: yes, Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2021
This paper introduces Redescription Model Mining, a novel approach to identify interpretable patterns across two datasets that share only a subset of attributes and have no common instances. In particular, Redescription Model Mining aims to find pairs of describable data subsets -- one for each dataset -- that induce similar exceptional models with ...
Stamm, Felix I.   +3 more
openaire   +2 more sources

Interactive redescription mining [PDF]

open access: yes, Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, 2014
Exploratory data analysis consists of multiple iterated steps: a data mining method is run on the data, the results are interpreted, new insights are formed, and the resulting knowledge is utilized when executing the method in a next round, and so on until satisfactory results are obtained.
Galbrun, E., Miettinen, P.
openaire   +3 more sources

Mining Redescriptions with Siren [PDF]

open access: yes, ACM Transactions on Knowledge Discovery from Data, 2018
In many areas of science, scientists need to find distinct common characterizations of the same objects and, vice versa, to identify sets of objects that admit multiple shared descriptions. For example, in biology, an important task is to identify the bioclimatic constraints that allow some species to survive, that is, to describe geographical regions ...
Galbrun, Esther, Miettinen, Pauli
openaire   +3 more sources

Differentially private tree-based redescription mining

open access: yes, Data Mining and Knowledge Discovery, 2023
Differential privacy provides a strong form of privacy and allows preserving most of the original characteristics of the dataset. Utilizing these benefits requires one to design specific differentially private data analysis algorithms. In this work, we present three tree-based algorithms for mining redescriptions while preserving differential ...
Matej Mihelčić, Pauli Miettinen
openaire   +3 more sources

Associating life stages and sexes of Nearctic Polycentropus Curtis, 1835 (Trichoptera: Polycentropodidae) using mitochondrial DNA barcoding

open access: yes, Ecology and Evolution, Volume 12, Issue 3, March 2022
Caddisfly taxonomy is conventionally based on males, resulting in a relatively poor understanding of females and immature stages. Using mtDNA barcoding, we were able to associate additional females and larvae of Nearctic members of the genus Polycentropus.
Alexander B. Orfinger   +2 more
wiley   +1 more source

Analysing Political Opinions Using Redescription Mining [PDF]

open access: yes, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW), 2016
Understanding the socio-economical background of voters supporting a certain cause or, vice versa, understanding the political stance of people from a certain socio-economical niche are important questions in political sciences. Traditionally, answering these questions has required the researcher to fix either the political stance or the socio ...
Galbrun, Esther, Miettinen, Pauli
openaire   +3 more sources

Commodity risk assessment of bonsai plants from China consisting of Pinus parviflora grafted on Pinus thunbergii

open access: yes, EFSA Journal, Volume 20, Issue 2, February 2022
The European Commission requested the EFSA Panel on Plant Health to prepare and deliver a scientific opinion on the risk posed by bonsai plants from China consisting of Pinus parviflora grafted on Pinus thunbergii taking into account the available scientific information, including the technical information provided by China.
EFSA Panel on Plant Health (PLH)   +28 more
wiley   +1 more source

The limits of Quediini at last (Staphylinidae: Staphylininae): a rove beetle mega‐radiation resolved by comprehensive sampling and anchored phylogenomics

open access: yes, Systematic Entomology, Volume 46, Issue 2, Pages 396-421, April 2021
Novel probe set developed for anchored hybrid enrichment of Staphylinidae (Coleoptera), targeting 1229 single‐copy, protein‐encoding, orthologous loci from the nuclear genome. Comprehensive phylogeny of rove beetle tribe Quediini (∼800 spp.), with 46% of 201 ingroup taxa sequenced from pinned specimens, will serve as framework for needed generic ...
Adam J. Brunke   +15 more
wiley   +1 more source

An Optimization Approach for Mining of Process Models with Infrequent Behaviors Integrating Data Flow and Control Flow

open access: yes, Scientific Programming, Volume 2021, Issue 1, 2021
Infrequent behaviors of business process refer to behaviors that occur in very exceptional cases, and their occurrence frequency is low as their required conditions are rarely fulfilled. Hence, a strong coupling relationship between infrequent behavior and data flow exists.
Li-li Wang   +4 more
wiley   +1 more source

Redescription Mining and Applications in Bioinformatics [PDF]

open access: yes, 2009
Our ability to interrogate the cell and computationally assimilate its answers is improving at a dramatic pace. For instance, the study of even a focused aspect of cellular activity, such as gene action, now benefits from multiple high-throughput data acquisition technologies such as microarrays, genome-wide deletion screens, and RNAi assays.
Naren Ramakrishnan, Mohammed Zaki
openaire   +2 more sources
