Results 11 to 20 of about 44,292,339 (363)
MissForest - non-parametric missing value imputation for mixed-type data [PDF]
MOTIVATION Modern data acquisition based on high-throughput technology is often facing the problem of missing data. Algorithms commonly used in the analysis of such large-scale data often depend on a complete set.
D. Stekhoven, P. Bühlmann
semanticscholar +3 more sources
Local Type Checking for Linked Data Consumers [PDF]
The Web of Linked Data is the cumulation of over a decade of work by the Web standards community in their effort to make data more Web-like. We provide an introduction to the Web of Linked Data from the perspective of a Web developer that would like to ...
Gabriel Ciobanu+2 more
doaj +6 more sources
What Contributes to a Crowdfunding Campaign’s Success? Evidence and Analyses from GoFundMe Data
Researchers have attempted to measure the success of crowdfunding campaigns using a variety of determinants, such as the descriptions of the crowdfunding campaigns, the amount of funding goals, and crowdfunding project characteristics.
Xupin Zhang, Hanjia Lyu, Jiebo Luo
doaj +1 more source
Sherlock: A Deep Learning Approach to Semantic Data Type Detection [PDF]
Correctly detecting the semantic type of data columns is crucial for data science tasks such as automated data cleaning, schema matching, and data discovery. Existing data preparation and analysis systems rely on dictionary lookups and regular expression
Madelon Hulsebos+7 more
semanticscholar +1 more source
The Transrational Numbers as an Abstract Data Type
In an arithmetical structure one can make division a total function by defining 1/0 to be an element of the structure, or by adding a new element, such as an error element also denoted with a new constant symbol, an unsigned infinity or one or both ...
J. Bergstra, J. V. Tucker
semanticscholar +1 more source
Whole genome phylogeny of Gallus: introgression and data-type effects
Previous phylogenetic studies that include the four recognized species of Gallus have resulted in a number of distinct topologies, with little agreement.
G. Tiley+5 more
semanticscholar +1 more source
Identification of cell populations often relies on manual annotation of cell clusters using established marker genes. However, the selection of marker genes is a time-consuming process that may lead to sub-optimal annotations as the markers must be ...
Aleksandr Ianevski+2 more
semanticscholar +1 more source
Digital Objects – FAIR Digital Objects: Which Services Are Required?
Some of the early Research Data Alliance working groups reused the notion of digital objects as digital entities described by metadata and referenced by a persistent identifier. In recent times the FAIR principles became a prominent role as framework for
Ulrich Schwardmann
doaj +1 more source
Inductive-data-type Systems [PDF]
In a previous work ("Abstract Data Type Systems", TCS 173(2), 1997), the last two authors presented a combined language made of a (strongly normalizing) algebraic rewrite system and a typed lambda-calculus enriched by pattern-matching definitions ...
Barendregt+11 more
core +6 more sources
Heterogeneous Network-Based Chronic Disease Progression Mining
Healthcare insurance fraud has caused billions of dollars in losses in public healthcare funds around the world. In particular, healthcare insurance fraud in chronic diseases is especially rampant. Understanding disease progression can help investigators
Chenfei Sun+4 more
doaj +1 more source