Results 11 to 20 of about 45,779,043 (381)
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Hinze, R, Jeuring, J, Löh, A
openaire +10 more sources
A Survey of Data Partitioning and Sampling Methods to Support Big Data Analysis
Computer clusters with the shared-nothing architecture are the major computing platforms for big data processing and analysis. In cluster computing, data partitioning and sampling are two fundamental strategies to speed up the computation of big data and
Mohammad Sultan Mahmud+4 more
doaj +1 more source
Seven Primary Data Types in Citizen Science Determine Data Quality Requirements and Methods
Data quality (DQ) is a major concern in citizen science (CS) programs and is often raised as an issue among critics of the CS approach. We examined CS programs and reviewed the kinds of data they produce to inform CS communities of strategies of DQ ...
Robert D. Stevenson+3 more
doaj +1 more source
What Contributes to a Crowdfunding Campaign’s Success? Evidence and Analyses from GoFundMe Data
Researchers have attempted to measure the success of crowdfunding campaigns using a variety of determinants, such as the descriptions of the crowdfunding campaigns, the amount of funding goals, and crowdfunding project characteristics.
Xupin Zhang, Hanjia Lyu, Jiebo Luo
doaj +1 more source
Sherlock: A Deep Learning Approach to Semantic Data Type Detection [PDF]
Correctly detecting the semantic type of data columns is crucial for data science tasks such as automated data cleaning, schema matching, and data discovery. Existing data preparation and analysis systems rely on dictionary lookups and regular expression
Madelon Hulsebos+7 more
semanticscholar +1 more source
The Transrational Numbers as an Abstract Data Type
In an arithmetical structure one can make division a total function by defining 1/0 to be an element of the structure, or by adding a new element, such as an error element also denoted with a new constant symbol, an unsigned infinity or one or both ...
J. Bergstra, J. V. Tucker
semanticscholar +1 more source
Whole genome phylogeny of Gallus: introgression and data-type effects
Previous phylogenetic studies that include the four recognized species of Gallus have resulted in a number of distinct topologies, with little agreement.
G. Tiley+5 more
semanticscholar +1 more source
Heterogeneous Network-Based Chronic Disease Progression Mining
Healthcare insurance fraud has caused billions of dollars in losses in public healthcare funds around the world. In particular, healthcare insurance fraud in chronic diseases is especially rampant. Understanding disease progression can help investigators
Chenfei Sun+4 more
doaj +1 more source
Digital Objects – FAIR Digital Objects: Which Services Are Required?
Some of the early Research Data Alliance working groups reused the notion of digital objects as digital entities described by metadata and referenced by a persistent identifier. In recent times the FAIR principles became a prominent role as framework for
Ulrich Schwardmann
doaj +1 more source
Identification of cell populations often relies on manual annotation of cell clusters using established marker genes. However, the selection of marker genes is a time-consuming process that may lead to sub-optimal annotations as the markers must be ...
A. Ianevski, A. Giri, T. Aittokallio
semanticscholar +1 more source