Caught you: threats to confidentiality due to the public release of large-scale genetic data sets [PDF]
Background: Large-scale genetic data sets are frequently shared with other research groups and even released on the Internet to allow for secondary analysis. Study participants are usually not informed about such data sharing because data sets are assumed ...
Wjst Matthias
doaj +5 more sources
Global rank-invariant set normalization (GRSN) to reduce systematic distortions in microarray data [PDF]
Background: Microarray technology has become very popular for globally evaluating gene expression in biological samples. However, non-linear variation associated with the technology can make data interpretation unreliable.
Kulesz-Martin Molly +3 more
doaj +5 more sources
Generic Environments in Coq [PDF]
We introduce a library which provides an abstract data type of environments, as a functor parameterized by a module defining variables, and a function which builds environments for such variables with any Type of type. Usual operations over environments are defined, along with an extensive set of basic and more advanced properties. Moreover, we give an ...
Polonowski, Emmanuel
arxiv +3 more sources
Empirical Bayes models for multiple probe type microarrays at the probe level [PDF]
Background: When analyzing microarray data, a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests, the sample variances are adjusted towards a global estimate, producing more stable results compared ...
Rudemo Mats +2 more
doaj +5 more sources
Harvesting metadata in clinical care: a crosswalk between FHIR, OMOP, CDISC and openEHR metadata
Metadata describe the source, type of creation, structure, status and semantics of data, and are a prerequisite for the preservation and reuse of medical data.
Caroline Bönisch +2 more
doaj +1 more source
Implementation and Evaluation of a Multivariate Abstraction-Based, Interval-Based Dynamic Time-Warping Method as a Similarity Measure for Longitudinal Medical Records [PDF]
We extended dynamic time warping (DTW) into interval-based dynamic time warping (iDTW), including (A) interval-based representation (iRep): [1] abstracting raw, time-stamped data into interval-based abstractions, [2] comparison-period scoping, [3] partitioning abstract intervals into a given temporal granularity; (B) interval-based matching (iMatch ...
arxiv +1 more source
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers [PDF]
Readers of academic research papers often read with the goal of answering specific questions. Question Answering systems that can answer those questions can make consumption of the content much more efficient.
Pradeep Dasigi +5 more
semanticscholar +1 more source
Static Safety for an Actor Dedicated Process Calculus by Abstract Interpretation [PDF]
The actor model eases the definition of concurrent programs with non-uniform behaviors. Static analysis of such a model was previously done in a data-flow oriented way, with type systems.
A. Igarashi +20 more
core +4 more sources
Classes: an abstract data type facility for the C language
Language constructs for definition and use of abstract data types ease the design and maintenance of large programs. This paper describes the C class concept, an extension to the C language providing such constructs.
B. Stroustrup
semanticscholar +1 more source
SkelCL - A Portable Skeleton Library for High-Level GPU Programming [PDF]
While CUDA and OpenCL made general-purpose programming for Graphics Processing Units (GPU) popular, using these programming approaches remains complex and error-prone because they lack high-level abstractions.
Gorlatch, Sergei +2 more
core +1 more source