Space-efficient Feature Maps for String Alignment Kernels [PDF]
String kernels are attractive data analysis tools for analyzing string data. Among them, alignment kernels are known for their high prediction accuracies in string classifications when tested in combination with SVM in various applications.
CC Chang +10 more
core +4 more sources
Large scale material science data analysis [PDF]
Material Science, the science of studying materials and their properties, involves many aspects such as performing experiments to calculate certain physical properties. Scientists are always looking to utilise the collected experimental data in order to make predictions for new points, where the studied property is unknown.
openaire +2 more sources
The Mass-Size Relation from Clouds to Cores. I. A new Probe of Structure in Molecular Clouds [PDF]
We use a new contour-based map analysis technique to measure the mass and size of molecular cloud fragments continuously over a wide range of spatial scales (0.05 < r / pc < 10), i.e., from the scale of dense cores to those of entire clouds.
A. A. Goodman +23 more
core +3 more sources
Recursive Data Analysis in Large Scale Complex Systems [PDF]
Advanced data analysis is needed in practical applications in large scale complex systems. Variable specific data-driven solutions provide consistent levels, which can be used in compact model structures. In changing operating conditions, the recursive analysis extends the applicability of these structures in building and tuning dynamic and ...
openaire +2 more sources
Finding regulatory modules through large-scale gene-expression data analysis
The use of gene microchips has enabled a rapid accumulation of gene-expression data. One of the major challenges of analyzing this data is the diversity, in both size and signal strength, of the various modules in the gene regulatory networks of ...
Kloster, Morten +2 more
core +1 more source
Exploratory data analysis in large-scale genetic studies [PDF]
Genome-wide association studies (GWAS) have become the method of choice for investigating the genetic basis of common diseases and complex traits. The immense scale of these experiments is unprecedented, involving thousands of samples and up to a million variables.
openaire +3 more sources
The Iterative Signature Algorithm for the analysis of large scale gene expression data
We present a new approach for the analysis of genome-wide expression data. Our method is designed to overcome the limitations of traditional techniques, when applied to large-scale data.
A. Brazma +28 more
core +1 more source
Statistical Traffic State Analysis in Large-scale Transportation Networks Using Locality-Preserving Non-negative Matrix Factorization [PDF]
Statistical traffic data analysis is a hot topic in traffic management and control. In this field, current research focuses on analyzing traffic flows of individual links or local regions in a transportation network.
Han, Yufei; Moutarde, Fabien
core +5 more sources
HiSem-RAG: A Hierarchical Semantic-Driven Retrieval-Augmented Generation Method
Traditional retrieval-augmented generation (RAG) methods struggle with hierarchical documents, often causing semantic fragmentation, structural loss, and inefficient retrieval due to fixed strategies.
Dongju Yang, Junming Wang
doaj +1 more source
A Novel Clustering Algorithm for Wi-Fi Indoor Positioning
In recent years, the Wi-Fi-based indoor positioning technology has become a research hotspot. This technology mainly locates the indoor Wi-Fi based on the received signal strength indicator (RSSI) signals.
Jin Ren +4 more
doaj +1 more source

