Results 81 to 90 of about 208,737 (313)
Discriminative Topic Mining via Category-Name Guided Text Embedding [PDF]
Mining a set of meaningful and distinctive topics automatically from massive text corpora has broad applications. Existing topic models, however, typically work in a purely unsupervised way, which often generate topics that do not fit users' particular needs and yield suboptimal performance on downstream tasks.
arxiv +1 more source
SyMO: A Hybrid Approach for Multi‐Objective Optimization of Crystal Growth Processes
The hybrid SyMO (Symbolic regression Multi‐objective Optimization) framework combines Computational Fluid Dynamics (CFD), machine learning, and mathematical optimization techniques to investigate the effects of various process parameters, furnace geometries, and radiation shield material properties on key crystal quality metrics in Czochralski silicon (
Milena Petkovic, Natasha Dropka
wiley +1 more source
Accessing accurate documents by mining auxiliary document information [PDF]
Earlier techniques of text mining included algorithms like k-means, Naive Bayes, SVM which classify and cluster the text document for mining relevant information about the documents. The need for improving the mining techniques has us searching for techniques using the available algorithms.
arxiv
Comparison of Syntactic Parsers on Biomedical Texts [PDF]
Syntactic parsing is an important step in the automated text analysis which aims at information extraction. Quality of the syntactic parsing determines to a large extent the recall and precision of the text mining results. In this paper we evaluate the performance of several popular syntactic parsers in application to the biomedical text mining.
arxiv
Accurately predicting protein structure is of great significance in biological research. LightRoseTTA, a light‐weight deep graph network, to achieve prediction for proteins is presented. Notably, three highlights are possessed by LightRoseTTA: i) high‐accurate structure prediction for proteins; ii) high‐efficient training and inference; and iii) low ...
Xudong Wang+7 more
wiley +1 more source
Abstract Recent research suggests that the effect of greenwashing and corporate financial performance (CFP) is ambiguous. This call for study the contextual factors that create contingencies in the greenwashing–CFP relationship. Using a sample of 2816 observations covering 735 Chinese‐listed firms in 21 different industries from 2013 to 2017, this ...
Wei Li+3 more
wiley +1 more source
A Survey of the Applications of Text Mining for the Food Domain
In the food domain, text mining techniques are extensively employed to derive valuable insights from large volumes of text data, facilitating applications such as aiding food recalls, offering personalized recipes, and reinforcing food safety regulation.
Shufeng Xiong+4 more
doaj +1 more source
Traditional Chinese Medicine (TCM), while holistic and historically esteemed, faces challenges in “miracle cures” due to slow onset, long cycles, and difficulty controlling quality. This study obtains the active ingredients, glabridin (GLA) and puerarin (PUE), from Ge‐Gen Decoction (GGD), developing a safe and effective drug delivery system, GLA‐PUE ...
Jianhong Qi+7 more
wiley +1 more source
Text mining brain imaging reports
Background With the improvements to text mining technology and the availability of large unstructured Electronic Healthcare Records (EHR) datasets, it is now possible to extract structured information from raw text contained within EHR at reasonably high
Beatrice Alex+5 more
doaj +1 more source
MaTableGPT: GPT‐Based Table Data Extractor from Materials Science Literature
A significant challenge in materials informatics today is the creation of large‐scale experimental databases. This difficulty primarily stems from the complexities involved in consolidating locally distributed datasets from various laboratories into a single cohesive database.
Gyeong Hoon Yi+11 more
wiley +1 more source