Results 51 to 60 of about 8,833 (199)

Global Copper Deposit Dataset: A New Open‐Source Database for Advanced Data Analysis and Exploration Targeting

open access: yesGeoscience Data Journal, Volume 13, Issue 1, January 2026.
We build a new, open‐source global copper deposit dataset (GCDD), facilitating AI‐driven data analysis for exploration targeting and improving our understanding of copper mineralizing systems and their mappable expressions. The GCDD hosts information about 1483 copper deposits worldwide, capturing key deposit attributes such as location, genetic type ...
Bin Wang   +2 more
wiley   +1 more source

Observations on Factors Affecting Performance of MapReduce based Apriori on Hadoop Cluster

open access: yes, 2017
Designing fast and scalable algorithm for mining frequent itemsets is always being a most eminent and promising problem of data mining. Apriori is one of the most broadly used and popular algorithm of frequent itemset mining.
Garg, Rakhi   +2 more
core   +1 more source

A Design‐Driven Machine Learning Approach for Invariant Mining in a Smart Grid

open access: yesIET Cyber-Physical Systems: Theory &Applications, Volume 11, Issue 1, January/December 2026.
An ICS is vulnerable to cyber‐attacks arising from within its communication network or directly from the SCADA and devices such as PLCs. The study reported here presents a scenario‐specific invariant mining approach to detect anomalies in plant behaviour.
Danish Hudani   +5 more
wiley   +1 more source

A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM

open access: yesIEEE Access
Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis.
Muhammad Waheed Ashraf   +2 more
doaj   +1 more source

Mining frequent closed itemsets out of core [PDF]

open access: yesProceedings of the 2006 SIAM International Conference on Data Mining, 2006
Extracting frequent itemsets is an important task in many data mining applications. When data are very large, it becomes mandatory to perform the mining task by using an external memory algorithm, but only a few of these algorithms have been proposed so far.
LUCCHESE, Claudio   +2 more
openaire   +3 more sources

From Prediction to Prevention: Using Text Mining and Explainable Machine Learning for Urban Bus Accident Analytics

open access: yesRisk Analysis, Volume 46, Issue 1, January 2026.
ABSTRACT Urban bus accidents present major safety and operational challenges, particularly in densely populated metropolitan areas. This study develops a machine learning‐based analytical framework to identify, quantify, and interpret the factors associated with severe bus accidents.
Bowei Chen   +3 more
wiley   +1 more source

Reductions for Frequency-Based Data Mining Problems

open access: yes, 2017
Studying the computational complexity of problems is one of the - if not the - fundamental questions in computer science. Yet, surprisingly little is known about the computational complexity of many central problems in data mining. In this paper we study
Miettinen, Pauli, Neumann, Stefan
core   +1 more source

Correlation Analysis of Influencing Factors of Autonomous Vehicle Accidents Based on Improved Apriori Algorithm

open access: yesJournal of Advanced Transportation, Volume 2026, Issue 1, 2026.
The purpose of this study was to explore the risk factors for autonomous vehicle (AV) crashes and their interdependencies. A total of 659 AV crash data were collected between 2018 and July 2024 from AV crash reports published by the California Department of Motor Vehicles.
Tao Wang   +4 more
wiley   +1 more source

Taming the Triangle: On the Interplays Between Fairness, Interpretability, and Privacy in Machine Learning

open access: yesComputational Intelligence, Volume 41, Issue 4, August 2025.
ABSTRACT Machine learning techniques are increasingly used for high‐stakes decision‐making, such as college admissions, loan attribution, or recidivism prediction. Thus, it is crucial to ensure that the models learnt can be audited or understood by human users, do not create or reproduce discrimination or bias and do not leak sensitive information ...
Julien Ferry   +4 more
wiley   +1 more source

An efficient and resilience linear prefix approach for mining maximal frequent itemset using clustering

open access: yesJournal of Safety Science and Resilience
The numerous volumes of data generated every day necessitate the deployment of new technologies capable of dealing with massive amounts of data efficiently.
M. Sinthuja   +5 more
doaj   +1 more source

Home - About - Disclaimer - Privacy