Results 51 to 60 of about 8,833 (199)
We build a new, open‐source global copper deposit dataset (GCDD), facilitating AI‐driven data analysis for exploration targeting and improving our understanding of copper mineralizing systems and their mappable expressions. The GCDD hosts information about 1483 copper deposits worldwide, capturing key deposit attributes such as location, genetic type ...
Bin Wang +2 more
wiley +1 more source
Observations on Factors Affecting Performance of MapReduce based Apriori on Hadoop Cluster
Designing fast and scalable algorithm for mining frequent itemsets is always being a most eminent and promising problem of data mining. Apriori is one of the most broadly used and popular algorithm of frequent itemset mining.
Garg, Rakhi +2 more
core +1 more source
A Design‐Driven Machine Learning Approach for Invariant Mining in a Smart Grid
An ICS is vulnerable to cyber‐attacks arising from within its communication network or directly from the SCADA and devices such as PLCs. The study reported here presents a scenario‐specific invariant mining approach to detect anomalies in plant behaviour.
Danish Hudani +5 more
wiley +1 more source
A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
Frequent itemset mining (FIM) and high utility itemset mining (HUIM) are popular data mining techniques used in various real-world applications such as retail-market, bio-medicine, and click-stream analysis.
Muhammad Waheed Ashraf +2 more
doaj +1 more source
Mining frequent closed itemsets out of core [PDF]
Extracting frequent itemsets is an important task in many data mining applications. When data are very large, it becomes mandatory to perform the mining task by using an external memory algorithm, but only a few of these algorithms have been proposed so far.
LUCCHESE, Claudio +2 more
openaire +3 more sources
ABSTRACT Urban bus accidents present major safety and operational challenges, particularly in densely populated metropolitan areas. This study develops a machine learning‐based analytical framework to identify, quantify, and interpret the factors associated with severe bus accidents.
Bowei Chen +3 more
wiley +1 more source
Reductions for Frequency-Based Data Mining Problems
Studying the computational complexity of problems is one of the - if not the - fundamental questions in computer science. Yet, surprisingly little is known about the computational complexity of many central problems in data mining. In this paper we study
Miettinen, Pauli, Neumann, Stefan
core +1 more source
The purpose of this study was to explore the risk factors for autonomous vehicle (AV) crashes and their interdependencies. A total of 659 AV crash data were collected between 2018 and July 2024 from AV crash reports published by the California Department of Motor Vehicles.
Tao Wang +4 more
wiley +1 more source
ABSTRACT Machine learning techniques are increasingly used for high‐stakes decision‐making, such as college admissions, loan attribution, or recidivism prediction. Thus, it is crucial to ensure that the models learnt can be audited or understood by human users, do not create or reproduce discrimination or bias and do not leak sensitive information ...
Julien Ferry +4 more
wiley +1 more source
The numerous volumes of data generated every day necessitate the deployment of new technologies capable of dealing with massive amounts of data efficiently.
M. Sinthuja +5 more
doaj +1 more source

