Results 61 to 70 of about 8,898 (200)
Frequent-Itemset Mining Using Locality-Sensitive Hashing [PDF]
The Apriori algorithm is a classical algorithm for the frequent itemset mining problem. A significant bottleneck in Apriori is the number of I/O operation involved, and the number of candidates it generates. We investigate the role of LSH techniques to overcome these problems, without adding much computational overhead. We propose randomized variations
Bera, Debajyoti, Pratap, Rameshwar
openaire +2 more sources
The purpose of this study was to explore the risk factors for autonomous vehicle (AV) crashes and their interdependencies. A total of 659 AV crash data were collected between 2018 and July 2024 from AV crash reports published by the California Department of Motor Vehicles.
Tao Wang +4 more
wiley +1 more source
ABSTRACT Machine learning techniques are increasingly used for high‐stakes decision‐making, such as college admissions, loan attribution, or recidivism prediction. Thus, it is crucial to ensure that the models learnt can be audited or understood by human users, do not create or reproduce discrimination or bias and do not leak sensitive information ...
Julien Ferry +4 more
wiley +1 more source
Efficient Analysis of Pattern and Association Rule Mining Approaches
The process of data mining produces various patterns from a given data source. The most recognized data mining tasks are the process of discovering frequent itemsets, frequent sequential patterns, frequent sequential rules and frequent association rules.
Lazzez, Amor, Slimani, Thabet
core +1 more source
ABSTRACT Background and Aims Infertility, as defined by the World Health Organization, is the inability to conceive after 12 months of regular, unprotected intercourse. This study aimed to identify factors influencing infertility by applying data mining techniques, specifically rule‐mining methods, to analyze diverse patient data and uncover relevant ...
Hosna Heydarian +3 more
wiley +1 more source
This paper proposed an Automated Quality Assessment of SRS (AQA‐SRS) framework by integrating four popular methods which are; NLP, K‐means, MAS, and CBR to assess the quality of SRS documents. The NLP utilize for feature extraction, K‐means for features clustering, MAS for interactive assessment and feature selection decision, and CBR for managing the ...
Mohammed Ahmed Jubair +6 more
wiley +1 more source
Observations on Factors Affecting Performance of MapReduce based Apriori on Hadoop Cluster
Designing fast and scalable algorithm for mining frequent itemsets is always being a most eminent and promising problem of data mining. Apriori is one of the most broadly used and popular algorithm of frequent itemset mining.
Garg, Rakhi +2 more
core +1 more source
An AI knowledge‐based system for police assistance in crime investigation
Abstract The fight against crime is often an arduous task overall when huge amounts of data have to be inspected, as is currently the case when it comes for example in the detection of criminal activity on the dark web. This work presents and describes an artificial intelligence (AI) based system that combines various tools to assist police or law ...
Carlos Fernandez‐Basso +4 more
wiley +1 more source
A Fast Approach for Up-Scaling Frequent Itemsets
With the rapid growth of data scale and diversification of demand, people have an urgent desire to extract useful frequent itemset from datasets of different scales. It is no doubt that the traditional method can solve the problem.
Runzi Chen, Shuliang Zhao, Mengmeng Liu
doaj +1 more source
The coal industry has always been a typically high‐risk industry with frequent accidents and extremely adverse impacts. Cases of accidents in coal mine ventilation systems serve as a concentrated demonstration of accident hazards and hold significant value for identifying key risk factors that may induce disasters in coal mine ventilation systems. This
Mingjia Jing +4 more
wiley +1 more source

