000 03025 a2200229 4500
008 250812b |||||||| |||| 00| 0 eng d
020 _a9783031007859
082 _a006.312 DON
100 _aDong, Guozhu
245 _aExploiting the power of group differences: using patterns to solve data analysis problems
260 _aAG, Switzerland:
_bSpringer,
_c2019
300 _axv; 130p.:
_bpbk:
_c24 cm.
440 _aSynthesis lecture on data mining and knowledge discovery
504 _aIncludes Bibliography,author's biography and Index
520 _aThis book presents pattern-based problem-solving methods for a variety of machine learning and data analysis problems. The methods are all based on techniques that exploit the power of group differences. They make use of group differences represented using emerging patterns (aka contrast patterns), which are patterns that match significantly different numbers of instances in different data groups. A large number of applications outside of the computing discipline are also included. Emerging patterns (EPs) are useful in many ways. EPs can be used as features, as simple classifiers, as subpopulation signatures/characterizations, and as triggering conditions for alerts. EPs can be used in gene ranking for complex diseases since they capture multi-factor interactions. The length of EPs can be used to detect anomalies, outliers, and novelties. Emerging/contrast pattern based methods for clustering analysis and outlier detection do not need distance metrics, avoiding pitfalls of the latter in exploratory analysis of high dimensional data. EP-based classifiers can achieve good accuracy even when the training datasets are tiny, making them useful for exploratory compound selection in drug design. EPs can serve as opportunities in opportunity-focused boosting and are useful for constructing powerful conditional ensembles. EP-based methods often produce interpretable models and results. In general, EPs are useful for classification, clustering, outlier detection, gene ranking for complex diseases, prediction model analysis and improvement, and so on. EPs are useful for many tasks because they represent group differences, which have extraordinary power. Moreover, EPs represent multi-factor interactions, whose effective handling is of vital importance and is a major challenge in many disciplines. Based on the results presented in this book, one can clearly say that patterns are useful, especially when they are linked to issues of interest. We believe that many effective ways to exploit group differences' power still remain to be discovered. Hopefully this book will inspire readers to discover such new ways, besides showing them existing ways, to solve various challenging problems. https://link.springer.com/book/10.1007/978-3-031-01913-5
650 _aStatistics
650 _aData mining
650 _aKnowledge Discovery
650 _aInformation Storage and Retrieval
650 _aArtificial Intelligence
942 _cTD
_2ddc
999 _c63225
_d63225