- Shopping Bag ( 0 items )
Mining Very Large Databases with Parallel Processing addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely 'intelligent' (machine learning-based) data mining techniques, relational databases and parallel processing. The basic idea is to use concepts and techniques of the latter two areas - particularly parallel processing - to speed up and scale up data mining algorithms.
The book is divided into three parts. The first part presents a comprehensive review of intelligent data mining techniques such as rule induction, instance-based learning, neural networks and genetic algorithms. Likewise, the second part presents a comprehensive review of parallel processing and parallel databases. Each of these parts includes an overview of commercially-available, state-of-the-art tools. The third part deals with the application of parallel processing to data mining. The emphasis is on finding generic, cost-effective solutions for realistic data volumes. Two parallel computational environments are discussed, the first excluding the use of commercial-strength DBMS, and the second using parallel DBMS servers.
It is assumed that the reader has a knowledge roughly equivalent to a first degree (BSc) in accurate sciences, so that (s)he is reasonably familiar with basic concepts of statistics and computer science.
The primary audience for Mining Very Large Databases with Parallel Processing is industry data miners and practitioners in general, who would like to apply intelligent data mining techniques to large amounts of data. The book will also be of interest to academic researchers and postgraduate students, particularly database researchers, interested in advanced, intelligent database applications, and artificial intelligence researchers interested in industrial, real-world applications of machine learning.
Preface. Introduction. Part I: Knowledge Discovery and Data Mining. 1. Knowledge Discovery Tasks. 2. Knowledge Discovery Paradigms. 3. The Knowledge Discovery Process. 4. Data Mining. 5. Data Mining Tools. Part II: Parallel Database Systems. 6. Basic Concepts on Parallel Processing. 7. Data Parallelism, Control Parallelism and Related Issues. 8. Parallel Database Servers. Part III: Parallel Data Mining. 9. Approaches to Speed up Data Mining. 10. Parallel Data Mining Without DBMS Facilities. 11. Parallel Data Mining With Database Facilities. 12. Summary and Some Open Problems. References. Index.