Exploratory Data Analysis with MATLAB, Second Edition
CRC Press – 2010 – 536 pages
Since the publication of the bestselling first edition, many advances have been made in exploratory data analysis (EDA). Covering innovative approaches for dimensionality reduction, clustering, and visualization, Exploratory Data Analysis with MATLAB®, Second Edition uses numerous examples and applications to show how the methods are used in practice.
New to the Second Edition
Like its predecessor, this edition focuses on using EDA methods rather than on their theoretical aspects. The MATLAB code for the examples, EDA toolboxes, data sets, and color versions of all figures are available for download at http://pi-sigma.info.
"This book presents a broad panoply of data-analytical methods implemented in MATLAB. … the amount of material covered is impressive. The explanations are clear, and the fluid style makes reading pleasant. … very useful for the applied statistician. Its material may also be employed as a complement to a more theoretical-oriented course."
—R. Maronna, Statistical Papers, Vol. 55, 2014
"The book is very helpful for applied data analysts as an excellent compact overview of popular available methods supplied with a MATLAB code. … Common features and differences between various methods are carefully explained and the book is well understandable from the perspective of the users. … The book, written by very experienced authors, can be strongly recommended as an excellent manual for MATLAB users who need to extract information from their data."
—Jan Kalina, ISCB Newsletter, June 2013
"The authors present an intuitive and easy-to-read book. … accompanied by many examples, proposed exercises, good references, and comprehensive appendices that initiate the reader unfamiliar with MATLAB. … a great contribution to the field of data analysis, which I am sure will be useful for researchers and practitioners."
—Adolfo Alvarez Pinto, International Statistical Review (2011), 79
"Practitioners of EDA who use MATLAB will want a copy of this book. … The authors discuss many EDA methods, including graphical approaches. With the book comes the EDA Toolbox (downloadable from the text website) for use with MATLAB. It contains code for all of the algorithms discussed in the text.
… the authors strategically inject helpful observations and guidance into the examples throughout the book.
… this book does not merely document routines; it shows how to do EDA. The helpful summaries, intuitive explanations, and comprehensive examples make the text so much more than a software cookbook. … The authors have done a great service by bringing together so many EDA routines, but their main accomplishment in this dynamic text is providing the understanding and tools to do EDA.
This text, along with the EDA Toolbox, is an excellent resource. Even readers with limited background can quickly be analyzing data and plotting it in interesting ways. For practitioners of EDA who use MATLAB, and ideally also the Statistics Toolbox, I highly recommend this book."
—MAA Reviews, April 2011
Praise for the First Edition:
"This book … has a good introduction to EDA, and then illustrates several applications where MATLAB provides the analysis of data to produce unexpected results."
"The audience for the book is a wide one and includes statisticians, computer scientists, and others who may be interested in or use EDA. … I found the book to be engagingly written, and successful in its defined task of teaching the reader to use EDA with MATLAB. I liked the graphics and thought that they fully illustrated the techniques used."
—Brian Jersky, Journal of the American Statistical Association
"The book can also be useful in a classroom setting at the senior undergraduate and graduate level, valuable exercises being included in each chapter."
—Neculai Curteanu, Zentralblatt MATH
INTRODUCTION TO EXPLORATORY DATA ANALYSIS
Introduction to Exploratory Data Analysis
What Is Exploratory Data Analysis
Overview of the Text
A Few Words about Notation
Data Sets Used in the Book
EDA AS PATTERN DISCOVERY
Dimensionality Reduction - Linear Methods
Principal Component Analysis (PCA)
Singular Value Decomposition (SVD)
Nonnegative Matrix Factorization
Fisher’s Linear Discriminant
Dimensionality Reduction - Nonlinear Methods
Multidimensional Scaling (MDS)
Artificial Neural Network Approaches
Projection Pursuit Indexes
Independent Component Analysis
Evaluating the Clusters
Overview of Model-Based Clustering
Hierarchical Agglomerative Model-Based Clustering
MBC for Density Estimation and Discriminant Analysis
Generating Random Variables from a Mixture Model
Residuals and Diagnostics with Loess
Choosing the Smoothing Parameter
Bivariate Distribution Smooths
Curve Fitting Toolbox
GRAPHICAL METHODS FOR EDA
Plotting Points as Curves
Data Tours Revisited
Appendix A: Proximity Measures
Appendix B: Software Resources for EDA
Appendix C: Description of Data Sets
Appendix D: Introduction to MATLAB
Appendix E: MATLAB Functions
Summary, Further Reading, and Exercises appear at the end of each chapter.
Wendy L. Martinez has been in government service for over 20 years, working with leading researchers from academia, industry, and government labs. During this time, she has conducted and published research in text data mining, probability density estimation, signal processing, scientific visualization, and statistical pattern recognition. A fellow of the American Statistical Association, she earned an M.S. in aerospace engineering from George Washington University and a Ph.D. in computational sciences and informatics from George Mason University.
Angel R. Martinez teaches undergraduate and graduate courses in statistics and mathematics at Strayer University. Before retiring from government service, he worked for the U.S. Navy as an operations research analyst and a computer scientist. He earned an M.S. in systems engineering from the Virginia Polytechnic Institute and State University and a Ph.D. in computational sciences and informatics from George Mason University.
Since 1984, Jeffrey L. Solka has been working in statistical pattern recognition for the Department of the Navy. He has published over 120 journal, conference, and technical papers; has won numerous awards; and holds 4 patents. He earned an M.S. in mathematics from James Madison University, an M.S. in physics from Virginia Polytechnic Institute and State University, and a Ph.D. in computational sciences and informatics from George Mason University.