McMASTER UNIVERSITY STATISTICS SEMINAR

Week of November 27 - December 1, 2000

SPEAKER:

Dr Peter Macdonald
Department of Mathematics & Statistics, McMaster University

TITLE:

"A Data Mining Case Study"

DAY:

Wednesday, November 29, 2000

TIME:

3:30 p.m. [Tea & cookies in BSB-202 at 3:00 p.m.]

PLACE:

BSB-108

SUMMARY

The Case Studies sessions at the Statistical Society of Canada 2000 Annual Meeting included a Data Mining exercise provided by Gary Saarenvirta, formerly of The Loyalty Group, now with IBM Canada. McMaster students Swetlana Ljubicic and Melissa Naglic worked on this Case Study, and we presented our results alongside teams from UBC, Waterloo, Guelph and Carleton.

I will introduce some of the statistical methods commonly used in Data Mining, including logistic regression, classification and regression trees, and latent variable methods (principal components, projection on latent variables), and discuss the results obtained by McMaster and some of the other teams.

ABOUT THE SPEAKER

Peter Macdonald received his Bachelor's and Master's degrees in Mathematics from the University of Toronto, and his D.Phil. in Biomathematics from the University of Oxford. He joined McMaster in 1971 and has spent research leaves at l'Institut National de la Santé et de la Recherche Médicale in Villejuif, France, and La Trobe University, Bundoora, Australia. He has held many positions in the Statistical Society of Canada, including that of President in 1990-91.

REFERENCES

In preparation for this seminar, you should read the description of the Case Study on the Web, download the data, and try some exploratory analysis.

