返回主页

高级数据挖掘》课程相关信息

基本信息:

授课地点:中南大学铁道校区电子楼507

电子楼位于中南大学铁道校区西门附近,距离住处需步行1015分钟。

住宿安排

中南大学迎宾楼

位置:中南大学铁道校区东门外向北50左右

双人标间:208/188/160(无网络) /

提供早餐。

交通路线

乘坐火车到达长沙火车站,可乘坐7107路公共汽车(出火车站后向左走,在火车站售票厅后)到达中南大学铁道校区(前行100左右即到迎宾楼)。从火车站打车到中南大学铁道校区需花费20元左右。

如乘坐飞机到达长沙黄花机场,打车赴中南大学铁道校区需花费100元左右;可乘坐民航班车到达终点站(民航大厦);然后打车赴中南大学铁道校区需花费20元左右。

高级数据挖掘》课程简介

Advanced Data Mining and Real-world Applications

Charles X. Ling, Professor

Department of Computer Science

The University of Western Ontario, Canada

 

SYNOPSIS:

I will first give a quick and comprehensive review of basic data mining tasks (and algorithms), including:

-          Classification (decision trees, neural networks, etc.)

-          Regression (regression trees, k-NN, etc.)

-          Clustering (k-means, etc.)

-          Association (Apriori)

Then I will teach some recent, advanced topics in data mining and machine learning, including:

-          Support Vector Machines (SVM)

-          Semi-supervised learning (co-training, EM, etc.)

-          Ensemble learning (bagging, boosting)

-          Cost-sensitive learning

-          Active learning

-          Bayesian learning

-          Feature selection and extraction

I will also discuss many real-world applications of data mining:

-          Mining market and stock data

-          Direct marketing

-          Credit risk prediction

-          Action mining

-          Mining medical data for medical diagnosis

-          Profit mining

-          Text mining

-          Mining for search engines

-          Social network discovery

The lecture will be delivered in a mixture of English and Chinese. Students are advised to attend all classes and follow the lecture notes closely. Active class participation and discussions are highly encouraged.

On the last day of the lecture, a discussion salon with all students will be held, and future research collaboration will be encouraged.

Graduate students who take this course for credit must complete a hand-on project, and write an exam at the end. 

PREREQUISITES:

Basic knowledge on data mining, machine learning, and/or Artificial Intelligence (4th-year undergraduate or Masters' level) would help to understand the lecture well.  

TEXTBOOKS:

There are NO particular textbooks for this course because it is an advanced course. The majority of the course materials will come from research papers, PPT, and relevant reference books.

REFERENCE MATERIALS:

Books:

Data Mining: Practical Machine Learning Tools and Techniques (2nd edition). By Ian H. Witten and Eibe Frank. Morgan Kaufmann, 2005.

Data Mining: Concepts and Techniques (2nd edition). By Jiawei Han and Micheline Kamber. Morgan Kaufmann. 2006.

Machine Learning, by Tom Mitchell, McGraw Hill, 1997.

Software:

The WEKA Package (see http://www.cs.waikato.ac.nz/ml/weka/) is the most popular and powerful data mining tool used by machine learning researchers and data-mining practitioners around the world. Freely downloadable with open source code in Java. This is also the accompanying software for the book “Data Mining: Practical Machine Learning Tools and Techniques”.

Journal articles:

Articles from Journal of Machine Learning Research, Machine Learning, KDD, IEEE TKDE, and so on may be used in the class. 

Conference papers:

Papers from KDD, IEEE ICDM, ICML, ECML, PAKDD, PKDD, and so on may be used in the lectures.

Useful website on data mining: http://www.kdnuggets.com/