A Data Science Central Community

By University of Illinois at Urbana-Champaign. Instructors: Abel Bliss, Professor, Department of Computer Science; John C. Hart, Professor, Department of Computer Science; ChengXiang Zhai, Professor, Department of Computer Science.

Purpose:

- Understand the basic concepts and principles in data mining and visualization.
- Learn commonly used algorithms for mining both structured and unstructured (text) data.
- Understand how to handle a large amount of text data with search engines.
- Gain experience applying some of the algorithms to solve real world data mining problems.

Module 1 - Pattern Discovery in Data Mining

- Introduction to data mining
- Concepts and challenges in pattern discovery and analysis
- Scalable pattern discovery algorithms
- Pattern evaluation
- Mining flexible patterns in multi-dimensional space
- Mining sequential patterns
- Mining graph patterns
- Pattern-based classification
- Application examples of pattern discovery

Module 2 - Text Retrieval and Search Engines

- Introduction to text data mining
- Basic concepts in text retrieval
- Information retrieval models
- Implementation of a search engine
- Evaluation of search engines
- Advanced search engine technologies

Module 3 - Cluster Analysis and Data Mining

- Basic concept and introduction
- Partitioning methods
- Hierarchical methods
- Density-based methods
- Probabilistic models and EM algorithm
- Spectral clustering
- Clustering high dimensional data
- Clustering streaming data
- Clustering graph data and network data
- Constraint-based clustering and semi-supervised clustering
- Application examples of cluster analysis

Module 4 - Text Mining and Analytics

- Overview of text analytics and applications
- Extending a search engine to support text analytics (text categorization, text clustering, text summarization)
- Topic mining and analysis with statistical topic models
- Opinion mining and summarization
- Integrative analysis of text and structured data

Module 5 - Visualization

- Week 1: Visualization Infrastructure (graphics programming and human perception)
- Week 2: Basic Visualization (charts, graphs, animation, interactivity)
- Week 3: Visualizing Relationships (hierarchies, networks)
- Week 4: Visualizing Information (text, databases)

Capstone

The 6-week long capstone project class will allow you to apply the learned algorithms and techniques for data mining from the previous courses in the specialization to solve interesting real-world data mining challenges. After finishing the capstone project class, you can expect to generate tangible results such as reports or prototype systems, which can be shown to potential employers to demonstrate their skills. Projects will be drawn from real-world problems and will be conducted with industry, government, and academic partners.

You must take the capstone project class after taking all the other courses in this specialization.

**DSC Resources**

- Training | Books | Cheat Sheet | Apprenticeship | Certification
- Core Articles | Competitions and Challenges | Salary Surveys
- Top Links | Newsletter | Webinars | Jobs | Our Book

**Additional Reading**

Tags:

© 2021 TechTarget, Inc. Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions