Agenda

Data Mining and Oracle
Statistics, multivariate statistics and Data Mining - Data Mining cycle - Data preprocessing: Descriptive data aggregation, data cleansing, data integration and transformation - Data Reduction - Discretization and concept hierarchies - Data Mining and Business Intelligence: Databases, Data Warehouses and OLAP as the basis for Data Mining - Oracle architecture for Data Mining: database, Data Mining module and MS Excel add-in
Factors and influences
Factor Analysis and Principal Component Analysis - Outlier Analysis
Data Mining using Association analysis
Finding frequent patterns (Frequent Itemset Mining) - Apriori algorithm - association rules and association analysis - shopping basket analysis
Data Mining and Classification
Decision Trees: selection of attributes, tree pruning, deduction of rules, quality measures and comparison of models - Support Vector Machines: algorithms, building and using a model
Data Mining and Probability Theory
Classification using logistic regression - Probability and Bayes´s Theorem - Naïve Bayes: algorithms, building and using a model
Cluster Analysis
Introduction to Cluster Analysis - Similarity and distance measurement - Variants and basic techniques - Partitioning methods: k-Means Method - Hierarchical methods: agglomerative and divisive methods
Trainer

Marco Skulschus (born in Germany in 1978) studied economics in Wuppertal (Germany) and Paris (France) and wrote his master´s thesis about semantic data modeling. He started working as a lecturer and consultant in 2002.
Publications
- Grundlagen empirische Sozialforschung ISBN 978-3-939701-23-1
- System und Systematik von Fragebögen ISBN 978-3-939701-26-2
- Oracle PL/SQL ISBN 978-3-939701-40-8
- MS SQL Server - T-SQL Programmierung und Abfragen ISBN 978-3-939701-69-9
Projects
- He works as an IT-consultant and project manager. He developed various Business Intelligence systems for industry clients and the public sector. For several years now, he is responsible for a BI-team in India which is mainly involved in BI and OLAP projects, reporting systems as well as statistical analysis and Data Mining.
Research
He led several research projects and was leading scientist and project manager of a publicly funded project about interactive questionnaires and online surveys.
