:: Topics Covered
- General principle of data modelling
- Different types of models
- Choice of a type of model given the nature of the response variable
- Choice of a type of model given the relationship between the response and the predictors
- Data preparation for modelling
- Measuring model performance
- Diagnotic tools for models
- Univariate vs. multivariate models
- Specific models with multivariate models
- Simple linear regression models
- Multiple linear regression models
- Handling redundancy in the explanatory variables
- Variable selection
- Models for categorical variables
- Logistic regression
- Specific tools to assess model performance in logistic regression: ROC curve
- The notion of odds ratio
- Alternatives to logistic regression
- Non-linear regression
- General considerations on reporting regression results
- Use of models for prediction purposes
:: Course ContentThis workshop is articulated around three sessions covering linear regression, regression models for categorical data and non-linear regression. It brings to the fore similarities and differences between these methods by focusing on the correct usage of each of them.
Among the covered topics, particular attention is paid to data preparation, missing data handling, choice of an appropriate modelling technique, the difference between explanatory and predictive models, handling redundancy in the predictors, variable selection and uncertainty measures on the predictions generated with models.
Moreover, an outlook of more recent modelling methods will be presented.
Finally, a important portion of the time is devoted to hands-on applications during which participants will have the opportunity to use their statistical software to build models using their own data sets that they are invited to bring along (a variety of datasets will be provided to participants who do not have the chance to use their own).
|