Última actualización: 24/04/2018


Curso Académico: 2018/2019

Modelización Predictiva
(16494)
Grado en Ciencia e Ingeniería de Datos (Plan: 392 - Estudio: 350)


Coordinador/a: GARCIA PORTUGUES, EDUARDO

Departamento asignado a la asignatura: Departamento de Estadística

Tipo: Obligatoria
Créditos: 6.0 ECTS

Curso:
Cuatrimestre:




Requisitos (Asignaturas o materias cuyo conocimiento se presupone)
Cálculo I y II Algebra Lineal Programación Probabilidad y Análisis de Datos Introducción a la Modelización Estadística Aprendizaje Estadístico
* General competences   - CG1: Adequate knowledge and skills to analyse and synthesise basic problems related to engineering and data science, solve them and communicate them efficiently.   - CG4: Ability to solve technological, computational, mathematical and statistical problems that may arise in engineering and data science.   - CG5: Ability to solve mathematically formulated problems applied to different subjects, using numerical algorithms and computational techniques.   - CG6: Synthesise the conclusions obtained from the analyses carried out and present them clearly and convincingly, both written and orally. * Transversal competences   - CT1: Ability to communicate knowledge orally and in writing, before a specialised and non-specialised public. * Specific competences   - CE1: Ability to solve mathematical problems that may arise in engineering and data science. Ability to apply knowledge about: algebra; geometry; differential and integral calculation; numerical methods; numerical algorithm; statistics and optimisation.   - CE2: Properly identify problems of a predictive nature corresponding to certain objectives and data and use the basic results of regression analysis as the basic basis of prediction methods.   - CE5: Understand and handle fundamental concepts of probability and statistics and be able to represent and manipulate data to extract meaningful information from them.   - CE7: Understand the basic concepts of programming and ability to carry out programs aimed at data analysis.
Descripción de contenidos: Programa
This course is designed to give a panoramic view of several tools available for predictive modeling, at an introductory-intermediate level. This view covers in-depth the main concepts in (simple and multiple) linear models, gives an overview on their extensions, and treats more superficially regression trees. The focus is placed on providing the main insights on the statistical/mathematical foundations of the models and on showing the effective implementation of the methods through the use of the statistical software R. 1. Introduction 1.1 Course overview 1.2 What is predictive modeling? 1.3 Review on statistical inference 1.4 Review on probability 1.5 Software 2. Simple linear regression 2.1 Model formulation and estimation 2.2 Assumptions of the model 2.3 Inference for model parameters 2.4 Prediction 2.5 ANOVA and model fit 3. Multiple linear regression 3.1 Model formulation and estimation 3.2 Assumptions of the models 3.3 Inference for model parameters 3.4 ANOVA and model fit 3.5 Model selection 3.6 Use of qualitative predictors 3.7 Model diagnostics and multicollinearity 4. Linear regression extensions 4.1 Dimension reduction techniques 4.2 Regularization 4.3 Handling nonlinear relationships 4.4 Regression splines 4.5 Local linear regression 4.6 Logistic regression 5. Regression trees 5.1 Decision trees 5.2 Bagging 5.3 Random forest 5.4 Boosting The program is subject to small modifications due to the course development and/or academic calendar.
Actividades formativas, metodología a utilizar y régimen de tutorías
The lessons consist on a mixture of theory (methods description) and practice (implementation and practical usage of methods). The implementation of the methods is done with the statistical language R.
Sistema de evaluación
  • Peso porcentual del Examen Final 0
  • Peso porcentual del resto de la evaluación 100

Bibliografía básica
  • James, G., Witten, D., Hastiee, T. and Tibshirani, R. . An Introduction to Statistical Learning with Applications in R. Springer-Verlag. 2013
Bibliografía complementaria
  • Kuhn, M. and Johnson, K.. Applied Predictive Modeling. Springer. 2013
  • Peña, D.. Regresión y Diseño de Experimentos. Alianza Editorial. 2002
  • Wood, S. N.. Generalized Additive Models: An Introduction with R. Chapman & Hall/CRC. 2006
Recursos electrónicosRecursos Electrónicos *
(*) El acceso a algunos recursos electrónicos puede estar restringido a los miembros de la comunidad universitaria mediante su validación en campus global. Si esta fuera de la Universidad, establezca una VPN


El programa de la asignatura podría sufrir alguna variación por causa de fuerza mayor debidamente justificada o por eventos académicos comunicados con antelación.


Dirección web para más información: https://www.uc3m.es/ss/Satellite/Grado/en/Detalle/Estudio_C/1371241688824/1371212987094/Bachelor_s_Degree_in_Data_Science_and_Engineering