Checking date: 19/05/2023


Course: 2023/2024

Data Analysis
(15986)
Bachelor in Computer Science and Engineering (2018 Study Plan) (Plan: 431 - Estudio: 218)


Coordinating teacher: GARCIA HERRERO, JESUS

Department assigned to the subject: Computer Science and Engineering Department

Type: Electives
ECTS Credits: 6.0 ECTS

Course:
Semester:




Requirements (Subjects that are assumed to be known)
Programming (1º, 1C) Artificial Intelligence (2º, 2C)
Objectives
- Cognitive 1. Evaluation based on multiple Theoretical machine learning tasks 2. Knowledge about several model building techniques working on data 3. Knowledge about practical techniques to deal with uncertainty and errors in data to take advantage of them
Skills and learning outcomes
Description of contents: programme
1. Introduction to Data Analysis and Data Mining 2. Machine learning with numeric techniques 2.1. Statistical analysis and causal relations 2.2. Bayesian classifiers. Numeric and symbolic attributes 3. Numerical learning 3.1. Regression 3.2. Clustering with numeric techniques: K-means, Expectation Maximization 4. Evaluation of Machine Learning Models 4.1. Confusion matrices 4.2. Comparison of alternatives, significance contrasts 5. Attribute analysis 5.1. Non-supervised selection 5.2. Attribute transformation 5.3. Supervised selection 6. Methodology of data mining projects 6. Introduction to other advanced techniques (combination, SVM , Fuzzy systems, GAs)
Learning activities and methodology
Theoretical lectures: 2 ECTS. To achieve the specific cognitive competences of the course. Practical lectures: 2,5 ECTS. To develop the specific instrumental competences and most of the general competences, such as analysis, abstraction, problem solving and capacity to apply theoretical concepts. Besides, to develop the specific attitudinal competences. Guided academic activities (present teacher): 1,5 ECTS. The student proposes a project according to the teachers guidance to go deeply into some aspect of the course, followed by public presentation.
Assessment System
  • % end-of-term-examination 30
  • % of continuous assessment (assigments, laboratory, practicals...) 70
Calendar of Continuous assessment
Basic Bibliography
  • I. Witten y E. Frank. Data Mining: Practical Machine Learning Tools and Techniques (Third Edition) . Morgan Kaufmann. 2011
  • Jesús García, Antonio Berlanga, José M. Molina, Miguel A. Patricio. Ciencia de datos: Técnicas analíticas y aprendizaje estadístico en un enfoque práctico. Altaria. 2018
Additional Bibliography
  • David Hand, Heikki Mannila. Principles of data mining. MIT Press. 2002
  • Pérez López, César. Estadística aplicada a través de Excel. Prentice Hall. 2002

The course syllabus may change due academic events or other reasons.