Checking date: 07/05/2018


Course: 2018/2019

Introduction to Data Science
(16475)
Bachelor in Data Science and Engineering (Plan: 392 - Estudio: 350)


Coordinating teacher: DELGADO GOMEZ, DAVID

Department assigned to the subject: Statistics Department

Type: Basic Core
ECTS Credits: 6.0 ECTS

Course:
Semester:

Branch of knowledge: Engineering and Architecture



At the end of the course students will be able: To understand the importance of data science in today's knowledge society. Use data visualization techniques to understand the problems faced by a data scientist and to report the results obtained. To know when to use a supervised or an unsupervised data analysis technique. To know the main data analysis techniques and applications where they have been used successfully. To know the main problems a data scientist may encounter and how to deal with them To know the different actual data analysis tools.
Description of contents: programme
1. The Importance of Data Science 2. Understanding the Data: Case Studies of Exploratory Data Analysis and Visualization Techniques I 3. Understanding the Data: Case Studies of Exploratory Data Analysis and Visualization Techniques II 4. Importance of a good design of the experiment and choice of performance measures: precision, sensitivity, specificity, ROC curves. Over-fitting. 5. Introduction to unsupervised techniques: case studies of clustering I 6. Case studies of clustering II 7. Introduction to unsupervised classification: case studies on decision trees and random forests. 8. Case studies on data reduction techniques (Principal Component Analysis, Independent Component Analysis, Fisher Discriminant Analysis). 9. Introduction to Regression: Case Studies of Linear Regression. 10. Case studies of Logistic Regression. 11. Case studies on probabilistic models. 12. Introduction to the state of the art: case studies on Support vector machines. 13. Case studies on Deep Learning.
Learning activities and methodology
The course is taught in 14 theoretical lessons and 14 practical lessons. In the theoretical classes, the main concepts of the subject will be explained. These concepts will be further elaborated in the practical classes in which various computer-based data analyses will be carried out. The students will also have a collective tutorial where they will have the opportunity to resolve any doubts they may have about the theoretical and practical classes or about the assignments they have to carry out.
Assessment System
  • % end-of-term-examination 50
  • % of continuous assessment (assigments, laboratory, practicals...) 50

Basic Bibliography
  • Peter Bruce & Andrew Bruce. Practical Statistics for data scientists:50 essential concepts. O'really. 2018

The course syllabus may change due academic events or other reasons.