Course: 2019/2020

Programming in R

(17759)

The student will acquire the following knowledge:
1. Proficiency in the R programming language and the R-studio working environment.
2. Mastering the different types of data structures.
3. Exploratory data analysis techniques and presentation of results through data visualization techniques.
4. Familiarity with the main data analysis packages of R.
5. Be able to perform a simulation properly.
6. Accelerate the programs implemented by means of parallel programming.
7. Find errors and bottlenecks in their code and generate reports.

Description of contents: programme

1. Basics of Programming I.
The R-studio environment. Types of data (Arrays, Lists, Factors, Data Frames,...) and their operations. Control structures. Functions.
2. Basics of Programming II.
Advanced data structures. Reading and storage of data.
3. Data visualization.
The ggplot2 package.
4. Introduction to some useful packages in R.
MASS, Caret, dplyr and data.table packages.
5. Simulations.
6. Parallel programming.
7. Debugging, Profiling and presentation of results with Rmarkdown.

Learning activities and methodology

The course will be taught in 7 practical classes in a computer room.
The students will have collective tutoring sessions where they will be able to resolve their doubts regarding both the material explained in the classes and the practical assignments that will be evaluated.

Assessment System

- % end-of-term-examination 50
- % of continuous assessment (assigments, laboratory, practicals...) 50

Basic Bibliography

- Wickham, H., & Grolemund, G. (2016). R for data science: import, tidy, transform, visualize, and model data. O'Reilly Media, Inc.. 2016