Checking date: 22/07/2023


Course: 2023/2024

Programming in R
(17759)
Master in Statistics for Data Science (Plan: 386 - Estudio: 345)
EPI


Coordinating teacher: MARIN DIAZARAQUE, JUAN MIGUEL

Department assigned to the subject: Statistics Department

Type: Compulsory
ECTS Credits: 3.0 ECTS

Course:
Semester:




Requirements (Subjects that are assumed to be known)
None.
Objectives
The student will acquire the following knowledge: 1. Proficiency in the R programming language and the R-studio working environment. 2. Mastering the different types of data structures. 3. Exploratory data analysis techniques and presentation of results through data visualization techniques. 4. Familiarity with the main data analysis packages of R. 5. Be able to perform a simulation properly. 6. Accelerate the programs implemented by means of parallel programming. 7. Find errors and bottlenecks in their code and generate reports.
Skills and learning outcomes
Description of contents: programme
1. Basics of Programming I. The R-studio environment. Types of data (Arrays, Lists, Factors, Data Frames,...) and their operations. Control structures. Functions. 2. Basics of Programming II. Advanced data structures. Reading and storage of data. 3. Data visualization. The ggplot2 package. 4. Introduction to some useful packages in R. MASS, Caret, dplyr and data.table packages. 5. Simulations. 6. Parallel programming. 7. Debugging, Profiling and presentation of results with Rmarkdown.
Learning activities and methodology
The classes consist of a mixture of presentations on the fundamental concepts of the subject and the presentation of practical cases through the use of R software. Students are expected to bring their own laptops to experiment with the code during the lectures. * Training activities   - AF1: Theoretical lesson.   - AF2: Practical lesson.   - AF5: Tutorials.   - AF6: Group work.   - AF7: Individual work.   - AF8: On-site evaluation tests. * Teaching methodologies   - MD1: Class lectures by the professor with the support of computer and audiovisual media, in which the main concepts of the subject are developed and the bibliography is provided to complement the students' learning.   - MD2: Critical reading of texts recommended by the professor of the subject: press articles, reports, manuals and/or academic articles, either for later discussion in class, or to expand and consolidate the knowledge of the subject.   - MD3: Resolution of practical cases, problems, etc. posed by the teacher individually or in groups.   - MD4: Presentation and discussion in class, under the moderation of the professor of topics related to the content of the subject, as well as case studies.   - MD5: Preparation of papers and reports individually or in groups.
Assessment System
  • % end-of-term-examination 0
  • % of continuous assessment (assigments, laboratory, practicals...) 100
Calendar of Continuous assessment
Basic Bibliography
  • Felicidad Marques Asension. R en profundidad. Programación, gráficos y estadística. RC. 2017
  • Fox, J.. Using the R Commander: A Point-and-click Interface for R. CRC Press.. 2016
  • Irizarry, R.A.. Introduction to data science: data analysis and prediction algorithms with R. Boca Raton, Florida. CRC Press. 2020
  • Wickham, H., & Grolemund, G. (2016). R for data science: import, tidy, transform, visualize, and model data. O'Reilly Media, Inc.. 2016
Recursos electrónicosElectronic Resources *
(*) Access to some electronic resources may be restricted to members of the university community and require validation through Campus Global. If you try to connect from outside of the University you will need to set up a VPN


The course syllabus may change due academic events or other reasons.