1. Theoretical introduction to Natural Language Processing
1.1. Brief history of computational linguistics and main developments
1.2. What is Natural Language Processing and its role in Artificial Intelligence
1.3. Structure of a basic NLP pipeline
1.4. Most common tasks and applications in the industry
1.5. Current importance in the digital society, main initiatives
2. Practical introduction to automatic language analysis with R
2.1. Source text import, dataset design and creation of data structures
2.2. Text cleaning, removal of stopwords and symbols, missing values and duplicates
2.3. Splitting and tokenization processes
2.4. Basic analysis: word count, n-gram extraction, frequency tables
2.5. Intermediate analysis: distinctiveness analysis, tf-idf, bag of words
3. Introduction to sentiment analysis
3.1. What is automatic sentiment analysis in a text: opinion, emotion and intention of the speaker
3.2. Real-world cases of sentiment analysis in the industry and limitations
3.3. Practical training on automatic sentiment analysis: use of lexicons and dictionaries, automatic sentiment mapping, segmentation, word clouds
3.4. Creation of sentiment analysis graphs and reports
4. Introduction to topic modeling
4.1. What is topic modeling, main uses in the industry
4.2. Classifying text into categories: supervised and unsupervised methods
4.3. Practical training in topic modelling: word and topic association, natural group identification and characterization, common terms and overlapping
4.4. Creation of topic modeling graphs and reports for identification of representative ideas
5. Language models
5.1. What are pre-trained language models and their impact on NLP and Machine Learning development
5.2. Uses and implications in the industry and current status, main initiatives
5.3. Practical training on the use and evaluation of basic predictive models with text data