Skip to content

Software training - Text Mining in R

2 & 3 July 2024


Royal Statistical Society

Summary

Price
£427.20 - £592.80 inc VAT
Study method
Online + live classes
Duration
2 days · Part-time
Qualification
No formal qualification
Certificates
  • Certificate of Attendance - Free
Additional info
  • Tutor is available to students

Overview

Want to learn how to get the most out of text data? Today, a lot of data produced contains unstructured text, which can be difficult to transform and analyse without the correct knowledge and tools. This virtual course runs over two afternoons and will teach you the basics of manipulating and transforming text data as well as how to extract meaning and sentiment in R, using packages such as {stringr} and {tidytext}.

This course is being delivered on 2 & 3 July 2024

Certificates

Certificate of Attendance

Digital certificate - Included

Description

Level: Intermediate (I)


Want to learn how to get the most out of text data? Today, a lot of data produced contains unstructured text, which can be difficult to transform and analyse without the correct knowledge and tools. This virtual course runs two afternoons and will teach you the basics of manipulating and transforming text data as well as how to extract meaning and sentiment in R, using packages such as {stringr} and {tidytext}.

Topics Covered

  • Appreciating the benefits of text data
  • Cleaning and extracting text with {stringr} and regular expressions
  • Transforming and mining text with {tidytext}
  • Analysing the sentiment of text
  • Understanding the content of a text with TF-IDF

Learning Outcomes


By the end of the course, participants will be able to…

  • clean, manipulate, and transform text data with {stringr}
  • use basic regular expressions to extract and remove patterns in text
  • convert unstructured text data into a tidy format suitable for analysis with {tidytext}
  • understand basic text mining concepts, such as tokenization, stop words, n-grams, lemmatization and more
  • create beautiful plots of text data
  • analyse the sentiment of a piece of text and compare sentiment across texts and over time
  • extract representative words of a text to classify its content

Requirements

This course assumes basic familiarity with R and the {tidyverse}. We recommend first attending our Introduction to R course if you want to get up to speed for this course!

Questions and answers

Reviews

Currently there are no reviews for this course. Be the first to leave a review.

FAQs

Study method describes the format in which the course will be delivered. At Reed Courses, courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on Reed Courses, many of which can be completed online.

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body's logo if they wish. Certain awarding bodies - such as Quality Licence Scheme and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.

Interest free credit agreements provided by Zopa Bank Limited trading as DivideBuy are not regulated by the Financial Conduct Authority and do not fall under the jurisdiction of the Financial Ombudsman Service. Zopa Bank Limited trading as DivideBuy is authorised by the Prudential Regulation Authority and regulated by the Financial Conduct Authority and the Prudential Regulation Authority, and entered on the Financial Services Register (800542). Zopa Bank Limited (10627575) is incorporated in England & Wales and has its registered office at: 1st Floor, Cottons Centre, Tooley Street, London, SE1 2QG. VAT Number 281765280. DivideBuy's trading address is First Floor, Brunswick Court, Brunswick Street, Newcastle-under-Lyme, ST5 1HH. © Zopa Bank Limited 2024. All rights reserved.