Introduction to Text Analytics with R Part 1 | Overview
Data Science Dojo

Published on Jun 5, 2017

This data science series introduces the viewer to the exciting world of text analytics with R programming. As the popularity of blogging and social media shows, textual data is far from dead; it is increasing exponentially! Not surprisingly, knowledge of text analytics is a critical skill for data scientists if this wealth of information is to be harvested and incorporated into data products. This data science training provides introductory coverage of the following tools and techniques:
– Tokenization, stemming, and n-grams
– The bag-of-words and vector space models
– Feature engineering for textual data (e.g. cosine similarity between documents)
– Feature extraction using singular value decomposition (SVD)
– Training classification models using textual data
– Evaluating the accuracy of the trained classification models
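
For readers who want a feel for the first few items before watching, here is a minimal base R sketch of tokenization, a bag-of-words matrix, cosine similarity, and a truncated SVD. It is not the code from the series: the three toy documents, the `tokenize` and `cosine` helpers, and the choice of `k = 2` are all assumptions made up for illustration.

```r
# Toy documents (invented for this sketch; not taken from the SMS spam dataset).
docs <- c(
  "free prize call now",
  "call me when you are free",
  "lunch at noon"
)

# Tokenization: lower-case the text, then split on runs of non-letters.
# `tokenize` is a hypothetical helper, not a function from any package.
tokenize <- function(text) {
  pieces <- strsplit(tolower(text), "[^a-z]+")[[1]]
  pieces[pieces != ""]
}
tokens <- lapply(docs, tokenize)

# Bag-of-words: one row per document, one column per vocabulary term,
# each cell holding how often the term occurs in the document.
vocab <- sort(unique(unlist(tokens)))
dtm <- t(vapply(
  tokens,
  function(tk) as.integer(table(factor(tk, levels = vocab))),
  integer(length(vocab))
))
colnames(dtm) <- vocab

# Cosine similarity between two term-count vectors (hypothetical helper).
cosine <- function(a, b) {
  sum(a * b) / (sqrt(sum(a^2)) * sqrt(sum(b^2)))
}

cosine(dtm[1, ], dtm[2, ])  # documents 1 and 2 share "call" and "free"
cosine(dtm[1, ], dtm[3, ])  # documents 1 and 3 share no terms

# Feature extraction with SVD: keep the first k singular vectors so each
# document is described by k numbers instead of one count per term.
k <- 2  # assumed value, chosen only to keep the example small
decomp <- svd(dtm)
reduced <- decomp$u[, 1:k] %*% diag(decomp$d[1:k])
dim(reduced)
```

Base R's `svd()` computes the full decomposition, which is fine for a toy matrix; on a real document-term matrix a truncated method would usually be the more practical choice.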

This overview video introduces text analytics as a whole and outlines what to expect from the rest of the series. It also covers:
– Overview of the spam dataset used throughout the series
– Loading the data and initial data cleaning
– Some initial data analysis, feature engineering, and data visualization
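
The steps listed above can be sketched in a few lines of base R. This is an illustration under stated assumptions rather than the code from the video: the toy data frame stands in for the SMS spam file, and the column names `Label`, `Text`, and `TextLength` are assumed names chosen for the example.

```r
# Toy stand-in for the SMS spam data (messages invented for this sketch).
# With the real file, this step would be a read.csv() call on the downloaded
# CSV; the file name and encoding depend on the copy you download.
sms <- data.frame(
  Label = c("ham", "spam", "ham", "spam"),
  Text = c(
    "See you at noon",
    "WINNER! Claim your free prize now, call today",
    "Running late",
    "Free entry: text WIN to claim"
  ),
  stringsAsFactors = FALSE
)

# Find missing data: count rows with at least one missing value.
sum(!complete.cases(sms))

# Explore the data: treat the label as a factor and look at the class balance.
sms$Label <- as.factor(sms$Label)
prop.table(table(sms$Label))

# Text length: a simple engineered feature, the character count per message.
sms$TextLength <- nchar(sms$Text)
tapply(sms$TextLength, sms$Label, mean)
```

Comparing the mean `TextLength` per label is a quick way to check whether message length carries any signal before building more elaborate features.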

Kaggle Dataset:
https://www.kaggle.com/uciml/sms-spam...

The data and R code used in this series are available here:
https://code.datasciencedojo.com/data...

Table of Contents:
0:00 Introduction
11:06 Packages
13:21 Read CSV
17:04 Find missing data
19:05 Explore the data
23:14 Text length

--

At Data Science Dojo, we believe data science is for everyone. Our data science trainings have been attended by more than 10,000 employees from over 2,500 companies globally, including many leaders in tech like Microsoft, Google, and Facebook. For more information please visit: https://hubs.la/Q01Z-13k0

Want to share your data science knowledge? Boost your profile and share your knowledge with our community: https://hubs.la/Q01ZZNCn0

#textanalytics #rprogramming
