-
Notifications
You must be signed in to change notification settings - Fork 66
Expand file tree
/
Copy pathnlp_eda_python.Rmd
More file actions
15 lines (9 loc) · 841 Bytes
/
nlp_eda_python.Rmd
File metadata and controls
15 lines (9 loc) · 841 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
# Natural Langauge Processing Common EDA Practices in Python
Zhucheng Zhan, Yunchen Yao
```{r, include=FALSE}
knitr::opts_chunk$set(warning = FALSE, message = FALSE)
```
In Natural Language Processing, exploratory data analysis is an important and essential part before training models. We could obtain adequate information from text data visualization, like getting the most frequently used word quickly to understand what the text is about, and many more.
For the community contribution, we introduce four major techniques that are commonly used to understand text data and give an example of implementing these techniques by python. We take Twitter text data as our example to illustrate the common practices for NLP EDA.
Check out the tutorial using the following link:
https://github.com/skyyce/EDAV_CC/blob/main/EDA_in_NLP.ipynb