Description: The text preprocessing program written by Python contains every step of implementation code, which is divided into delete punctuation marks, delete stop words, similarity calculation, PCA dimension reduction, clustering and visualization. The running environment is pytharm, python3 development environment.
To Search:
File list (Check if you may need any files):
Filename | Size | Date |
---|
EnglishChuLi | 0 | 2017-11-27
|
EnglishChuLi\.idea | 0 | 2017-11-27
|
EnglishChuLi\.idea\EnglishChuLi.iml | 398 | 2017-11-21
|
EnglishChuLi\.idea\misc.xml | 212 | 2017-11-21
|
EnglishChuLi\.idea\modules.xml | 276 | 2017-11-21
|
EnglishChuLi\.idea\workspace.xml | 24817 | 2017-11-27
|
EnglishChuLi\DeleteChar.py | 520 | 2017-11-25
|
EnglishChuLi\DeleteStop.py | 724 | 2017-11-25
|
EnglishChuLi\GetEnglishInformation.py | 1093 | 2017-11-21
|
EnglishChuLi\Kbean.py | 706 | 2017-11-25
|
EnglishChuLi\PCA.py | 732 | 2017-11-25
|
EnglishChuLi\similary.py | 1753 | 2017-11-25
|
EnglishChuLi\SnowballStemmer.py | 836 | 2017-11-25 |