text_cleaning

Getting started

To clean an entire folder of .TXT files, you can run:

cd ~ 
cd allie/cleaning/text_cleaning
python3 cleaning.py /Users/jimschwoebel/allie/load_dir
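
For orientation, here is a minimal sketch of what cleaning a folder of text files can look like. It is not the code in cleaning.py: the function names `normalize_whitespace` and `clean_folder` are hypothetical, and the whitespace-collapsing cleaner is only a stand-in for whichever cleaning technique is configured.

```python
import re
from pathlib import Path
from typing import List


def normalize_whitespace(text: str) -> str:
    """Hypothetical cleaner: collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text).strip()


def clean_folder(folder: str) -> List[str]:
    """Clean every .txt/.TXT file directly inside `folder`, in place.

    Returns the names of the files that were rewritten.
    """
    cleaned = []
    for path in sorted(Path(folder).iterdir()):
        # Match the extension case-insensitively so .TXT and .txt both qualify.
        if path.is_file() and path.suffix.lower() == ".txt":
            original = path.read_text(encoding="utf-8")
            path.write_text(normalize_whitespace(original), encoding="utf-8")
            cleaned.append(path.name)
    return cleaned
```

Because this sketch overwrites files in place, it is safer to try it on a copy of the folder first.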

Settings

Here are some default settings relevant to this section of Allie's API:

| setting | description | default setting | all options |
| --- | --- | --- | --- |
| clean_data | whether or not to clean datasets during the model training process via default cleaning scripts. | False | True, False |
| default_text_cleaners | the default cleaning techniques used during model training on text data if clean_data == True | ["clean_textacy"] | ["clean_summary", "clean_textacy"] |
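
The interaction between these two settings can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Allie's own code: the settings are modeled as a plain dictionary, and `active_text_cleaners` is a hypothetical helper. Only the setting names, defaults, and options come from the settings listed above.

```python
from typing import Any, Dict, List

# Defaults and allowed values copied from the settings listed above.
DEFAULT_SETTINGS: Dict[str, Any] = {
    "clean_data": False,
    "default_text_cleaners": ["clean_textacy"],
}
ALL_TEXT_CLEANERS = ["clean_summary", "clean_textacy"]


def active_text_cleaners(settings: Dict[str, Any]) -> List[str]:
    """Hypothetical helper: return the text cleaners that would run.

    Missing keys fall back to the defaults; nothing runs unless
    clean_data is True.
    """
    merged = {**DEFAULT_SETTINGS, **settings}
    if not merged["clean_data"]:
        return []
    unknown = [c for c in merged["default_text_cleaners"] if c not in ALL_TEXT_CLEANERS]
    if unknown:
        raise ValueError(f"unknown text cleaners: {unknown}")
    return list(merged["default_text_cleaners"])


print(active_text_cleaners({}))                    # cleaning is off by default
print(active_text_cleaners({"clean_data": True}))  # default cleaner applies
```

With the defaults, no cleaner runs; setting clean_data to True enables whatever is listed in default_text_cleaners.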