Looking at stop words: why you shouldn't blindly trust model defaults

September 26, 2020

Date

September 26, 2020

Time

7:00 PM

Invited talk at Salt Lake City R Users Group

Removing stop words is a fairly common step in natural language processing, and NLP packages often supply a default list. However, most documentation and tutorials don’t explore the nuances of selecting an appropriate list. Defaults for machine learning and modeling can be helpful but may be misleading or wrong. This talk will focus on the importance of checking assumptions and defaults in the software you use.

Posted on:
September 26, 2020
Length:
1 minute read, 75 words
Categories:
talk whyR2020
See Also: