What effect do stemming, lemmatizing, and removing stop words have on text data? Kermit composing a hot take for the timeline. -- via Tenor If you're getting into data science and you're still fuzzy on the difference between structured and unstructured data...do an NLP project. "NLP" is natural language processing. Until I did an NLP … Continue reading NLP: Stemming, Lemmatization, and Stop Words
Category: data science
Classification metrics in plain English
A data science project can often be summarized a series of decisions. What do you do with missing data? What are you choosing as your target variable? Which kind of model are you choosing to make your predictions. When I was working on my first machine learning classification project for the Flatiron School, the decision … Continue reading Classification metrics in plain English
How much should this house cost?
Multiple linear regression modeling Below is an overview of my second project as a data science student in the Flatiron School bootcamp. The premise is that a real estate agency in the Seattle area wants to advise its clients on how they can increase the expected price that their homes will sell for. There are … Continue reading How much should this house cost?
Journalist, data scientist, or both?
When I decided to enroll in the Flatiron School and learn data science, I found myself in a bit of a career identity crisis. I've been living and breathing journalism since college. Was I leaving that behind? My main goal as a Flatiron student is to use the technical skills that I'm learning in order … Continue reading Journalist, data scientist, or both?