Welcome to My Data Science Blog
-
Featured
Revisiting NYC’s OpenData on 311 Service Requests
https://opendata.cityofnewyork.us/ While studying in an immersive Data Science bootcamp program, one of my projects I worked on had to specifically use classification variables. An example of a classification variable is hot dog vs not hot dog. This is different from continuous variables such as predicting the sale price of your car based on make, model,… Read more
-
Healthcare Provider Fraud: Exploring the Outpatient Data
I explored the Inpatient data in my previous blog. Let’s take a look at the outpatient data. (517737, 27) This is a much bigger data set than the inpatient data set. Outpatient has 517,737 rows of data, whereas inpatient only had 40,474 rows. This is not surprising as there are many advantages to having outpatient… Read more
-
Healthcare Provider Fraud: Exploring the Inpatient Data
As we continue our project on Healthcare Provider Fraud Detection, we explore the remainder of our data. We have two csv files, one for Inpatient data and the other for Outpatient data. Let’s load in our data, take a look at its shape and the dataframe itself. We have 40,474 rows of data with 30… Read more
Follow My Blog
Get new content delivered directly to your inbox.