This is my presentation in IEEE Big Data 2018 on my research Deep K-Nearest-Neighbors (Deep KNN). This is a K-Nearest-Neighbors model enhanced by using deep neural networks.
We captured tweets originated 250-mile radius near Fort Myers, Florida within a week around the landfall of the disaster. We then performed topic modeling to investigate the consensus reaction to…
This is my presentation on graph clustering for market basket analysis. I also built a SAS macro for transformation from transactional data to graph data in this research. I presented…
This is a poster for my research on utilizing graphs to product clustering and sales forecasting in retailing using SAS. I presented this poster in SAS Analytics 2016 and won…
So please be a bit patient, because before diving in data analytics, we still have something to do. As you may have guessed, that is to get used to the…
Jupyter Notebook is an Integrated Development Environment (IDE, basically an application that supports writing codes) for multiple languages including Python. Jupyter notebook is increasingly seeing more and more usages in…
Now that we have some ideas on what is data and what we can do, we can start getting into the actions! Let us not rush ourselves however, because there…
As we discussed in the last post, predictive analysis is a family of analysis that aims to learn useful knowledge from data collected historically. The learned knowledge is then applied…
Now that we have established the concept of tabular data, let discuss what we can do with them. The goal of data science is to discover useful knowledge from mountains…
What is tabular data? The most basic form of a data set with which we can work is probably the tabular data. Tabular data is data that is organized as…