tienlinhle32, Author at Data Science from a Practical Perspective

Common Data Issues

By tienlinhle32 February 20, 2023Data Science, Exploratory Analysis1 Comment

So far, we have discussed distribution analysis and correlation analysis when initially exploring data. One important task in this phase is to determine whether there is anything that will cause…

Correlation with Categorical Data

By tienlinhle32 February 17, 2023Data Science, Exploratory Analysis

Previously, we have learned about tools for correlation analysis on numeric columns. By now though, you should have known that numeric is not the only type of data. In fact,…

Correlation Analysis on Numeric Data

By tienlinhle32 February 11, 2023Data Science, Exploratory Analysis1 Comment

Previously, we have learned to analyze distributions of numeric and categorical columns. However, those techniques only focus on one individual column at a time. In exploratory analysis, we have a…

Exploring Categorical Distributions

By tienlinhle32 February 2, 2023Data Science, Exploratory Analysis1 Comment

Surely after discussing distribution analysis on numeric data, we will move on to categorical data, right? Of course! In this post, we will discuss tools for performing analysis on categorical…

Exploring Numeric Distributions

By tienlinhle32 January 27, 2023Data Science, Exploratory Analysis3 Comments

With an overview understanding about distribution analysis, let us actually perform those, starting with numerical data. Obviously, we will be using a mixture of Pandas and Matplotlib - a powerful…

Distribution Analysis

By tienlinhle32 January 22, 2023Data Science, Exploratory Analysis1 Comment

Sun Tzu once said "know your data, know your models, a hundred analyses, a hundred wins", or something along that line. See, people in the mediaeval times knew the importance…

Exploratory Analysis

By tienlinhle32 January 17, 2023Data Science, Exploratory Analysis4 Comments

At this point, we have obtained a good amount of understanding and hands-on about NumPy arrays and Pandas dataframes. We can now start some analysis. And, the first one that…

Merging Dataframes

By tienlinhle32 January 9, 2023Data Science, Data Set Basics

While certainly useful in some cases, concatenating dataframes is fairly problematic because of its strict requirement on row orders. You may end up with wrong and meaningless results even with…

Concatenating Dataframes

By tienlinhle32 January 5, 2023Data Science, Data Set Basics1 Comment

Previously, we have discussed basic data concatenation with NumPy arrays. In Pandas, concatenating dataframes is also a thing, however with a few differences. The operation no longer requires equal shapes…

Text Data in Pandas

By tienlinhle32 December 27, 2022Data Science, Data Set Basics

So far, we have only been discussing operations with numbers, so you may start wondering if we would ever talk about text data, right? Sure, why don't we do that…