Pandas Note (1): Data Type

(1) Inquiry: df.dtypes

(2) Setup:

data = pd.DataFrame([3,4]) 
new_data = data.astype('category')

int64 and category will be printed

1.1 Category Data Type

The category data type in pandas is a hybrid data type. It looks and behaves like a string in many instances but internally is represented by an array of integers. This allows the data to be sorted in a custom order and to more efficiently store the data.

In the following example, we should a case where category column is set up:

import pandas as pd
from pandas.api.types import CategoricalDtype

sales_1 = [{'account': 'Jones LLC', 'Status': 'Gold', 'Jan': 150, 'Feb': 200, 'Mar': 140},
{'account': 'Alpha Co', 'Status': 'Gold', 'Jan': 200, 'Feb': 210, 'Mar': 215},
{'account': 'Blue Inc', 'Status': 'Silver', 'Jan': 50, 'Feb': 90, 'Mar': 95 }]
df_1 = pd.DataFrame(sales_1)
status_type = CategoricalDtype(categories=['Silver', 'Gold'], ordered=True)
df_1['Status'] = df_1['Status'].astype(status_type)




Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

TYPY, Roadmap

Introduction to Molecular Modelling: Part 3 (Creating High-Quality Images)

Komputation v0.9.0

Sharing session with Binus Students

Keptn — Multi-stage delivery with Quality Gates(Demo)-Part 4

DevOps-What is CI, CD and CT?

Databricks Photon + AWS Graviton2 = Performance

JAMstack at the Edge: How we built Built with Workers… on Workers

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store


More from Medium

Applying Custom Functions in Pandas

10 Common SQL operations to perform using Pandas

Data Cleaning: Handling Missing Data in Pandas

Pandas Groupby