๋ฌธ์
From Connections_UK.csv, plot a pie graph of the connections in Q4 2018, Q4 2019 and Q4 2020, with percentages on each pie.
-You may want to replace NAN values with zeros first, remember also to get rid of United Kingdom total
-You can choose to plot them separately or in one plots using subplots
ํ์ด
๋ฌธ์ ์์ NAN์ 0์ผ๋ก ๋ฐ๊พผ ํ์ United Kingdom total์ ์ญ์ ํด์ผํ๊ณ
๋ฐ๋ก ์ธ๊ฐ๋ฅผ plotํด์ผํ๋ค๋ ์กฐ๊ฑด์ ํ์ธํ์ต๋๋ค.
1. ํจํค์ง ์ํฌํธ ๋ฐ ํ์ผ ๋ถ๋ฌ์ค๊ธฐ
from pandas import read_csv
import pandas as pd
import numpy as np
uk=read_csv('Connections_UK.csv')
uk
7๊ฐ์ ํ๊ณผ 26๊ฐ์ ์ปฌ๋ผ์ ๊ฐ์ง๊ณ ์๋ ๋ฐ์ดํฐ ํ๋ ์์ธ ๊ฒ์ ํ์ธ!
2. ์ ์ฒ๋ฆฌ
# ํ์ํ ์ด๋ง ๋ฝ์์ ์ญ์ ํ๋ค
newuk=uk[['Market, Operator', "Q4 2018", "Q4 2019", "Q4 2020"]]
print(newuk)
# dropna
newuk.dropna(inplace=True)
# 6๋ฒ์งธ ํ์ ์ญ์ ํ๊ธฐ ์ํด 4๋ฒ์งธ ํ๊น์ง๋ง ๋ฝ์์ ์ ์ฅํ๋ค
newuk = newuk[:5]
ํ์ํ ์ปฌ๋ผ๋ง ๋จ๊ธฐ๊ณ ์ญ์ ํ์๊ณ
United Kingdom์ ํ์ ์ญ์ ํด์ newuk์ ์ ์ฅํ์๋ค!
์ค๊ฐ์ ๋น ์ง ๋ก์ฐ๊ฐ ์๊ฒผ์ผ๋ฏ๋ก ์ธ๋ฑ์ค ์ฌ๋ฐฐ์ด!
# ์ธ๋ฑ์ค๋ฅผ ์ฌ์์ฑํ๋ค
newuk.reset_index(inplace = True, drop=True)
newuk
~์ซ์๋ก ๋ณํ~
# ,๋ฅผ ๊ณต๋ฐฑ์ผ๋ก ๋ฐ๊ฟ์ค
newuk['Q4 2018']=newuk['Q4 2018'].str.replace(',', '')
newuk['Q4 2019']=newuk['Q4 2019'].str.replace(',', '')
newuk['Q4 2020']=newuk['Q4 2020'].str.replace(',', '')
# numeric์ผ๋ก ๋ณํํด์ค
newuk[["Q4 2018", "Q4 2019", "Q4 2020"]]=newuk[["Q4 2018", "Q4 2019", "Q4 2020"]].apply(pd.to_numeric)
# ๋ฐ๋ ๋ฐ์ดํฐํํ ํ์ธ
newuk.info()
๋ถ์์ ์ํด์ ๋ฌธ์๋ก ์ทจ๊ธ๋นํ๋ ์ซ์๋ฅผ int64๋ก ๋ณํํ๋ ๊ณผ์ ์ด ํ์ํ๋ค!
์์ : ','๋ฌธ์ ์ ๊ฑฐ->numeric๋ณํ->ํ์ธ
3. plot
import matplotlib.pyplot as plt
# subplot์ผ๋ก pie์ฐจํธ ๊ทธ๋ฆฌ๊ธฐ, 1์นธ์ 3๊ฐ์ plot, ๋์ด๋ 18-๋์ด๋9
fig,ax = plt.subplots(1,3,figsize=(18,9))
# 'Q42018', label์ 'market, operator'๋ก ๋ถ์ฌ์ค, ์์์ 1์งธ์๋ฆฌ๊น์ง %๋ก ํ๊ธฐ
ax[0].pie(newuk['Q4 2018'], labels = newuk['Market, Operator'], autopct='%1.1f%%')
# title๋ช
์ง์
ax[0].set_title("Q4 2018")
# 'Q42019'
ax[1].pie(newuk['Q4 2019'], labels = newuk['Market, Operator'], autopct='%1.1f%%')
ax[1].set_title("Q4 2019")
# 'Q42020'
ax[2].pie(newuk['Q4 2020'], labels = newuk['Market, Operator'], autopct='%1.1f%%')
ax[2].set_title("Q4 2020")
'๐ ๋ฐ์ดํฐ ๋ถ์ > 03. Data Visualizaton' ์นดํ ๊ณ ๋ฆฌ์ ๋ค๋ฅธ ๊ธ
[๋ฐ์ดํฐ ์๊ฐํ] ์๊ฐํ ์ค์ต - 4 by 4 scatter plot (0) | 2022.03.05 |
---|---|
[๋ฐ์ดํฐ ์๊ฐํ] ์๊ฐํ ์ค์ต - Scatter plot (0) | 2022.03.02 |
[๋ฐ์ดํฐ ์๊ฐํ] ์๊ฐํ ์ค์ต - Bar graph (0) | 2022.03.02 |
[๋ฐ์ดํฐ ์๊ฐํ] 1. MATPLOTLIB(2) (0) | 2022.03.01 |
[๋ฐ์ดํฐ ์๊ฐํ] 1. MATPLOTLIB(1) (0) | 2022.02.24 |