Data Job Process
Data Leakage
Imbalanced Data
- Classification
Apply weights in each model
- Regression
right skewed -> log transform
left skewed -> exp transform
However, real-world data is rarely left skewed
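The two remedies above can be sketched roughly as follows. This is a minimal illustration, assuming scikit-learn and NumPy are available. The synthetic datasets, the choice of `LogisticRegression` with `class_weight="balanced"`, the use of `log1p`/`expm1` rather than a plain log, and the `skewness` helper are all assumptions made for the example, not something these notes prescribe.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split


def skewness(values):
    """Sample skewness: the mean of the cubed z-scores (hypothetical helper)."""
    z = (values - values.mean()) / values.std()
    return float((z ** 3).mean())


# Classification: synthetic data with roughly 5% positives (illustrative only).
X, y = make_classification(
    n_samples=2000, n_features=10, weights=[0.95, 0.05], random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=0
)

# Same model with and without class weights.
plain = LogisticRegression(max_iter=1000).fit(X_train, y_train)
weighted = LogisticRegression(max_iter=1000, class_weight="balanced").fit(
    X_train, y_train
)

recall_plain = recall_score(y_test, plain.predict(X_test))
recall_weighted = recall_score(y_test, weighted.predict(X_test))
print("minority recall, unweighted:", recall_plain)
print("minority recall, weighted:  ", recall_weighted)

# Regression: a right-skewed synthetic target, log-transformed and restored.
rng = np.random.default_rng(0)
target = rng.lognormal(mean=0.0, sigma=1.0, size=5000)
log_target = np.log1p(target)    # train on this instead of the raw target
restored = np.expm1(log_target)  # invert predictions back to the original scale

skew_before = skewness(target)
skew_after = skewness(log_target)
print("skewness before / after log1p:", skew_before, skew_after)
```

Weighting usually trades some precision for recall on the minority class, so it is worth checking both metrics before settling on it.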
Data Wrangling
Feature Importance
- Gini(MDI)
- Permutation
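A rough sketch of how the two importance measures can be computed with scikit-learn. The random forest and the synthetic dataset are assumptions for illustration. `feature_importances_` gives the impurity-based (MDI) values, and `sklearn.inspection.permutation_importance` gives the permutation-based ones.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

# Synthetic data: 6 features, 3 of them informative (illustrative only).
X, y = make_classification(
    n_samples=1000, n_features=6, n_informative=3, n_redundant=0, random_state=0
)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Gini / MDI importance: computed from the training data, normalized to sum to 1.
mdi = model.feature_importances_

# Permutation importance: drop in validation score when one column is shuffled.
perm = permutation_importance(model, X_val, y_val, n_repeats=5, random_state=0)

for i in range(X.shape[1]):
    print(f"feature {i}: MDI={mdi[i]:.3f}  permutation={perm.importances_mean[i]:.3f}")
```

MDI comes from training-time splits and tends to favor high-cardinality features, while permutation importance is measured on held-out data, so the two rankings can disagree.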
Boosting
- AdaBoost
- Gradient Boosting
- XGBoost (XGB)
- LightGBM
- CatBoost
Model Interpretation
- Feature Importance
- PDP
isolate
interact
- SHAP