๐Ÿ’ฟ Data/์ด๋ชจ์ €๋ชจ

    SQL_SELECT์˜ ์‹คํ–‰ ์ˆœ์„œ

    SELECT๋ฌธ์€ ๋ฐ์ดํ„ฐ๋ฅผ ์กฐํšŒํ•˜๋Š” ์ฟผ๋ฆฌ๋ฌธ์— ์‚ฌ์šฉ ์ฟผ๋ฆฌ๋ฌธ์ด ์ ํžŒ ์ˆœ์„œ๊ฐ€ ์•„๋‹Œ ์ •ํ•ด์ง„ ์ˆœ์„œ๋Œ€๋กœ ์ž‘๋™ ์‹คํ–‰ ์ˆœ์„œ FROM WHERE GROUP BY HAVING SELECT ORDER BY ์˜ˆ์‹œ) SELECT CustomerId, AVG(Total) FROM invoices WHERE CustomerId >= 10 GROUP BY CustomerId HAVING SUM(Total) >= 30 ORDER BY 2 ์˜ˆ์‹œ ์‹คํ–‰ ์ˆœ์„œ FROM invoices: ๋จผ์ € invoices ํ…Œ์ด๋ธ”์— ์ ‘๊ทผ์„ ํ•ฉ๋‹ˆ๋‹ค. WHERE CustomerId >= 10: 'CustomerId' ํ•„๋“œ๊ฐ€ 10 ์ด์ƒ์ธ ๋ ˆ์ฝ”๋“œ๋“ค์„ ์กฐํšŒํ•ฉ๋‹ˆ๋‹ค. GROUP BY CustomerId: 'CustomerId' ๋ฅผ ๊ธฐ์ค€์œผ๋กœ ๊ทธ..

    SQL_SQLite ์ž์ฃผ ์“ฐ๋Š” ๋ฌธ๋ฒ•(2)

    ์‚ฌ์šฉ๋˜๋Š” ์˜ˆ์‹œ๋“ค์€ ๋”ฐ๋กœ ๋ช…์‹œ๋˜์ง€ ์•Š๋Š” ํ•œ chinook(SQLite training ์˜ˆ์ œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค) ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค๋ฅผ ๊ธฐ์ค€์œผ๋กœ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. SQL ๋‚ด์žฅํ•จ์ˆ˜ ์ง‘ํ•ฉ์—ฐ์‚ฐ ๋ ˆ์ฝ”๋“œ๋“ค์„ ์กฐํšŒํ•˜๊ณ  ๋ถ„๋ฅ˜ํ•œ ๋’ค์— ํŠน์ • ์ž‘์—…์„ ํ•˜๋Š” ๋“ฑ์˜ ์ง‘ํ•ฉ์—ฐ์‚ฐ GROUP BY ๋ฐ์ดํ„ฐ๋ฅผ ์กฐํšŒํ•  ๋•Œ, ๊ธฐ์ค€์œผ๋กœ ๋ฌถ์–ด์„œ ์กฐํšŒํ•˜๊ฒŒ ํ•ด์ฃผ๋Š” ๊ธฐ๋Šฅ SELECT State, COUNT(*) # ๊ฐ State๋งˆ๋‹ค ๊ฐฏ์ˆ˜๋ฅผ ์„ธ์–ด ์ค๋‹ˆ๋‹ค.(State๋งˆ๋‹ค ๋ฌถ์Œ์ด ํ˜•์„ฑ๋˜์–ด์žˆ์œผ๋‹ˆ) FROM customers GROUP BY State; HAVING GROUP BY๋กœ ์กฐํšŒ๋œ ๊ฒฐ๊ณผ์— ํ•„ํ„ฐ๋ฅผ ์ ์šฉ SELECT State, COUNT(*) FROM customers GROUP BY State HAVING COUNT(*) >= 3 ์—ฌ๊ธฐ์„œ TIP WHERE์™€ HAVING์˜ ..

    SQL_SQLite ์ž์ฃผ ์“ฐ๋Š” ๋ฌธ๋ฒ•

    SELECT : ๋ฐ์ดํ„ฐ์…‹์— ํฌํ•จ๋  ํŠน์„ฑ ๊ณ ๋ฅด๊ธฐ SELECT 'hello world'; SELECT 2; SELECT 15 + 3;FROM : ํ…Œ์ด๋ธ”๊ณผ ๊ด€๋ จ์ด ์žˆ๋Š” ๊ฒฝ์šฐ ํ•„์ˆ˜๋กœ ๋ช…์‹œํ•ด์•ผํ•˜๋Š” ๋ช…๋ น์–ด, ๊ฒฐ๊ณผ๋“ค์„ ๋„์ถœํ•ด๋‚ผ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ…Œ์ด๋ธ” ๋ช…์‹œ SELECT ํŠน์„ฑ_1, ํŠน์„ฑ_2 FROM ํ…Œ์ด๋ธ”_์ด๋ฆ„; -- ์˜ˆ์‹œ SELECT customers.FirstName, customers.LastName FROM customers; SELECT * FROM ํ…Œ์ด๋ธ”_์ด๋ฆ„; -- ์˜ˆ์‹œ SELECT * # *๋Š” ์™€์ผ๋“œ์นด๋“œ(wildcard)๋กœ ์ „๋ถ€ ์„ ํƒํ•  ๋•Œ ์‚ฌ์šฉ FROM customers;WHERE : ์„ ํƒ์ ์œผ๋กœ ํ•„ํ„ฐ ์—ญํ• ์„ ํ•˜๋Š” ์ฟผ๋ฆฌ๋ฌธ # ํŠน์ • ๊ฐ’๊ณผ ๋™์ผํ•œ ๋ฐ์ดํ„ฐ ์ฐพ๊ธฐ SELECT ํŠน์„ฑ_1, ํŠน์„ฑ_2 FROM ..

    Boosting(vs bagging)

    1. ํŠน์„ฑ ์ค‘์š”๋„๋ฅผ ๊ณ„์‚ฐํ•˜๋Š” ๋ฐฉ๋ฒ•์˜ ์žฅ๋‹จ์ ์„ ์„ค๋ช…ํ•˜๊ณ  ๊ฐ๊ฐ ์–ด๋–ค ์ƒํ™ฉ์— ์‚ฌ์šฉํ•˜๋ฉด ์ข‹์„์ง€ ์„ค๋ช…ํ•ด ๋ณด์„ธ์š”. ์—ฌ๊ธฐ๋กœ 2. bagging๊ณผ boosting์˜ ์ฐจ์ด์ ๊ณผ ๊ฐ๊ฐ ์–ด๋–ค ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ข…๋ฅ˜๋“ค์ด ์žˆ๋Š”์ง€ ์•Œ๊ณ ๋ฆฌ์ฆ˜๋ณ„ ์žฅ๋‹จ์ ์„ ์„ค๋ช…ํ•˜๊ณ , ์–ด๋–ค ์ƒํ™ฉ์—์„œ ์‚ฌ์šฉํ•˜๋ฉด ์ข‹์„์ง€ ๋…ผ์˜ํ•ด ๋ณด์„ธ์š”. bagging vs boosting์˜ ์ฐจ์ด ๋ฐฐ๊น… : ๋ณ‘๋ ฌ ํ•™์Šต, ๊ฐ๊ฐ์˜ ํŠธ๋ฆฌ๋“ค์ด ๋…๋ฆฝ์  ๋ถ€์ŠคํŒ… : ์ˆœ์ฐจ ํ•™์Šต(์ง๋ ฌ), ๋’ค์˜ ํŠธ๋ฆฌ๊ฐ€ ์ด์ „ ํŠธ๋ฆฌ์˜ ์˜ํ–ฅ์„ ๋ฐ›์Œ(์ข…์†) ์˜ค๋‹ต์— ๋Œ€ํ•ด์„œ ๋” ๋†’์€ ๊ฐ€์ค‘์น˜๋ฅผ ๋ถ€์—ฌํ•จ์œผ๋กœ ์จ ์˜ค๋‹ต์— ๋” ์ง‘์ค‘ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋งŒํผ ๋ฐฐ๊น…์— ๋น„ํ•ด error๊ฐ€ ์ ๊ณ  ์„ฑ๋Šฅ์ด ์ข‹์Šต๋‹ˆ๋‹ค. ๋‹ค๋งŒ, ์˜ค๋‹ต์— ๋” ์ง‘์ค‘ํ•˜๋Š” ๋ฐฉ๋ฒ•์œผ๋กœ ์˜ค๋ฒ„ ํ”ผํŒ…๋  ๊ฐ€๋Šฅ์„ฑ์ด ๋†’์Šต๋‹ˆ๋‹ค. ๋‹จ์ˆœํ•˜๊ฒŒ ์ƒ๊ฐํ•  ์ˆ˜๋Š” ์—†์ง€๋งŒ, ์ผ๋ฐ˜์ ์œผ๋กœ ๊ฐœ๋ณ„ ๊ฒฐ์ • ํŠธ๋ฆฌ(๊ธฐ๋ณธ๋ชจ๋ธ)์˜ ์„ฑ๋Šฅ์ด ๋‚ฎ๋‹ค๋ฉด ..

    Model Interpreting

    ๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจ๋ธ ํ•ด์„ ๋ฐฉ๋ฒ•๋“ค์˜ ์žฅ๋‹จ์ ๊ณผ ๊ฐ๊ฐ ์–ด๋–ค ๋ฐฉ์‹์œผ๋กœ ํ™œ์šฉํ•  ์ˆ˜ ์žˆ์„์ง€ ๋…ผ์˜ํ•ด ๋ณด์„ธ์š”. ๋ชจ๋ธ ํ•ด์„์˜ ํ•„์š”์„ฑ ์„ฑ๋Šฅ์ด ์ข‹์€ ๋ชจ๋ธ์€ ๋Œ€์ฒด๋กœ Black Box Model์ž…๋‹ˆ๋‹ค. (์˜ˆ์ธก์ด ์ •ํ™•ํ•˜๊ธฐ ์œ„ํ•ด์„  ์•„๋ฌด๋ž˜๋„ ๋ชจ๋ธ ์ž์ฒด๊ฐ€ ๋ณต์žกํ•ด์ง€๋‹ค๋ณด๋‹ˆ) ๋งŽ์€ ๋ถ„์•ผ์—์„œ ๋ชจ๋ธ์„ ๋ฌด์กฐ๊ฑด ์‹ ๋ขฐํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค. ๊ฐ€๋ น, ์˜ํ™”๋ฅผ ์ถ”์ฒœํ•˜๋Š” ์‹œ์Šคํ…œ์—์„œ ์˜ํ™” ์ถ”์ฒœ์„ ์ž˜๋ชปํ–ˆ๋‹ค๊ณ  ํ•ด์„œ ์•„์ฃผ ํฐ ์ผ์ด ๋‚˜๋Š” ๊ฒƒ์€ ์•„๋‹™๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ ์ž์œจ์ž๋™์ฐจ๊ฐ™์€ ๊ฒฝ์šฐ, ํ•œ๋ฒˆ์˜ ํŒ๋‹จ์ด ํฐ ์‚ฌ๊ณ ๋กœ ์ด์–ด์งˆ ์ˆ˜ ์žˆ์œผ๋ฏ€๋กœ ์šฐ๋ฆฌ๋Š” ๊ทธ ๋‚ด๋ถ€์˜ ์ž‘๋™ ์›๋ฆฌ๋ฅผ ๋ถ„์„ํ•˜๊ณ  ์—ฐ๊ตฌํ•˜์—ฌ ๋” ์•ˆ์ •์„ฑ ์žˆ๋Š” ๋ชจ๋ธ์„ ๋งŒ๋“ค ํ•„์š”๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค. ์˜์‚ฌ๊ฒฐ์ •์— ์ง์ ‘ ์˜ํ–ฅ์„ ์ฃผ๋Š” ๊ฒƒ์€ ํ•ด์„์ž…๋‹ˆ๋‹ค. ์ฆ‰, ๋ชจ๋ธ์„ ํ†ตํ•ด ์˜ˆ์ธก์— ๋Œ€ํ•œ 'score'๋Š” ๊ณ„์‚ฐํ•  ์ˆ˜ ์žˆ์ง€๋งŒ ๊ทธ ๊ณผ์ •์„ ๋ณด๊ณ  ๊ฒฐ์ •์„ ๋‚ด๋ฆฌ๋Š” ๊ฒƒ์€..