Jayden`s

    [๋”ฅ๋Ÿฌ๋‹, NLP] ๋ถˆ์šฉ์–ด, ์ถ”์ถœ, BoW/TF-IDF

    ๋ถˆ์šฉ์–ด(Stop words) ์ž์ฃผ ๋“ฑ์žฅํ•˜์ง€๋งŒ ์ž์—ฐ์–ด๋ฅผ ๋ถ„์„ํ•˜๋Š” ๊ฒƒ์— ์žˆ์–ด ํฐ ๋„์›€์ด ๋˜์ง€ ์•Š๋Š” ๋‹จ์–ด ๊ฐ–๊ณ  ์žˆ๋Š” ๋ง๋ญ‰์น˜ ๋ฐ์ดํ„ฐ์—์„œ ์ตœ๋Œ€ํ•œ ์œ ์˜๋ฏธํ•œ ๋‹จ์–ด(ํ† ํฐ)๋ฅผ ์„ ๋ณ„ํ•˜๊ธฐ ์œ„ํ•ด ๋ถˆ์šฉ์–ด๋Š” ์ œ๊ฑฐํ•˜๋Š” ๊ฒƒ์ด ์ข‹์Šต๋‹ˆ๋‹ค. I, he, her, ์กฐ์‚ฌ, ์ ‘๋ฏธ์‚ฌ ๊ฐ™์€ ๋‹จ์–ด๋“ค์ด ๋Œ€๋ถ€๋ถ„ ๋ถˆ์šฉ์–ด๋กœ ์ฒ˜๋ฆฌ๋ฉ๋‹ˆ๋‹ค. ์–ด๊ฐ„ ์ถ”์ถœ(Stemming) ๋ง๋ญ‰์น˜ ๋ฐ์ดํ„ฐ์—์„œ ๋‹จ์–ด๋ฅผ ์ค„์ผ ์ˆ˜ ์žˆ๋Š” ์ •๊ทœํ™” ๋ฐฉ๋ฒ• ์ค‘ ํ•˜๋‚˜ ๋‹จ์–ด์—์„œ ๊ฐœ๋…์  ์˜๋ฏธ๋ฅผ ๊ฐ–๋Š” ์–ด๊ฐ„๋งŒ ์ถ”์ถœํ•˜๋Š” ๋ฐฉ๋ฒ• ex) analysis๊ณผ analytic -> ๋‘˜ ๋‹ค ๋ถ„์„์˜ ์˜๋ฏธ๋ฅผ ๊ฐ–๊ณ  ์žˆ์œผ๋ฏ€๋กœ analy๋กœ ์ค„์ผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ์‹œ์™€ ๊ฐ™์ด ์–ด๊ฐ„๋งŒ ์ถ”์ถœํ•˜๋‹ค๋ณด๋‹ˆ ์‚ฌ์ „์— ์—†๋Š” ๋‹จ์–ด๊ฐ€ ์ƒ๊ธฐ๊ฒŒ ๋ฉ๋‹ˆ๋‹ค. ํ‘œ์ œ์–ด ์ถ”์ถœ(Lemmatization) ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ๋ง๋ญ‰์น˜ ๋ฐ์ดํ„ฐ์—์„œ ๋‹จ์–ด๋ฅผ ์ค„์ผ ์ˆ˜ ์žˆ๋Š” ์ •๊ทœํ™” ๋ฐฉ๋ฒ• ์ค‘..

    [1564]ํ‰๊ท 

    import sys N = int(sys.stdin.readline()) arr = list(map(int, sys.stdin.readline().split())) num_max = max(arr) a = 0 for i in arr: a += (i / num_max * 100) print(a / N) 1564 ํ‰๊ท 

    [3052]๋‚˜๋จธ์ง€

    import sys arr = [] for _ in range(10): arr.append(int(sys.stdin.readline()) % 42) arr = set(arr) print(len(arr)) 3052 ๋‚˜๋จธ์ง€

    [10818]์ตœ์†Œ, ์ตœ๋Œ€

    import sys N = int(sys.stdin.readline()) arr = list(map(int, sys.stdin.readline().split())) print(min(arr), max(arr)) 10818 ์ตœ์†Œ, ์ตœ๋Œ€

    [๊ฒฝ์ œ]220304_๋ฏธ๊ตญ ๊ธˆ๋ฆฌ์ธ์ƒ

    [๊ฒฝ์ œ]220304_๋ฏธ๊ตญ ๊ธˆ๋ฆฌ์ธ์ƒ

    - ์ผ๋‹จ์€ ์˜ˆ์ •๋œ๋Œ€๋กœ๋งŒ ๊ธˆ๋ฆฌ ์ธ์ƒ ์ง„ํ–‰ - ํ˜„์žฌ ์šฐํฌ๋ผ์ด๋‚˜ ์‚ฌํƒœ๋กœ ๋ถˆํ™•์‹คํ•œ ์ƒํ™ฉ์—์„œ ๋„ˆ๋ฌด ํฐ ๋ฌด๋ฆฌํ•˜์ง€ ์•Š๊ฒ ๋‹ค๋Š” ์ž…์žฅ - ๋‹ค๋งŒ, ๊ทธ ๋ง์€ ๊ณ„์† ๋œ ์ƒํ™ฉ์œผ๋กœ ์ธํ”Œ๋ ˆ์ด์…˜์ด ๋” ์ง„์ •๋˜์ง€ ์•Š์œผ๋ฉด ๋˜ ๋ชจ๋ฆ„ - ๋Œ€์ฐจ๋Œ€์กฐํ‘œ ์ถ•์†Œ : ์—ฐ์ค€์ด ๋ณด์œ ํ•œ ์ž์‚ฐ์„ ๊ฐ์ถ•ํ•œ๋‹ค๋Š” ์˜๋ฏธ, ํ—Œ๋ฐ ์—ฐ์ค€์˜ ์ž์‚ฐ ๋Œ€๋ถ€๋ถ„์€ ์ฑ„๊ถŒ์ด๋‹ค. ์ฆ‰, ๋‹ค์‹œ ๋งํ•ด ์ž์‚ฐ ๊ฐ์ถ•์€ ์ฑ„๊ถŒ์„ ๋งค๊ฐํ•˜๋Š” ๊ฒƒ์œผ๋กœ ํ˜„๊ธˆ์„ ๊ฑฐ๋‘ฌ๋“ค์ด๊ฒ ๋‹ค๋Š” ์˜๋ฏธ. ๊ธˆ๋ฆฌ์ธ์ƒ๊ณผ ๋น„์Šทํ•œ ํšจ๊ณผ๋ฅผ ๊ฐ€์ ธ์˜จ๋‹ค.(์‹œ์ค‘ ํ˜„๊ธˆ์„ ํก์ˆ˜ํ•˜๋Š” ๊ฒƒ)