--- id: BERT-001 category: "10_Wiki/๐Ÿ’ก Topics/AI" confidence_score: 1.0 tags: [ai, nlp, bert, transformer, language-model, google-research] last_reinforced: 2026-04-26 --- # BERT (Bidirectional Encoder Representations from Transformers) ## ๐Ÿ“Œ ํ•œ ์ค„ ํ†ต์ฐฐ (The Karpathy Summary) > "๋ฌธ์žฅ์˜ ์™ผ์ชฝ๊ณผ ์˜ค๋ฅธ์ชฝ์„ ๋™์‹œ์— ๋ณด๋ฉฐ ๋‹จ์–ด์˜ ์ง„์งœ ์˜๋ฏธ๋ฅผ ํŒŒ์•…ํ•˜๋ผ" โ€” ๊ตฌ๊ธ€์ด ์ œ์•ˆํ•œ ํ˜์‹ ์ ์ธ ์‚ฌ์ „ ํ•™์Šต ๋ชจ๋ธ๋กœ, ๋ฌธ๋งฅ์˜ ์–‘๋ฐฉํ–ฅ์„ฑ์„ ๋ชจ๋‘ ๊ณ ๋ คํ•˜์—ฌ ๋‹จ์–ด์˜ ์˜๋ฏธ๋ฅผ ์ˆ˜์น˜ํ™”ํ•จ์œผ๋กœ์จ NLP ๋ถ„์•ผ์˜ ์ˆ˜๋งŽ์€ ๋ฒค์น˜๋งˆํฌ ๊ธฐ๋ก์„ ๊ฐฑ์‹ ํ•œ ๋ชจ๋ธ. ## ๐Ÿ“– ๊ตฌ์กฐํ™”๋œ ์ง€์‹ (Synthesized Content) - **์ถ”์ถœ๋œ ํŒจํ„ด:** ๋ฌธ์žฅ ๋‚ด์˜ ์ผ๋ถ€ ๋‹จ์–ด๋ฅผ ๊ฐ€๋ฆฌ๊ณ (Masked LM) ์›๋ž˜ ๋‹จ์–ด๋ฅผ ๋งžํžˆ๋Š” ๊ณผ์ •๊ณผ, ๋‘ ๋ฌธ์žฅ์ด ์ด์–ด์ง€๋Š”์ง€(NSP) ์˜ˆ์ธกํ•˜๋Š” ๊ณผ์ •์„ ํ†ตํ•ด ๊นŠ์ด ์žˆ๋Š” ์–ธ์–ด ์ดํ•ด๋ ฅ์„ ๊ฐ–์ถ”๋Š” ์‚ฌ์ „ ํ•™์Šต ํŒจํ„ด. - **ํ•ต์‹ฌ ํŠน์ง•:** - **Bidirectional Context:** ์ด์ „ ์‹œ์ ์˜ ์ •๋ณด๋งŒ ๋ณด๋Š” GPT์™€ ๋‹ฌ๋ฆฌ, ์•ž๋’ค ๋ฌธ๋งฅ์„ ํ•œ๊บผ๋ฒˆ์— ๊ณ ๋ คํ•˜์—ฌ ์ค‘์˜์„ฑ ํ•ด๊ฒฐ์— ํƒ์›”ํ•จ. - **Transformer Encoder:** ํŠธ๋žœ์Šคํฌ๋จธ ์•„ํ‚คํ…์ฒ˜์˜ ์ธ์ฝ”๋” ๋ถ€๋ถ„๋งŒ ์ธต์ธต์ด ์Œ“์•„ ์˜ฌ๋ ค ๊ตฌ์„ฑ. - **Pre-training & Fine-tuning:** ๋ฐฉ๋Œ€ํ•œ ์ผ๋ฐ˜ ํ…์ŠคํŠธ๋กœ ๋จผ์ € ํ•™์Šตํ•œ ๋’ค, ํŠน์ • ํƒœ์Šคํฌ(์งˆ์˜์‘๋‹ต, ๊ฐ์„ฑ ๋ถ„์„ ๋“ฑ)์— ๋งž์ถฐ ์‚ด์ง๋งŒ ํŠœ๋‹ํ•˜์—ฌ ๊ณ ์„ฑ๋Šฅ ํ™•๋ณด. - **Contextual Embeddings:** ๋™์ผํ•œ ๋‹จ์–ด๋ผ๋„ ์ฃผ๋ณ€ ๋ฌธ๋งฅ์— ๋”ฐ๋ผ ์„œ๋กœ ๋‹ค๋ฅธ ๋ฒกํ„ฐ ๊ฐ’์„ ๊ฐ€์ง. ## โš ๏ธ ๋ชจ์ˆœ ๋ฐ ์—…๋ฐ์ดํŠธ (Contradictions & RL Update) - **๊ณผ๊ฑฐ ๋ฐ์ดํ„ฐ์™€์˜ ์ถฉ๋Œ:** ๋‹จ๋ฐฉํ–ฅ ์–ธ์–ด ๋ชจ๋ธ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๊ณ , '์ดํ•ด' ์ค‘์‹ฌ์˜ NLP ํƒœ์Šคํฌ์—์„œ ๋…๋ณด์  ์ง€์œ„๋ฅผ ํ™•๋ณด. ์ดํ›„ RoBERTa, ALBERT ๋“ฑ ๋‹ค์–‘ํ•œ ๋ณ€ํ˜• ๋ชจ๋ธ์˜ ํƒ„์ƒ์„ ์ด๋ฃธ. - **์ •์ฑ… ๋ณ€ํ™”:** Antigravity ํ”„๋กœ์ ํŠธ๋Š” ๋ฌธ์„œ ๊ฐ„์˜ ์˜๋ฏธ์  ์œ ์‚ฌ์„ฑ ํŒ๋ณ„ ๋ฐ ๊ฐœ์ฒด๋ช… ์ธ์‹(NER) ์ž‘์—…์— BERT ๊ธฐ๋ฐ˜์˜ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ์„ ์ฃผ๋ ฅ์œผ๋กœ ์‚ฌ์šฉํ•จ. ## ๐Ÿ”— ์ง€์‹ ์—ฐ๊ฒฐ (Graph) - [[Transformer-Architecture|Transformer-Architecture]], NLP, Attention-Mechanisms, Transfer-Learning-Foundations - **Raw Source:** 10_Wiki/Topics/AI/BERT.md