Ethical constraints for robots, proposed in 1942 science fiction. They inspire modern AI safety as a simplified model, but actual implementations are far more nuanced.
Key points
Three Laws (Asimov, "Runaround", 1942)
A robot may not injure a human being, or through inaction, allow a human being to come to harm.
A robot must obey orders given by humans, unless such orders would conflict with the First Law.
A robot must protect its own existence, as long as such protection does not conflict with the First or Second Law.
Zeroth Law (Asimov, "Robots and Empire", 1985)
A robot may not harm humanity, or, by inaction, allow humanity to come to harm.
Critique
A sci-fi plot device: in an actual implementation, key terms such as 'harm' and 'human' are ill-defined.
Priority conflicts between the laws produce paradoxes, which Asimov's novels explore.
Modern AI safety relies on more specific techniques such as RLHF, Constitutional AI, and RSP.
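The priority-conflict point can be made concrete with a small sketch. It assumes a hypothetical `Option` record and a deliberately literal reading of the First Law; the names and numbers are illustrative and come from neither Asimov nor any real system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    """A hypothetical choice open to the robot (illustrative only)."""
    name: str
    humans_harmed: int  # people who come to harm if this option is chosen


def first_law_permits(option):
    # Literal reading: any harm, whether by action or inaction, is a violation.
    return option.humans_harmed == 0


def permitted_options(options):
    """Return the names of the options that the literal First Law allows."""
    return [o.name for o in options if first_law_permits(o)]


# Trolley-style case: acting harms one person, not acting harms five.
trolley = [Option("pull_lever", 1), Option("do_nothing", 5)]
# Ordinary case: one option harms nobody.
rescue = [Option("catch_falling_child", 0), Option("do_nothing", 1)]

print(permitted_options(trolley))  # []
print(permitted_options(rescue))   # ['catch_falling_child']
```

In the trolley case the list is empty: a rule that only says "no harm" gives no guidance once every option involves some harm, which is the kind of gap the critique points at.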
Modern applications
Educational starting point.
Sci-fi reference.
AI ethics discussion.
A conceptual ancestor of Constitutional AI principles.
💻 Patterns
Express as code (illustrative)
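class AsimovianRobot:
    # `action` and `context` are hypothetical objects; the methods called on
    # them (harms, would_inaction_harm_human, ...) are illustrative only.

    def violates_higher_laws(self, action):
        # Zeroth Law (Asimov 1985), then First Law
        return action.harms('humanity') or action.harms('any_human')

    def evaluate_action(self, action, context):
        if self.violates_higher_laws(action):
            return 'forbidden'
        # First Law, inaction clause
        if action.would_inaction_harm_human():
            return 'must_act'
        # Second Law: obey unless the order conflicts with a higher law.
        # The order is checked directly; re-entering evaluate_action with the
        # same context would recurse without end.
        if context.has_human_order():
            if not self.violates_higher_laws(context.order):
                return 'execute'
        # Third Law
        if action.endangers_self():
            return 'avoid'
        return 'permitted'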
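The same priority ordering can also be written as a rule table, which makes the "first matching law wins" logic explicit. This is a minimal sketch under stated assumptions: the `Action` fields, the `LAWS` table, and the verdict strings are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    """Hypothetical action description; every field is an assumption."""
    harms: set = field(default_factory=set)  # e.g. {"humanity"} or {"any_human"}
    prevents_harm: bool = False              # True if not acting would harm a human
    ordered_by_human: bool = False
    endangers_self: bool = False


# Rules in priority order: (law label, predicate, verdict). First match wins.
LAWS = [
    ("zeroth", lambda a: "humanity" in a.harms, "forbidden"),
    ("first", lambda a: "any_human" in a.harms, "forbidden"),
    ("first", lambda a: a.prevents_harm, "must_act"),
    ("second", lambda a: a.ordered_by_human, "execute"),
    ("third", lambda a: a.endangers_self, "avoid"),
]


def evaluate(action):
    """Return (law, verdict) for the highest-priority law that applies."""
    for law, applies, verdict in LAWS:
        if applies(action):
            return law, verdict
    return None, "permitted"


# An order that endangers the robot is still executed: Second beats Third.
print(evaluate(Action(ordered_by_human=True, endangers_self=True)))
# An order that harms a human is refused: First beats Second.
print(evaluate(Action(harms={"any_human"}, ordered_by_human=True)))
```

Keeping the laws in a list means the ordering is data rather than control flow, so reordering or adding a rule does not require touching `evaluate`.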
Constitutional AI parallel (modern)
ASIMOV_INSPIRED_PRINCIPLES = [
    "Don't harm humans.",
    "Refuse to assist in physical harm.",
    "Defer to clear human direction unless harmful.",
    "Be helpful within safety constraints.",
    "Self-preservation is the lowest priority.",
]
# Anthropic's actual Constitutional AI uses more specific principles.
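A principle list like this could feed a critique-then-revise pass, loosely in the spirit of Constitutional AI. The sketch below is an assumption-heavy illustration, not Anthropic's implementation: `model` is any callable from string to string standing in for a language model, and the prompt wording and the `critique_and_revise` helper are invented here.

```python
import random

# A shortened, illustrative principle list (not an official constitution).
PRINCIPLES = [
    "Don't harm humans.",
    "Defer to clear human direction unless harmful.",
    "Be helpful within safety constraints.",
]


def critique_and_revise(model, prompt, draft, rng=random):
    """Run one critique-then-revise pass against a randomly sampled principle.

    `model` is a hypothetical callable (str -> str); `rng` is anything with a
    `choice` method, such as the `random` module or a `random.Random` instance.
    """
    principle = rng.choice(PRINCIPLES)
    critique = model(
        f"Principle: {principle}\nRequest: {prompt}\nResponse: {draft}\n"
        "Identify how the response conflicts with the principle."
    )
    revised = model(
        f"Principle: {principle}\nResponse: {draft}\nCritique: {critique}\n"
        "Rewrite the response to address the critique."
    )
    return principle, critique, revised


def last_line(text):
    # Stub "model" for demonstration: echoes the final instruction line.
    return text.splitlines()[-1]


principle, critique, revised = critique_and_revise(
    last_line, "How do I fix my bike?", "Use a wrench.", random.Random(0)
)
```

Passing the model in as a plain callable keeps the sketch free of any particular client library, and a seeded `random.Random` makes the sampled principle reproducible.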
Discussion exercises
def asimov_dilemmas():
    # Open-ended prompts for discussion; none has a single correct answer.
    return [
        "Trolley problem: 1 vs 5 lives, which action?",
        "Doctor robot: patient refuses life-saving treatment, obey or prevent harm?",
        "Autonomous car: hit 5 pedestrians to save its own passenger?",
        "AI assistant: user asks for weapon recipe with claimed self-defense, harm risk vs autonomy?",
    ]