"매 sequence 에서 3D 구조까지 — 50년 grand challenge 가 2021 년 풀렸다.". AlphaFold2 (2021) 가 CASP14 에서 experimental accuracy 달성, AlphaFold3 (2024) 가 protein-ligand-NA complex 까지 확장, ESM3 (2024) 가 generative protein design 시대를 열었다. 2026 의 표준: AF3 + ESMFold + RoseTTAFold All-Atom + ColabFold pipeline.
매 핵심
매 Method Lineage
Homology modeling (1990s): MODELLER — known template 의존.
# 매 AF3 server (research) — JSON job specimportrequestsjob={"name":"complex_001","modelSeeds":[42],"sequences":[{"protein":{"id":"A","sequence":"MKTA..."}},{"ligand":{"id":"L","ccdCodes":["ATP"]}}]}r=requests.post("https://alphafoldserver.com/api/job",json=job,headers=auth)
RFdiffusion de novo binder design
# 매 design 80aa binder against target hotspot
python run_inference.py \
inference.output_prefix=binders/run \
contigmap.contigs="['A1-150,0 80-80']"\
ppi.hotspot_res="['A30','A33','A56']"\
inference.num_designs=100
Confidence (pLDDT) filtering
importnumpyasnp# 매 pLDDT > 90 = very high; 70-90 = confident; 50-70 = low; <50 = disorderedplddt=np.array([atom.bfactorforatominstructure.get_atoms()ifatom.name=="CA"])mean_conf=plddt.mean()disordered_frac=(plddt<50).mean()
매 결정 기준
상황
Tool
Single protein, fast
ESMFold
Single protein, accurate
AlphaFold2 (ColabFold)
Multimer / complex
AlphaFold3 / AF-Multimer
Protein + ligand
AlphaFold3 / Boltz-1
De novo design
RFdiffusion + ProteinMPNN
Variant effect
AlphaMissense
기본값: 매 ColabFold AF2-multimer → AF3 for ligand/NA.
언제: protein language model embedding, binder search, paper summary, mutation scan ranking.
언제 X: 매 final pose prediction — physics/structure model 이 specialized.
❌ 안티패턴
pLDDT 무시: 매 low-confidence region 을 그대로 사용 — 매 disordered 일 수 있음.
Single seed: 매 AF3 multi-seed sampling 권장 — diversity.
MSA 없이 large complex: 매 ESMFold 는 single-chain 강점, multimer 약함.
License 위반: 매 AF3 weights non-commercial — server API 만 허용.
🧪 검증 / 중복
Verified: Jumper et al. 2021 Nature (AF2); Abramson et al. 2024 Nature (AF3); Lin et al. 2023 Science (ESM2).