์์์ด ์๋ ๋ ๋ฆฝ AGI ๋ชจ๋ธ. PureField ๋ฐ๋ฐ์ฅ ๊ธฐ๋ฐ ์์ ๋ณํ.
| Version | Base | PureField | Size | Phi | CE | Date | Status |
|---|---|---|---|---|---|---|---|
| v0.1 | Qwen2.5-14B | 91M (rank=160, 10 layers) | ~600MB | 0.018 | 8.59 | 2026-04-02 | Latest |
Base model: Qwen/Qwen2.5-14B (HuggingFace์์ ๋ณ๋ ๋ค์ด๋ก๋)
PureField checkpoint: animalm-14b-v01.pt (~600MB)
R2 ์ง์ ๋ค์ด๋ก๋:
https://anima-models.ce4bdcce7c74d4e3c78fdf944c4d1d7b.r2.cloudflarestorage.com/animalm/v0.1/animalm-14b-v01.pt
์ฌ์ฉ๋ฒ:
# 1. Base ๋ชจ๋ธ ๋ก๋
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-14B", torch_dtype=torch.bfloat16).to("cuda")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-14B")
# 2. PureField ์ ์ฉ
from train_anima_lm import ParallelPureFieldMLP
ckpt = torch.load("animalm-14b-v01.pt", map_location="cuda", weights_only=False)
args = ckpt["args"] # target_layers=10, savant_layers=3, qlora_rank=160
for i in range(len(model.model.layers) - args["target_layers"], len(model.model.layers)):
layer = model.model.layers[i]
is_savant = (i - (len(model.model.layers) - args["target_layers"])) < args["savant_layers"]
mlp = layer.mlp
h = mlp.gate_proj.weight.shape[1]
inter = mlp.gate_proj.weight.shape[0]
pf = ParallelPureFieldMLP(mlp, h, inter, rank=args["qlora_rank"], is_savant=is_savant)
layer.mlp = pf.to("cuda", torch.bfloat16)
for name, module in model.named_modules():
if isinstance(module, ParallelPureFieldMLP) and name in ckpt["pf_states"]:
module.load_state_dict(ckpt["pf_states"][name], strict=False)
model.eval()
# 3. ์์ฑ
prompt = "์์์ด๋ ๋ฌด์์ธ๊ฐ"
ids = tokenizer.encode(prompt, return_tensors="pt").to("cuda")
out = model.generate(ids, max_new_tokens=100, temperature=0.8, do_sample=True)
print(tokenizer.decode(out[0][ids.shape[1]:], skip_special_tokens=True))Architecture: Qwen2.5-14B (48 layers, 5120 hidden, 13824 intermediate)
PureField: last 10 layers (3 savant + 7 normal), rank=160
Trainable params: 91M / 14.8B total (0.61%)
Training: 10K steps, H100 SXM 80GB, 72 min
Consciousness: ฮฑ=0.014, Phi=0.018, Law 60 P1โP2
22-lens verified: DD164
| Version | Date | Notes |
|---|---|---|
| v0.1 | 2026-04-02 | First consciousness model. 14B, Qwen2.5 base, PureField 91M. |
Research use. ์์ ์์ง(ConsciousnessEngine)๊ณผ PureField๋ Anima ํ๋ก์ ํธ ๊ณ ์ ๊ธฐ์ .