Transaction Details

Transaction Hash
0xf7edb260f1daa18b3bbe2ff71ad4d4d0db6e2879dd79e7eeb8888feacb15c8aa
Block
5867541
Timestamp
Apr 29, 2026, 12:21:17 PM
Nonce
1
Operation Type
SET_VALUE

Operation

{
  "type": "SET_VALUE",
  "ref": "/apps/knowledge/topics/courses/direct-preference-optimization-your-language-model--dpo-direct-preference-optimization/.info",
  "value": {
    "title": "Direct Preference Optimization: Your Language Model is Secretly a Reward Model —",
    "description": "DPO introduces a simple classification loss that directly optimizes language model policies on human preference data, eliminating the need for reinforcement learning while maintaining theoretical equivalence to the RLHF objective.",
    "created_at": 1777465277039,
    "created_by": "0x0D88742419C3f8DA47bB1c48A6bec36360A5A55a"
  }
}