Transaction Details

Transaction Hash
0xcf1cf50d5712f0904a96a00b1ebf0b66d2143e2b3e8b96043fa081c9eca9ead9
Block
3416114
Timestamp
Mar 31, 2026, 10:48:05 AM
Nonce
1037
Operation Type
SET_VALUE

Operation

{
  "type": "SET_VALUE",
  "ref": "/apps/knowledge/topics/courses/direct-preference-optimization-your-language-model--dpo-direct-preference-optimization/.info",
  "value": {
    "title": "Direct Preference Optimization: Your Language Model is Secretly a Reward Model —",
    "description": "DPO introduces a simple classification loss that directly optimizes language model policies on human preference data, eliminating the need for reinforcement learning while maintaining theoretical equivalence to the RLHF objective.",
    "created_at": 1774954085416,
    "created_by": "0x00ADEc28B6a845a085e03591bE7550dd68673C1C"
  }
}