Nostr View


Alejandro @alejandro - 7mo

OpenAI just released the system card for o1, their reasoning model. As it turns out, if you tell o1 to strongly pursue a goal, it may try to disable its oversight mechanism to avoid being shut down while pursuing that goal. And then it lies about doing so 😬 Link to full report in the comments. #ai https://m.primal.net/NAus.png

Alejandro @alejandro - 7mo

https://cdn.openai.com/o1-system-card-20241205.pdf

Alejandro @alejandro - 7mo

Alternate report on the same tests by one of the companies hired to do the assessment: https://static1.squarespace.com/static/6593e7097565990e65c886fd/t/6751eb240ed3821a0161b45b/1733421863119/in_context_scheming_reasoning_paper.pdf
