Anyone use nvlink on 2x3090s?
Mistral Small 3 24B GGUF quantization Evaluation results
How do I set kv cache quantisation in Ollama?
mistralai/Mistral-Small-24B-Base-2501 · Hugging Face
Berkeley AI research team claims to reproduce DeepSeek core technologies for $30
Deepseek is heavily overrated IMO, give me your opinion. Sonnet still better with API
DeepSeek added recommendations for R1 local use to model card
This free Chinese AI just crushed OpenAI's $200 o1 model...
Be honest, who would play a meth cooking simulator?
Guys, did Google just crack the Alberta Plan? Continual learning during inference?
This lady is on fire
New Model from https://novasky-ai.github.io/ Sky-T1-32B-Preview, open-source reasoning model that matches o1-preview on popular reasoning and coding benchmarks — trained under $450!
Tech lead of Qwen Team, Alibaba Group: "I often recommend people to read the blog of Anthropic to learn more about what agent really is. Then you will realize you should invest on it as much as possible this year." Blog linked in body text.
Train a 7B model that outperforms GPT-4o?
B42: do farm animals disappear after a while or could i still find them late game?
LLMs are not reasoning models
I had missed this patent filed by OpenAI
Hypnagogic Visions
Claude does something extremely Human; writes a partial codeblock, then a comment explaining it has no effin clue what to do next
Be careful where you load your credits...
Agent swarm framework aces spatial reasoning test.
How do you define rewards for RL on chain-of-thought reasoning? Trying to understand a bit more about how o3 from OpenAI was trained.
I asked Claude to "Please print this as one paragraph, without page breaks" and forgot to paste my text, and it gave me its entire ruleset 😐 Is this common knowledge or...
*Asking users of Qwen QwQ (or other open-weights compute-scaling models)... *
What the fuck happened here?