Discussion about this post

User's avatar
Davis's avatar

Does the data truly *have* to be text tho? Visual data is the highest bandwidth input to the human brain which makes me think there's something more there that we just haven't cracked yet. Thinking that only text data is important seems like a false premise imo. It's just the easiest thing to train on from an engineering perspective

Expand full comment
Justin CS's avatar

Nice post, I generally agree with the details, though I have some hypotheses to add.

First, I think it's a mistake to believe that it must be the "model" itself that becomes ASI. I believe that ASI may be represented by a system that has models as a component, but also includes orchestration. For instance, an ASI may be composed of multiple models that collaborate, along with memory systems to learn and self modify.

Second, I think that reasoning may be the key to continued progress. Instead of generating the output (e.g. completed computer program, resulting chess move), models are now generating textual reasoning that ultimately leads to the right decision. If they get really good at this, it may be the true path to ASI. The bot doesn't need to learn from data about every output, it simply needs to know how to get good at reasoning, then it can solve novel problems from first principles.

This is a unique opportunity because the internet is actually a relatively poor source of reasoning data. People generally don't write their reasoning fully into text form, they write the completed result (e.g. I didn't write the thoughts that I had while writing this comment, I just posted the comment)

Since there's so little data in this space, and yet we've already gotten pretty good results (the top models are reasoning models), perhaps we will be able to generate much better data in the future, whether through human generation or synthetic.

Expand full comment

No posts