Back to people
@soumithchintala
S

Soumith Chintala

リサーチ
@soumithchintala

Building new things @thinkymachines. Also dabble in robotics at NYU. Cofounded @PyTorch. AI is delicious when it is accessible and open-source.

304KFollowers1.3KFollowing4.0KPostsView on X

Recent posts

Interaction Models のデモがもっとあって、システム設計を協力して行ったり、論文を読んだり、ライブ生成 UI でファクトチェックしてる

原文を表示 (en)

more demos on Interaction Models collaboratively doing system design, reading papers, fact-checking with live generative UI

@SeongsikKi5837
S
Seongsik Kim@SeongsikKi5837

1. (System design) - The Interaction Models see your screen and collaborates with you live. Here we're building a scalable system architecture together — no copy-pasting, no switching tabs, just thinking out loud and drawing on the screen together.

クラスタマジシャンとGPUささやき職人たちよ、我々に参加しましょう! リアルタイムインタラクティブモデル、Tinker、大規模トレーニングの背後にあるインフラを構築するスーパーコンピューティングエンジニアを探しています:スケジューリング、ストレージ、ネットワーキング、信頼性、大規模な分散システム。 NYCとSFで採用中 https://t.co/jCx00R6UvB

原文を表示 (en)

Cluster magicians and GPU whisperers, come join us! We’re looking for supercomputing engineers to build the infrastructure behind real-time interactive models, Tinker, and large-scale training: scheduling, storage, networking, reliability, and distributed systems at scale. Hiring in NYC and SF https://t.co/jCx00R6UvB

Thinkyの秘密計画: 1: Human<->AIの帯域幅を上げる 2: Human+AIインテリジェンスの上限を上げる 3: 新しい世界で人間がメインキャラクターであり続けるのを助ける 俺らはStep 1にいる。 Interaction Modelsは人間のための素晴らしいリアルタイム協調ツール。 プレビュー:

原文を表示 (en)

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

@thinkymachines
T
Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

1.5K114357117KXで開く

GithubとCriticalな仕事は相容れなくなってきている

原文を表示 (en)

Github and critical work are becoming incompatible

これはちょっと興味深い。 AnthroはアカウントサポートをClaudeか従来のアカウントマネージャー経由でスケールアップする必要があるんだろう。人間を選んだら面白いな。 または、どんどん多くの企業が複数AIを統合ハーネスで使うようになる可能性もある。 クラウド時代のエンタープライズペイン問題に似てる。他のAIプロバイダーにも同じ問題が当てはまると思う。 https://t.co/8MOSWTylgs

原文を表示 (en)

This is kinda interesting. Anthro probably needs to scale up Account Support via Claude or via humans (traditional account managers). Would be funny if they chose humans. Alternatively, more and more companies can probably go multi-AI with open harnesses. Similar problems to the cloud era enterprise pains. I'm sure these issues would apply to all the other AI providers. https://t.co/8MOSWTylgs

The Jensen + @dwarkesh_sp podcast was fantastic. Jensen is someone who understood how ecosystems work and someone who understands real-world trade, policy and controls work. And in some deeper sense how AI will actually diffuse into the world. In this podcast, Dwarkesh came off as someone who picked up talking points from an AGI party in the SF Mission District. And the contrast was so evident. As someone who understood ecosystems relatively deepy, maybe I understood Jensen's take more than others did (idk). Mythos, that Dwarkesh kept bringing up, is not a single absolute turning point in the AI development landscape. Take a state-of-the-art Chinese open-source model, and give it three orders of magnitude more test-time compute + post-training algorithmic advances that haven't been published yet. That's the baseline. It was evident that in whatever bubble Dwarkesh is in, that is seen as a naive or illogical baseline. When AI has such a complex development cycle, it's evident that America needs many levers of policy intervention across multiple layers in a dominant ecosystem that ideally the Western world controls. The entire premise that a particular model with AI development will have a critical phase change is neither correct nor does evidence point to it. OpenAI made this point with GPT-4, Anthropic made this point with Mythos, but neither stood / will stand the test of time. I think Jensen's repeated emphasis within the podcast to try to make this point mostly didn't get Dwarkesh's attention. And Dwarkesh (in this podcast) represents an entire cult of AI researchers and decision-makers that are going to influence policy. The thing with policy interventions is that if you do too much too early, you shoot yourself in the foot. There's a good reason American foreign policy and general sanctions of all kinds are measured and continuous. Despite Jensen's attempt at educating the "Anthro" audience how ecosystems work, I'm also not super hopeful a lot of people who've taken the extreme position will change their thought after listening to this podcast. I do think there's a certain religiousness that has permeated some of that community that would make it hard to understand ecosystems at a deeper level.

@dwarkesh_sp
D
Dwarkesh Patel@dwarkesh_sp

The Jensen Huang episode. 0:00:00 – Is Nvidia’s biggest moat its grip on scarce supply chains? 0:16:25 – Will TPUs break Nvidia’s hold on AI compute? 0:41:06 – Why doesn’t Nvidia become a hyperscaler? 0:57:36 – Should we be selling AI chips to China? 1:35:06 – Why doesn’t Nvidia make multiple different chip architectures? Look up Dwarkesh Podcast on YouTube, Apple Podcasts, Spotify, etc. Enjoy!

1.1K109436211KXで開く

Just like us at @thinkymachines, @luke_drago_ @LRudL_ care a lot about amplifying humans, not replacing them. They're joining us to build towards that future!

@WorkshopLabs
W
Workshop Labs@WorkshopLabs

Workshop Labs is joining @thinkymachines. We believe there's a path for AI to make humans matter more. We couldn’t be prouder to join Thinking Machines to see this work through. https://www.workshoplabs.ai/blog/wsl-joining-tml

someone's getting started early!

Photo 1
3.6K168185107KXで開く