True but a cluster built on pipeline parallelism can naturally stream from multi... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

zozbot234 59 days ago | parent | context | favorite | on: Microsoft and OpenAI end their exclusive and reven...

True but a cluster built on pipeline parallelism can naturally stream from multiple SSD's in parallel. That probably makes offload somewhat more effective. And you also have RAM caching available as a natural possibility.

bigyabai 59 days ago [–]

You won't be RAM caching much of anything with experts that are 220b parameters worth of layers.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact