Original article excerpt
Server-side extracted preview paragraphs from the original source.
South Korean chip startup XCENA is betting that AI's real bottleneck is not compute, but memory.
Every time you ask ChatGPT a question, your request triggers a data relay race. Information leaves memory, passes through a CPU for preprocessing, travels to a GPU for heavy computation, and then makes its way back — and that entire journey repeats for every single word the AI generates.
