KAYTUS Launches All-QLC Flash Storage at AI EXPO 2026 for 10,000-GPU Clusters

2026年05月10日 01:10:05  [来源:]  [作者:]  [责编:admin]
字体:【

KAYTUS’s next-generation all-QLC flash solution delivers fully linear performance scaling for massive GPU clusters, while reducing TCO by 70%, enabling ultra-large-scale computing for the era of agentic AI.

SINGAPORE--(BUSINESS WIRE)--At AI EXPO KOREA 2026, KAYTUS officially launched its All-QLC Flash Storage Solution, engineered to deliver high performance, massive scalability, and cost efficiency for 10,000-GPU clusters. The solution addresses data-delivery bottlenecks in ultra-large-scale AI training, helping maximize GPU resource utilization.

Based on the KR2280 and KR1180 server platforms, the solution is deeply integrated with industry-leading AI-native parallel file systems to eliminate data silos inherent in traditional tiered storage. Purpose-built for read-intensive AI workloads, it overcomes the horizontal scaling limitations of massive clusters. Verified test-data shows that, at exabyte-scale deployment, the solution delivers 10 TB/s aggregate bandwidth and 100 million IOPS. In addition, it reduces five-year TCO by 70% compared with traditional TLC-based solutions, accelerating model innovation for AI cloud providers and intelligent computing centers.

Limitations in Traditional AI Storage Architectures.

The explosive growth of AI is fundamentally transforming enterprise computing and storage requirements. Large-scale AI model training features highly read-intensive workloads that require tens of thousands of GPUs to concurrently access exabyte-scale datasets with sub-millisecond latency. Traditional storage architectures now face three major challenges:

  • Separated Data Silos: Traditional ETL processes require data to be moved from object storage to parallel file systems before training, resulting in time-consuming physical data migration. IDC research indicates that data teams spend 81% of their time on data preparation, slowing business iteration.
  • Workload and Media Mismatch: More than 90% of AI training involves high-frequency concurrent reads. In contrast, traditional TLC flash solutions provide excessive write endurance that is unnecessary for these read-intensive workloads, driving up procurement, space, and power costs for exabyte-scale clusters and resulting in inefficient resource utilization.
  • Scalability Bottlenecks: Traditional file systems were not designed to handle the I/O burst workloads generated by 10,000-GPU clusters. As clusters scale, metadata lock contention and communication overhead introduce latency spikes and degraded overall performance.

KAYTUS Solution: All-QLC Flash Storage for Delivering High Performance, Scalability, and Cost Efficiency.

The next-generation KAYTUS All- QLC Flash Storage Server Solution is purpose-built to unlock the full potential of read-intensive AI training workloads. By tightly integrating flagship compute nodes with industry-leading AI-native parallel file systems, the solution harnesses advanced hardware–software co-design to deliver breakthrough performance, seamless scalability, and superior cost efficiency for ultra-large-scale AI computing environments.

Architectural Innovation: Overcoming AI Training Efficiency Bottlenecks.

The KAYTUS solution establishes a unified namespace with native multi-protocol access across file, object, and block storage. By leveraging high-capacity QLC flash pools and NVMe-oF fully shared interconnects, it redefines the unified data plane for AI storage, effectively eliminating the data silos inherent in traditional tiered architectures. Data can now flow on demand to GPU nodes without cross-system migration, enabling sub-millisecond access, and significantly improving AI training data retrieval efficiency.

  • Hardware Optimization: Engineered for read-intensive workloads, the solution features a PCIe 5.0 direct-connect architecture that doubles single-node I/O bandwidth compared to the previous generation. Combined with NUMA-balanced optimization, it effectively eliminates internal throughput bottlenecks.
  • Software Synergy: The solution integrates NFS over RDMA and native GPU Direct Storage technology, enabling direct data paths from QLC flash to GPU memory. By leveraging a disaggregated architecture that decouples protocol processing from storage states, it eliminates east-west traffic and achieves fully linear scaling of bandwidth and throughput, from petabyte to exabyte scale.

10,000-GPU Cluster Benchmarks: Exceptional Performance, Scalability, and Cost Efficiency

In benchmark testing in an exabyte-scale storage environment for a 10,000-GPU data center, the solution—powered by KR2280 and KR1180 nodes and optimized with industry-leading AI-native parallel file systems—demonstrated its capability to scale seamlessly to support computing clusters of up to 10,000 GPUs.

  • Extreme Performance at Scale: The system delivers 10 TB/s sustained aggregate read bandwidth and 100 million random-read IOPS, enabling concurrent access for tens of thousands of GPUs. Performance scales linearly as additional nodes are added, while GPU utilization remains consistently above 95%, with no storage-side lock contention or queuing, effectively eliminating GPU data starvation.
  • Superior Cost Efficiency: Compared with traditional TLC all-flash solutions, the solution reduces five-year TCO by 70%, cuts power and cooling costs by more than 75%, helping enterprises avoid overpaying for unnecessary extra write endurance.

Metric (1 EB Capacity)

TLC SSD Solution

QLC SSD Solution

Difference

CAPEX

1.0

0.39

65% ↓

Power Cost

1.0

0.29

75% ↓

5-Year TCO

1.0

0.36

70% ↓

(Note: Based on 15.36T TLC vs 61.44T QLC drive units)

 
猫扑网友:虐恋sadomasochism
评论:所谓出轨就是玩腻了自己的爱人,去玩别人玩腻的爱人。

本网网友:关于病态美beauty ×
评论:我要努力实现梦想,以弥补小时候吹过的牛。

淘宝网友:控魂者*monee
评论:向上爬时,对遇到的人好点,因为掉下来时,你还会遇到他们。

天猫网友:㎜  旧梦失词
评论:看我不顺眼的人,能给您心里添堵,我真是舒坦。

其它网友:Cold-blooded 凉薄
评论:最郁闷的是:网上购票,钱从账户划去了,票没出来。

凤凰网友:听ヽ天甾哭泣
评论:真怀念小时候啊,天热的时候我也可以像男人一样光膀子。

网易网友:私念° 7/m
评论:结婚的日子我已经定好了,现在就差定新郎了。

天涯网友:流受ranmuy/
评论:好名声是女人最体面的嫁妆。

腾讯网友:ゆ.舌尖腥咸
评论:世界上最遥远的距离,莫过于周一到周五。

搜狐网友:我的就是你的
评论:当今社会一瞥:男人女人化;女人小孩化;小孩宠物化;宠物贵族化;贵族痞子化;痞子玩文化;文化商业化。

相关新闻
新闻焦点
公司2024年影响力报告展示了在可持续发展方面的重大进展和创新成果洛杉矶--(美国商业资讯)--拥有40年历史的领先消费电子品牌Belkin今日发布了其2024[更多]
First results from a Phase 3 trial conducted across 25 centers in Japan were presented at the Annual Meeting of the Japanese Urological As[更多]
文传商讯,中国领先的新闻稿来源,由国际文传(中国)电讯社携手美国商业资讯对外提供。帮助美国商业资讯将其客户的新闻稿呈现给大中华区的目标受众[更多]
关于我们 | 广告服务 | 浙江热线 | 旗龙网 | 听鱼网 | 2349 | 法律声明 | 联系我们
站务及信息报错:13757197494 (非诚勿扰) | QQ:1160322105 版权所有:上海经济新闻网 未经授权禁止复制或建立镜像
相关作品的原创性、文中陈述文字以及内容数据庞杂本站无法一一核实,如果您发现本网站上有侵犯您的合法权益的内容,请联系我们,本网站将立即予以删除!
中国互联网违法和不良信息举报中心  全国新闻记者证管理及核验网络系统  网络警察报警岗亭