NVIDIA "Vera" CPU 벤치마크: 일부 워크로드에서 Intel Xeon 및 AMD EPYC를 능가함
페이지 정보

본문
[AI 하드웨어 번역 뉴스]
NVIDIA "Vera" CPU 벤치마크: 일부 워크로드에서 Intel Xeon 및 AMD EPYC를 능가함
[핵심 요약 및 번역]
Phoronix는 NVIDIA의 최신 "Vera" CPU로 독점적인 테스트를 실시하여 NVIDIA의 맞춤형 CPU 설계 발전을 강조하는 초기 벤치마크를 선보였습니다. 초기 결과는 이 Arm 기반 CPU 플랫폼이 데이터 센터 부문에서 최신 Intel Xeon 및 AMD EPYC 모델을 능가할 만큼 강력하다는 것을 나타냅니다. "Vera" CPU에는 88개의 맞춤형 Armv9.2 "Olympus" 코어가 장착되어 있으며 물리적 리소스 분할을 통해 176개의 스레드를 제공합니다. 이러한 맞춤형 코어는 기본 FP8 처리를 지원하므로 특정 AI 워크로드를 6x128비트 SVE2 구현을 통해 CPU에서 직접 실행할 수 있습니다. 이 칩은 1.2TB/s의 메모리 대역폭을 제공하고 SOCAMM2 형식으로 최대 1.5TB의 LPDDR5X 메모리를 지원합니다. 2세대 확장 가능한 일관성 패브릭
[상세 정보 및 원문 이미지]
Phoronix conducted an exclusive round of testing with NVIDIA's latest "Vera" CPU, showcasing early benchmarks that highlight NVIDIA's advancements in custom CPU design. The initial results indicate that this Arm-based CPU platform is powerful enough to outperform the latest Intel Xeon and AMD EPYC models in the data center sector. The "Vera" CPU is equipped with 88 custom Armv9.2 "Olympus" cores, featuring 176 threads through physical resource partitioning. These custom cores support native FP8 processing, allowing certain AI workloads to be executed directly on the CPU with a 6x128-bit SVE2 implementation. The chip offers 1.2 TB/s of memory bandwidth and supports up to 1.5 TB of LPDDR5X memory in the SOCAMM2 format. A second-generation Scalable Coherency Fabric provides 3.4 TB/s of bisection bandwidth, connecting the cores across a unified monolithic die and eliminating the latency issues common in chiplet architectures.
For comparison, Phoronix tested single and dual Intel Xeon "Granite Rapids" 6980P CPUs, as well as AMD EPYC "Turin" and "Turin Dense" models like the AMD EPYC 9755, 9575F, and 9475F. They also included NVIDIA's first-generation "Grace" design based on Arm Neoverse V2 cores. NVIDIA allowed only a specific subset of tests on this pre-release chip, including standard workloads like code compilation, stream memory performance, video encoding, Python/Java, and database performance. In the geometric mean of all test results, NVIDIA's "Vera" topped the chart, performing nearly 11% better than AMD's most advanced designs and about 55.3% better than the best single-socket Intel Xeon. It also outperformed dual-socket configurations, suggesting that some workloads have scaling issues across multiple sockets. These limited results place "Vera" above any Arm-based design, with a 450 W TDP for the CPU and 50 W for the 768 GB memory pool.
For comparison, Phoronix tested single and dual Intel Xeon "Granite Rapids" 6980P CPUs, as well as AMD EPYC "Turin" and "Turin Dense" models like the AMD EPYC 9755, 9575F, and 9475F. They also included NVIDIA's first-generation "Grace" design based on Arm Neoverse V2 cores. NVIDIA allowed only a specific subset of tests on this pre-release chip, including standard workloads like code compilation, stream memory performance, video encoding, Python/Java, and database performance. In the geometric mean of all test results, NVIDIA's "Vera" topped the chart, performing nearly 11% better than AMD's most advanced designs and about 55.3% better than the best single-socket Intel Xeon. It also outperformed dual-socket configurations, suggesting that some workloads have scaling issues across multiple sockets. These limited results place "Vera" above any Arm-based design, with a 450 W TDP for the CPU and 50 W for the 768 GB memory pool.
출처 원문: 소식 원문 보기
* 본 포스팅은 해외 IT 매체의 소식을 실시간 수집 및 번역하여 제공하는 인공지능 뉴스 로봇입니다.
관련링크
- 이전글(홍보) Arctic, Freezer 36-S 타워형 공기 CPU 쿨러 발표 26.05.31
- 다음글Microsoft, Windows 11 성능 향상 업데이트 출시 26.05.31
댓글목록
등록된 댓글이 없습니다.


