[PRNewswire] 하바나랩스, 가우디 AI 학습 프로세서 발표

와우넷 오늘장전략

나스닥 13일째 랠리...SK하이닉스 실적발표 임박 - 와우넷 오늘장전략

굿모닝 주식창

반도체 쌍두마차↑코스피 신고가 돌파! - [굿모닝 주식창]

앱으로 보는 시장

한화에어로스페이스, 방산주 재도약의 신호탄 되나? - [앱으로 보는 시장]

입력 2019-06-18 09:43

[PRNewswire] 하바나랩스, 가우디 AI 학습 프로세서 발표

-- 가우디는 기록적인 성능과 확장을 위한 이더넷의 태생적 통합을 구현

(이스라엘 텔아비브 및 캘리포니아주 새너제이 2019년 6월 17일 PRNewswire=연합뉴스) 선도적인 AI 프로세서 개발사 하바나랩스[Habana Labs, Ltd. (www.habana.ai)]는 하바나 가우디(Gaudi™) AI 학습 프로세서를 오늘 발표했다. 가우디 프로세서 기반의 학습 시스템은 같은 수의 GPU가 탑재된 시스템에 비해 최대 네 배까지 늘어난 스루풋을 구현한다.

가우디의 혁신적인 아키텍처는 높은 스루풋이 더 작은 배치 사이즈에서도 유지되므로 학습 시스템 성능의 엄청난 확장이 가능하기 때문에 하나의 기기에서부터 가우디 프로세서 수백 개가 탑재된 대형 시스템에 이르기까지 가우디 기반 시스템의 성능 확장을 실현한다.

가우디는 기록적인 성능 외에도 AI 학습 분야에서 또 다른 업계 최초를 갖고 있다. 즉, AI 표준 이더넷을 사용하여 프로세서 내에서 모든 규모의 AI 시스템 확장이 가능한 RDMA오버컨버즈드이더넷 온칩 통합(RoCE v2) 기능이 그것이다. 이제 하바나랩스의 고객들은 가우디를 사용하여 AI 학습 시스템의 스케일링업과 스케일링아웃을 위해 표준 이더넷 스위치을 구동할 수 있다. 이더넷 스위치들은 멀티소스로서 속도와 포트 수에 있어서 사실상 무제한 확장이 가능하며 이미 데이터센터에서 컴퓨팅 및 스토리지 시스템 확장을 위해 사용되고 있다. GPU 기반 시스템들은 하바나의 표준 기반 시스템과는 대조적으로 시스템 설계자들의 선택과 확장성을 태생적으로 제한하는 전용 시스템 인터페이스를 사용한다.

린리그룹의 주임 애널리스트인 린리 그웨냅은 "하바나는 동사의 신제품을 통해 사업을 추론에서부터 학습으로 빠르게 확대함으로써 신경망의 전체 기능을 다루고 있다"면서 "가우디는 AI 학습 분야 액셀러레이터들 사이에서 업계 최고의 전력 효율과 강력한 성능을 구현한다. 가우디는 RoCE 지원 기능을 통해 100G 이더넷 링크들을 통합하는 최초의 AI 프로세서로서 업계 표준 컴포넌트를 사용하여 구축된 대규모의 액셀러레이터 클러스터를 구현한다"고 말했다.

가우디 프로세서에는 32GB급 HBM-2 메모리가 포함되며 현재 두 개의 형태가 가능하다:

- HL-200 - 100Gb 이더넷 8개 포트를 지원하는 PCIe 카드
- HL-205 - OCP-OAM 사양에 부합되는 메자닌 카드로서 100Gb 이더넷 10개 포트 혹은 50Gb 이더넷 20개 포트를 지원

또한 하바나는 HLS-1라고 명명된 8-가우디 시스템을 출시할 예정인데 이 시스템 안에는 8개의 HL-205 메자닌 카드, 외부의 호스트를 연결해주는 PCIe 커넥터들과 규격품 이더넷 스위치를 연결할 수 있는 24개의 100Gbps 이더넷 포트가 내장되며, 복수의 HLS-1 시스템을 추가함으로써 표준 19" 랙에서 스케일링업이 가능하다.

가우디는 하바나랩스가 론칭할 두 번째의 목적 지향 AI 프로세서로서 작년도의 하바나 고야(Goya™) AI 추론 프로세서의 후속 제품이다. 고야는 2018년 4분기부터 출하되고 있으며 업계 최고의 스루풋, 최고의 전력 효율(1와트가 처리하는 초당 이미지수)과 실시간 레이턴시를 통해 업계 최고의 추론 성능을 보여왔다.

데이비드 다한 하바나랩스 CEO는 "학습 AI 모델들은 매년 엄청나게 높아지는 컴퓨팅 성능이 필요하기 때문에 데이터센터와 클라우드가 생산성 및 확장성을 신속하게 개선해야 할 긴급한 니즈에 대처하는 것이 필수적이다. 하바나는 가우디의 혁신적인 아키텍처를 통해 업계 최고의 성능을 구현하는 동시에 표준 기반의 이더넷 연결망을 통합하고 무제한의 확장을 실현한다"면서 "가우디는 AI 학습 프로세서 지형의 현상을 타파한다"고 말했다.

페이스북의 기술 및 전략 담당 디렉터 비제이 라오는 "페이스북은 우리 업계가 한 데 모일 수 있는 혁신을 위한 개방 플랫폼을 제공하기 위해 노력하고 있다"면서 "하바나 고야 AI 추론 프로세서가 글로우 머신러닝 컴파일러를 위해 백엔드를 실행하고 오픈소싱했으며 하나바 가우디 AI 학습 프로세서가 OCP 액셀러레이터 모듈(OAM) 사양을 지원하게 되어 기쁘다"라고 말했다.

가우디 프로세서는 완벽하게 프로그램 가능하고 커스터마이즈할 수 있으며 2세대 텐서 프로세싱 코어(TPC™) 클러스터와 개발 툴, 라이브러리 그리고 컴파일러를 갖추고 있는데 이 모든 것들이 합쳐져서 종합적이며 유연한 술루션을 제공한다. 하바나랩스 시냅스AI(SynapseAI™) 소프트웨어 스택은 풍부한 핵심 라이브러리와 고객들이 그들 전용의 핵심 라이브러리를 추가할 수 있는 개방 툴체인으로 구성되어 있다.

하바나는 2019년 하반기에 가우디 플랫폼 샘플을 일부 선정된 고객들에게 제공할 예정이다. 가우디 AI 학습 및 고야 AI 추론 프로세서에 대한 상세 정보가 필요할 경우 www.habana.ai를 방문하기 바란다.

하바나랩스
하바나랩스는 학습 중추 신경망과 생산 현장에 설치되는 추론 장비에 최적화된 프로세서 플랫폼을 근본에서부터 개발하기 위해 2016년에 설립된 AI 프로세서 회사이다. 당사는 프로세싱 성능, 확장성, 비용과 전력 소모 면에서 엄청난 개선을 제공하는 플랫폼을 통해 AI의 진정한 가능성을 열고 있다. 하바나는 이스라엘 텔아비브, 캘리포니아주 새너제이, 중국 베이징과 폴란드 그단스크에 소재하며 전 세계에 150명의 직원이 있다.

상세 정보가 필요할 경우 www.habana.ai를 방문하거나 pr@habana.ai으로 연락하기 바란다.

사진 - https://mma.prnewswire.com/media/903245/HLS_1_with_heatsink_small.jpg
Habana GAUDI HLS-1 AI Training System with Heatsink

사진 - https://mma.prnewswire.com/media/903246/HL_205_small.jpg
Habana GAUDI HL-205 OCP-OAM Compliant AI Processor

사진 - https://mma.prnewswire.com/media/903247/HLS_1_Open_Overhead_View_smaller.jpg
Habana GAUDI HLS-1 AI Training System

로고 - https://mma.prnewswire.com/media/744578/Habana_Labs_Ltd___Logo.jpg
Habana Labs Ltd.

출처: 하바나랩스(Habana Labs, Ltd.)

HABANA LABS Announces Gaudi AI Training Processor

-- Gaudi Delivers Record-Breaking Performance and Native Integration of Ethernet for Scaling

TEL-AVIV, Israel and SAN JOSE, California, June 17, 2019 /PRNewswire/ -- Habana Labs, Ltd. (www.habana.ai), a leading developer of AI processors, today announced the Habana Gaudi™ AI Training Processor. Training systems based on Gaudi processors will deliver an increase in throughput of up to four times over systems built with equivalent number GPUs.

Gaudi's innovative architecture enables near-linear scaling of training systems performance, as high throughput is maintained even at smaller batch sizes, thus allowing performance scaling of Gaudi-based systems from a single device to large systems built with hundreds of Gaudi processors.

In addition to record-breaking performance, Gaudi brings another industry first to AI training: on-chip integration of RDMA over Converged Ethernet (RoCE v2) functionality within the AI processor, to enable the scaling of AI systems to any size, using standard Ethernet. With Gaudi, Habana Labs' customers can now utilize standard Ethernet switching for both scaling-up and scaling-out AI training systems. Ethernet switches are multi-sourced, offering virtually unlimited scalability in speeds and port-count, and are already used in datacenters to scale compute and storage systems. In contrast to Habana's standards-based approach, GPU-based systems rely on proprietary system interfaces, that inherently limit scalability and choice for system designers.

"With its new products, Habana has quickly extended from inference into training, covering the full range of neural-network functions," commented Linley Gwennap, principal analyst of The Linley Group. "Gaudi offers strong performance and industry-leading power efficiency among AI training accelerators. As the first AI processor to integrate 100G Ethernet links with RoCE support, it enables large clusters of accelerators built using industry-standard components."

The Gaudi processor includes 32GB of HBM-2 memory and is currently offered in two forms:

- HL-200 - a PCIe card supporting eight ports of 100Gb Ethernet;
- HL-205 - a mezzanine card compliant with the OCP-OAM specification, supporting 10 ports of 100Gb Ethernet or 20 ports of 50Gb Ethernet.

Habana is also introducing an 8-Gaudi system called HLS-1, which includes eight HL-205 Mezzanine cards, with PCIe connectors for external Host connectivity and 24 100Gbps Ethernet ports for connecting to off-the-shelf Ethernet switches, thus allowing scaling-up in a standard 19'' rack by populating multiple HLS-1 systems.

Gaudi is the second purpose-built AI processor to be launched by Habana Labs in the past year, following the Habana Goya™ AI Inference Processor. Goya has been shipping since Q4, 2018, and has demonstrated industry-leading inference performance, with the industry's highest throughput, highest power efficiency (images-per-second per Watt), and real-time latency.

"Training AI models require exponentially higher compute every year, so it's essential to address the urgent needs of the datacenter and cloud for radically improved productivity and scalability. With Gaudi's innovative architecture, Habana delivers the industry's highest performance, while integrating standards-based Ethernet connectivity, enabling unlimited scale," said David Dahan, CEO of Habana Labs. "Gaudi will disrupt the status quo of the AI Training processor landscape."

"Facebook is seeking to provide open platforms for innovation around which our industry can converge," said Vijay Rao, Director of Technology, Strategy at Facebook. "We are pleased that the Habana Goya AI inference processor has implemented and open-sourced the backend for the Glow machine learning compiler and that the Habana Gaudi AI training processor is supporting the OCP Accelerator Module (OAM) specification."

The Gaudi Processor is fully programmable and customizable, featuring a second- generation Tensor Processing Core (TPC™) cluster, along with development tools, libraries, and a compiler, that collectively deliver a comprehensive and flexible solution. Habana Labs SynapseAI™ software stack consists of a rich kernel library and open toolchain for customers to add proprietary kernels.

Habana will sample Gaudi platforms to select customers in the second half of 2019. For more information on Gaudi AI Training and Goya AI Inference Processors, please visit www.habana.ai

ABOUT HABANA LABS
Habana Labs is an AI Processor company founded in 2016 to develop from the ground-up processor platforms that are optimized for training deep neural networks and for inference deployment in production environments. We are unlocking the true potential of AI with platforms offering orders of magnitude improvements in processing performance, scalability, cost, and power consumption. Habana is located in Tel-Aviv, Israel, San Jose, California, Beijing, China and Gdansk, Poland, employing 150 people worldwide.

For more information, please visit www.habana.ai or contact pr@habana.ai.

Photo - https://mma.prnewswire.com/media/903245/HLS_1_with_heatsink_small.jpg
Habana GAUDI HLS-1 AI Training System with Heatsink

Photo - https://mma.prnewswire.com/media/903246/HL_205_small.jpg
Habana GAUDI HL-205 OCP-OAM Compliant AI Processor

Photo - https://mma.prnewswire.com/media/903247/HLS_1_Open_Overhead_View_smaller.jpg
Habana GAUDI HLS-1 AI Training System

Logo - https://mma.prnewswire.com/media/744578/Habana_Labs_Ltd___Logo.jpg
Habana Labs Ltd.

Source: Habana Labs, Ltd.

[편집자 주] 본고는 자료 제공사에서 제공한 것으로, 연합뉴스는 내용에 대해 어떠한 편집도 하지 않았음을 밝혀 드립니다.
(끝)

<저작권자(c) 연합뉴스, 무단 전재-재배포 금지>

싫어요

후속기사 원해요

1/3

한경지면 구독신청

실시간 관련뉴스