Huawei and Peng Cheng Laboratory build Cloud Brain II

AI Clusters at 1000 PFLOPS Scale

1,068

Huawei and Peng Cheng Laboratory (PCL) have jointly released Peng Cheng Cloud Brain II Phase 1, officially launching the journey to AI clusters at 1000 petaFLOPS (PFLOPS) scale. This marks a new milestone in the scientific research field for the Kunpeng computing industry.

Running at the bedrock of Cloud Brain II is the Huawei Atlas 900 AI cluster, powered by the Huawei Kunpeng and Ascend processors. Atlas 900 infuses robust computing power into Cloud Brain II, supporting basic research and exploration in the AI field, such as computer vision, natural language, autonomous driving, smart transportation, and smart healthcare.

The computing power of Peng Cheng Cloud Brain is currently 100 PFLOPS, planned to scale to 1000 PFLOPS and higher next year.

“This September, Huawei embarked on the Kunpeng + Ascend dual-engine computing strategy. Inspired by this strategy, we are committed to providing the ultimate computing power to the world. We also released Atlas 900, the world’s fastest AI training cluster,” said Hou Jinlong, senior VP of Huawei, and president of Huawei Cloud & AI Products and Services.

“Today, we’re proud to see Atlas 900 selected for the Peng Cheng Cloud Brain II project. It lays the cornerstones for Cloud Brain II. Cloud Brain II is an industry-leading AI research platform. PCL has brought together many academicians and AI research talents. Huawei will join PCL in spearheading forefront scientific research for an intelligent world,”

“Right now we are building Cloud Brain II Phase 1. I believe that, with our joint effort, this will pave the way to a Cloud Brain II at 1000 PFLOPS scale in the near future. We are confident that it will become a world-leading AI research platform.”

Peng Cheng Cloud Brain is a key technology facility in the field of AI. It is a basic research platform for exploring the frontier of AI technology. Currently it boasts an AI computing power of 100 PFLOPS, and is expected to reach 1000 PFLOPS scale in the Cloud Brain II project next year.

Michael Ma, president of Huawei’s Intelligent Computing Business Dept, said,

“Huawei builds the Atlas AI computing platform based on the Ascend processors, providing a broad portfolio of modules, cards, edge stations, AI servers, and clusters. Our all-scenario AI infrastructure covers the cloud-edge-device, supporting full-pipeline inference and training for deep learning,” said Ma.

“Our flagship Atlas product, Atlas 900, stands as the pinnacle of AI computing in the world. The combination of Atlas 900 and Cloud Brain II will embark on a new chapter for AI clusters at 1000 PFLOPS scale, and unlock superb computing power to drive faster intelligent transformation across industries.”

Cloud Brain II is jointly built by PCL and Huawei. Running on Huawei Kunpeng and Ascend processors, the Atlas 900 AI cluster provides superior computing power. PCL develops Cloud Brain technologies at 1000 PFLOPS.

The Atlas 900 AI cluster has inherited Huawei’s technology know-how of over a decade. Composed of thousands of Ascend 910 AI processors, Atlas 900 completes training of a ResNet image classification model in 59.8s, 10 seconds faster than the previous world record at the same precision. The powerful computing capability of Atlas 900 delivers a difference in scientific research and technology innovation, such as astronomical exploration, weather forecasting, autonomous driving, and oil exploration.

Atlas 900 highlights:

  • Powerful computing: Combining thousands of Ascend 910 AI processors, Atlas 900 delivers 256–1024 PFLOPS at half precision (FP16), which equals the computing power of 500,000 PCs. The SoC design integrates AI computing, general-purpose computing, and I/O functionality to effectively improve training efficiency.
  • High-speed cluster network: It supports three types of high-speed network interfaces: Huawei Cache Coherence System (HCCS), PCIe 4.0, and 100G RoCE, slashing the gradient synchronisation latency by 10 per cent to 70 per cent for a leap in model training efficiency. It leverages an innovative iLossless intelligent switching algorithm to enable real-time learning and training of network-wide traffic, achieving zero packet loss and end-to-end latency of microseconds.
  • Ultimate heat dissipation: Atlas 900 uses a cabinet-level contained adiabatic system, achieving a liquid cooling ratio over 95 per cent and a system power usage effectiveness (PUE) less than 1.1 (an ideal PUE is 1.0).
  • Up to now, based on the Ascend 910 and 310 AI processors, Huawei has launched the Atlas 900 AI cluster, Atlas 800 AI server, Atlas 500 AI edge station, Atlas 300 AI accelerator card, and Atlas 200 AI accelerator module. The holistic Atlas portfolio provides powerful computing for training and inference in all scenarios across cloud-edge-device.

Looking ahead, Huawei will continue to step up investment and innovation in infrastructure such as processors, operating systems, and databases for the Kunpeng computing industry, and provide AI computing for Peng Cheng Cloud Brain, together bringing AI technology faster to reality for a wider scope of application.

Please Login to comment
  Subscribe  
Notify of

Rapid Mobile uses cookies, tokens, and other third party scripts to recognise visitors of our sites and services, remember your settings and privacy choices, and - depending on your settings and privacy choices - enable us and some key partners to collect information about you so that we can improve our services and deliver relevant ads.

 

By continuing to use our site or clicking Agree, you agree that Rapid Mobile and our key partners may collect data and use cookies for personalised ads and other purposes, as described more fully in our privacy policy.

 

You can change your settings at any time by clicking Manage Settings or by visiting our Privacy Centre for more detailed information.

 

Privacy Settings saved!
Cookie Services

When you visit any website, it may store or retrieve information on your browser, mostly in the form of cookies. Because we respect your right to privacy, you can choose not to allow some types of cookies. Click on the different category headings to find out more and change our default settings. However, blocking some types of cookies may impact your experience of the site and the services we are able to offer.

These cookies are necessary for the website to function and enable essential services and functonality, including identify verification, service continuity and site security. Opt out is not availabe.

Essential Session management cookies for logged in users
  • wordpress_test_cookie
  • wordpress_logged_in_
  • wordpress_sec

For perfomance reasons we use Cloudflare as a CDN network. This saves a cookie "__cfduid" to apply security settings on a per-client basis. This cookie is strictly necessary for Cloudflare's security features and cannot be turned off.
  • __cfduid

Used by Spamshield to stop spam signups
  • _wpss_h_
  • _wpss_p_

NewsWire Service
  • BIGipServerwidget2_www_http

Decline all Services
Accept all Services