Intel's new Data Center GPU Flex Series now supports TensorFlow acceleration

Intel adds its new Data Center GPU Flex Series to Pluggable Devices, something Intel is calling Intel Extension for TensorFlow, available right now.

Intel's new Data Center GPU Flex Series now supports TensorFlow acceleration
Published Oct 30, 2022 10:11 PM CDT   |   Updated Sun, Nov 20 2022 8:17 PM CST
2 minutes & 13 seconds read time

Intel has just announced that its new Data Center GPU Flex Series cards have been added to their family of PluggableDevices, something that is called Intel Extension for TensorFlow.

Intel Extension for TensorFlow is an open-source solution that runs TensorFlow applications on Intel AI hardware, allowing the high-performance deep learning extension for the TensorFlow PluggableDevice interface. It will allow Intel XPU (GPU, CPU, and more) devices to be readily accessible to TensorFlow developers.

Intel's new Data Center GPU Flex Series now supports TensorFlow acceleration 02

With the new Intel Extension, the company says that developers can train and infer TensorFlow models on Intel AI hardware with "zero code change". Intel Extension for TensorFlow is built on the foundations of the oneAPI software components, with most of the performance-critical graphs and operators being highly optimized by Intel oneAPI Deep Neural Network (oneDNN) which is an open-source, cross-platform performance library for Deep Learning applications.

  • Device management: The Intel and Google developers implemented TensorFlow's StreamExecutor C API utilizing C++ with SYCL and some exceptional support provided by the oneAPI SYCL runtime (DPC++ LLVM SYCL project). StreamExecutor C API defines stream, device, context, memory structure, and related functions, all have trivial mappings to corresponding implementations in the SYCL runtime.
  • Op and kernel registration: TensorFlow's kernel and op registration C API allows device-specific kernel implementations and custom operations. To ensure sufficient model coverage, the development team matched TensorFlow native GPU device's op coverage, implementing most performance-critical ops by calling highly-optimized deep learning primitives from the oneAPI Deep Neural Network Library (oneDNN). Other ops are implemented with SYCL kernels or the Eigen math library to C++ with SYCL so that it can generate programs to implement device ops.
  • Graph optimization: The Flex Series GPU plug-in optimizes TensorFlow graphs in Grappler through Graph C API and offloads performance-critical graph partitions to the oneDNN library through oneDNN Graph API. It receives a protobuf-serialized graph from TensorFlow, deserializes the graph, identifies and replaces appropriate subgraphs with a custom op, and sends the graph back to TensorFlow. When TensorFlow executes the processed graph, the custom ops are mapped to oneDNN's optimized implementation for their associated oneDNN Graph partitions.
  • The Profiler C API lets PluggableDevices communicate profiling data in TensorFlow's native profiling format. The Flex Series GPU plug-in takes a serialized XSpace object from TensorFlow, fills the object with runtime data obtained through the oneAPI Level Zero low-level device interface, and returns the object to TensorFlow. Users can display the execution profile of specific ops on The Flex Series GPU with TensorFlow's profiling tools like TensorBoard.

Inside, the Intel Data Center GPU Flex Series 170 is a full-height, single-wide passively cooled card with a 150W TDP. It features a single GPU with 32 Xe Cores, up to 16 TFLOPs of FP32 compute performance, 16GB of GDDR6 memory, and a PCIe 4.0 interface.

Intel's new Data Center GPU Flex Series now supports TensorFlow acceleration 01

Intel also has the Data Center GPU Flex Series 140, a half-height, single-wide passively cooled card with a 75W TDP.

This card has 2 x GPUs with 16 Xe Cores in total (8 x Xe Cores per GPU) which is less than the single-GPU Flex Series 170, with only 8 TFLOPs of FP32 compute performance, and 12GB of GDDR6 memory in total (6GB GDDR6 memory per GPU).

Buy at Amazon

Intel Arc A770 16GB

TodayYesterday7 days ago30 days ago
* Prices last scanned on 12/1/2022 at 3:17 am CST - prices may not be accurate, click links above for the latest price. We may earn an affiliate commission.

Anthony joined the TweakTown team in 2010 and has since reviewed 100s of graphics cards. Anthony is a long time PC enthusiast with a passion of hate for games built around consoles. FPS gaming since the pre-Quake days, where you were insulted if you used a mouse to aim, he has been addicted to gaming and hardware ever since. Working in IT retail for 10 years gave him great experience with custom-built PCs. His addiction to GPU tech is unwavering.

Newsletter Subscription

    Related Tags

    Newsletter Subscription
    Latest News
    View More News
    Latest Reviews
    View More Reviews
    Latest Articles
    View More Articles