{"id":8530,"date":"2018-09-18T08:54:29","date_gmt":"2018-09-18T06:54:29","guid":{"rendered":"http:\/\/blog.netspark.de\/?p=8530"},"modified":"2018-09-14T09:30:33","modified_gmt":"2018-09-14T07:30:33","slug":"nvidia-tesla-m4-p4-and-t4-revealed","status":"publish","type":"post","link":"https:\/\/blog.netspark.de\/?p=8530","title":{"rendered":"NVidia Tesla M4, P4 and T4 revealed!"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-Post-Thumb wp-image-4661\" src=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2013\/02\/nvidia-logo-png-i2-64x64.png\" alt=\"\" width=\"64\" height=\"64\" srcset=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2013\/02\/nvidia-logo-png-i2-64x64.png 64w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2013\/02\/nvidia-logo-png-i2-150x150.png 150w\" sizes=\"auto, (max-width: 64px) 100vw, 64px\" \/>It seems as if NVidia takes GPU computing serious. The<br \/>\nrecently presented Tesla T4 card, a successor to M4 and<br \/>\nP4 (some say they&#8217;re graphics cards but that&#8217;s only half<br \/>\nthe truth) seem to offer tremendous processing power!<\/p>\n<p><!--more-->Especially for cryptocoin mining this GPU-based card might be interesting as they offer immense calculation power with only 75W power demand. This means that the cards don&#8217;t rely on a separate power connector but can be powered with the PCIe-slot alone.<\/p>\n<h3>NVIDIA Tesla T4 GPU Specifications<\/h3>\n<table style=\"width: 640px; height: 330px;\">\n<thead>\n<tr style=\"background-color: #c9c9c9;\">\n<th style=\"width: 193.117px; height: 22px;\"><span style=\"color: #000000;\">Product Name<\/span><\/th>\n<th style=\"width: 139.733px; height: 22px;\"><span style=\"color: #000000;\">Tesla M4<\/span><\/th>\n<th style=\"width: 139.583px; height: 22px;\"><span style=\"color: #000000;\">Tesla P4<\/span><\/th>\n<th style=\"width: 139.567px; height: 22px;\"><span style=\"color: #000000;\">Tesla T4<\/span><\/th>\n<\/tr>\n<\/thead>\n<tbody class=\"row-hover\">\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"width: 193.117px; height: 22px;\"><span style=\"color: #ffffff;\"><strong>GPU Architecture<\/strong><\/span><\/td>\n<td style=\"width: 139.733px; height: 22px;\"><span style=\"color: #ffffff;\">Maxwell GM206<\/span><\/td>\n<td style=\"width: 139.583px; height: 22px;\"><span style=\"color: #ffffff;\">Pascal GP104<\/span><\/td>\n<td style=\"width: 139.567px; height: 22px;\"><span style=\"color: #ffffff;\">Turing TU104<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>GPU Process<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">28nm<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">16nm FinFET<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">12nm FinFET<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>CUDA Cores<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">1280 CUDA<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">2560 CUDA<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">2560 CUDA<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>Clock Speed<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">1072 MHz<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">1063 MHz<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">1582 MHz<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>FP32 Compute<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">2.20 TFLOPs<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">5.50 TFLOPs<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">8.1 TFLOPs<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>FP16 Compute<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">N\/A<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">11 TFLOPs<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">65 TFLOPs<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>INT8 Compute<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">N\/A<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">22 DLTOPs<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">130 DLTOPs<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>INT4 Compute<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">N\/A<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">N\/A<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">260 DLTOPs<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>VRAM<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">4 GB GDDR5<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">8 GB GDDR5<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">16 GB GDDR6<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>Memory Clock<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">5.5 GHz<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">6.0 GHz<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">10 GHz<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>Memory Bus<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">128-bit<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">256-bit<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">256-bit<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>Memory Bandwidth<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">88.0 GB\/s<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">192.0 GB\/s<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">320 GB\/s+<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #161616;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>TDP<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">~75W<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">~75W<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">75W<\/span><\/td>\n<\/tr>\n<tr style=\"height: 22px; background-color: #333333;\">\n<td style=\"height: 22px; width: 193.117px;\"><span style=\"color: #ffffff;\"><strong>Launch<\/strong><\/span><\/td>\n<td style=\"height: 22px; width: 139.733px;\"><span style=\"color: #ffffff;\">2015<\/span><\/td>\n<td style=\"height: 22px; width: 139.583px;\"><span style=\"color: #ffffff;\">2016<\/span><\/td>\n<td style=\"height: 22px; width: 139.567px;\"><span style=\"color: #ffffff;\">Q4 2018<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Impressive. However there&#8217;s no word spoken about the price, the slides may promise a lot of bang for the buck and who knows&#8230; maybe it&#8217;ll bring back &#8220;classic&#8221; GPU cards into more humane price regions as the current RTX models start at 900 US$ and end up to 1800US$. And the 1800 US$ model isn&#8217;t a RTX Titan or whatever the card would be named. It&#8217;s just a RTX 2080Ti&#8230;<\/p>\n<p>The Turing based NVIDIA Tesla T4 graphics card is aimed at inference acceleration markets. It is designed to give deep learning performance a massive boost over its predecessors and is also going to deliver immense performance for AI video applications. NVIDIA\u2019s own estimate put the graphics card at twice as fast in video processing, allowing users to decode up to 38(!) full-HD video streams at the same time which just wasn\u2019t possible on the previous generation.<\/p>\n<div class=\"post_quote\">The NVIDIA Tesla T4 GPU is the world\u2019s most advanced inference accelerator. Powered by NVIDIA Turing Tensor Cores, T4 brings revolutionary multi-precision inference performance to accelerate the diverse applications of modern AI. Packaged in an energy-efficient 75-watt, small PCIe form factor, T4 is optimized for scale-out servers and is purpose-built to deliver state-of-the-art inference in real time.<\/p>\n<p style=\"text-align: left;\">As the volume of online videos continues to grow exponentially, demand for solutions to efficiently search and gain insights from video continues to grow as well. Tesla T4 delivers breakthrough performance for AI video applications, with dedicated hardware transcoding engines that bring twice the decoding performance of prior-generation GPUs. T4 can decode up to 38 full-HD video streams, making it easy to integrate scalable deep learning into video pipelines to deliver innovative, smart video services.<\/p>\n<p style=\"text-align: right;\">Jensen Huang, CEO of NVIDIA<\/p>\n<\/div>\n<p style=\"text-align: center;\"><a href=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-8531\" src=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement-640x407.jpg\" alt=\"\" width=\"640\" height=\"407\" srcset=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement-640x407.jpg 640w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement-768x488.jpg 768w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement-500x318.jpg 500w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Nvidia-Tesla-T4-announcement.jpg 1574w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/a>Jensen Huang, at the GTC 2018 Japan keynote presenting the Tesla T4<\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-8532\" src=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card-640x320.jpg\" alt=\"\" width=\"640\" height=\"320\" srcset=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card-640x320.jpg 640w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card-768x384.jpg 768w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card-500x250.jpg 500w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/NVIDIA-Tesla-T4-Graphics-Card.jpg 1024w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/a>Product shot of the Tesla T4<\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-8533\" src=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance-640x244.png\" alt=\"\" width=\"640\" height=\"244\" srcset=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance-640x244.png 640w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance-768x293.png 768w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance-500x191.png 500w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Tesla_T4_Performance.png 1315w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/a>Performance chart<\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-medium wp-image-8534\" src=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4-640x283.png\" alt=\"\" width=\"640\" height=\"283\" srcset=\"https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4-640x283.png 640w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4-768x340.png 768w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4-500x221.png 500w, https:\/\/blog.netspark.de\/wp-content\/uploads\/2018\/09\/Turing_Tensor_Cores_Tesla_T4.png 990w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/a>Turing Tensor Cores scheme<\/p>\n<p>There&#8217;s no price tag available yet nor when it&#8217;ll become in reasonable numbers. But if they get available by then, maybe we will see a sligt change in pricing for the RTX GPU series. I would greatly anticipate a price slide downwards!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>It seems as if NVidia takes GPU computing serious. The recently presented Tesla T4 card, a successor to M4 and P4 (some say they&#8217;re graphics cards but that&#8217;s only half the truth) seem to offer tremendous processing power!<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[55,3,2949],"tags":[3181,2975,3180,347,3179,3178,3182],"class_list":["post-8530","post","type-post","status-publish","format-standard","hentry","category-computer-2","category-news","category-technology","tag-ai-acceleration","tag-cryptomining","tag-deep-learning","tag-nvidia","tag-tensor-core","tag-tesla-t4","tag-turing"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blog.netspark.de\/index.php?rest_route=\/wp\/v2\/posts\/8530","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.netspark.de\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.netspark.de\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.netspark.de\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.netspark.de\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=8530"}],"version-history":[{"count":0,"href":"https:\/\/blog.netspark.de\/index.php?rest_route=\/wp\/v2\/posts\/8530\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.netspark.de\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=8530"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.netspark.de\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=8530"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.netspark.de\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=8530"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}