{"id":125290,"date":"2025-10-10T11:42:03","date_gmt":"2025-10-10T03:42:03","guid":{"rendered":"https:\/\/www.ioiotimes.com\/?p=125290"},"modified":"2025-10-10T11:42:04","modified_gmt":"2025-10-10T03:42:04","slug":"nvidia-blackwell%e5%9c%a8%e5%85%a8%e6%96%b0inferencemax%e5%9f%ba%e6%ba%96%e6%b8%ac%e8%a9%a6%e4%b8%ad%e6%a8%b9%e7%ab%8b%e6%96%b0%e6%a8%99%e7%ab%bf%ef%bc%8c%e5%b1%95%e7%8f%be%e7%84%a1%e5%8f%af%e5%8c%b9","status":"publish","type":"post","link":"https:\/\/www.ioiotimes.com\/?p=125290","title":{"rendered":"NVIDIA Blackwell Sets a New Standard in the New InferenceMAX Benchmarks, Delivering Unmatched Performance and Efficiency"},"content":{"rendered":"\n<p>\u25cf NVIDIA Blackwell sweeps the new SemiAnalysis InferenceMAX v1 benchmarks, delivering the highest performance and the best overall efficiency.<br>\u25cf InferenceMAX v1 is the first independent benchmark to measure total cost of compute across diverse models and real-world scenarios.<br>\u25cf Best return on investment: NVIDIA GB200 NVL72 delivers unmatched AI factory economics. A $5 million investment generates $75 million in DSR1 token revenue, a 15x return on investment.<br>\u25cf Lowest total cost of ownership: NVIDIA
B200 software optimizations achieve two cents per million tokens on gpt-oss, lowering cost per token 5x within two months.<br>\u25cf Best throughput and interactivity: NVIDIA B200 with the latest NVIDIA TensorRT-LLM stack reaches 60,000 tokens per second per GPU and 1,000 tokens per second per user on gpt-oss.<\/p>\n\n\n\n<p><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"680\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia01.jpg\" alt=\"\" class=\"wp-image-125292\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia01.jpg 1280w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia01-300x159.jpg 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia01-1024x544.jpg 1024w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia01-768x408.jpg 768w\" sizes=\"(max-width: 1280px) 100vw, 1280px\" \/><\/figure><\/div>\n\n\n<p><\/p>\n\n\n\n<p>As artificial intelligence (AI) shifts from one-shot answers to complex reasoning, the demand for <a href=\"https:\/\/www.nvidia.com\/en-us\/glossary\/ai-inference\/\" target=\"_blank\" rel=\"noreferrer noopener\">inference<\/a>, and the economics behind it, is growing rapidly.<\/p>\n\n\n\n<p>The new, independent InferenceMAX v1
is the first benchmark to measure total cost of compute across real-world scenarios. The results show that the <a href=\"https:\/\/developer.nvidia.com\/blog\/nvidia-blackwell-leads-on-new-semianalysis-inferencemax-benchmarks\/\" target=\"_blank\" rel=\"noreferrer noopener\">NVIDIA Blackwell platform swept the field<\/a>, delivering unmatched performance and the best overall efficiency for <a href=\"https:\/\/www.nvidia.com\/zh-tw\/solutions\/ai-factories\/\" target=\"_blank\" rel=\"noreferrer noopener\">AI factories<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1489\" height=\"850\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia02.png\" alt=\"\" class=\"wp-image-125314\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia02.png 1489w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia02-300x171.png 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia02-1024x585.png 1024w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia02-768x438.png 768w\" sizes=\"(max-width: 1489px) 100vw, 1489px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>A $5 million investment in an NVIDIA GB200 NVL72 system can generate $75 million in token revenue, a 15x return on investment<\/strong> &#8211; the new economics of inference.<\/p>\n\n\n\n<p>NVIDIA's vice president of hyperscale and high-performance computing, Ian
Buck, said: \u201cInference is where AI delivers value every day. These results show that NVIDIA's full-stack approach gives customers the performance and efficiency they need to deploy AI at scale.\u201d<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>InferenceMAX v1 Arrives<\/strong><\/h3>\n\n\n\n<p>InferenceMAX v1, a new benchmark released by SemiAnalysis on Monday, is the latest to highlight Blackwell's inference leadership. It runs popular models across leading platforms, measures performance for a wide range of use cases and publishes results that anyone can verify.<\/p>\n\n\n\n<p>Why do benchmarks like this matter?<\/p>\n\n\n\n<p>Because modern AI is not just about raw speed; it is also about efficiency and economics at scale. As models shift from one-shot replies to multistep reasoning and tool use, they generate far more <a href=\"https:\/\/blogs.nvidia.com\/blog\/ai-tokens-explained\/\" target=\"_blank\" rel=\"noreferrer noopener\">tokens<\/a> per query, significantly increasing compute demands.<\/p>\n\n\n\n<p>NVIDIA's collaborations with OpenAI (<a href=\"https:\/\/build.nvidia.com\/openai\/gpt-oss-120b\" target=\"_blank\" rel=\"noreferrer noopener\">gpt-oss 120B<\/a>), Meta (<a href=\"https:\/\/build.nvidia.com\/meta\/llama-3_3-70b-instruct\" target=\"_blank\"
rel=\"noreferrer noopener\">Llama 3.3 70B<\/a>) and DeepSeek AI (<a href=\"https:\/\/build.nvidia.com\/deepseek-ai\/deepseek-r1\" target=\"_blank\" rel=\"noreferrer noopener\">DeepSeek R1<\/a>) in the open-source community show how community-driven models are advancing the state of the art in reasoning and efficiency.<\/p>\n\n\n\n<p>By partnering with these leading model builders and the open-source community, NVIDIA ensures the latest models are optimized for the world's largest AI inference infrastructure. These efforts reflect NVIDIA's commitment to an open ecosystem, where shared innovation accelerates progress for everyone.<\/p>\n\n\n\n<p>Deep collaborations with the FlashInfer, SGLang and vLLM communities enable codeveloped kernel and runtime enhancements that power these models at scale.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Software Optimizations Deliver Continued Performance Gains<\/strong><\/h3>\n\n\n\n<p>NVIDIA continuously improves performance through hardware and software codesign. gpt-oss-120B, running with the <a href=\"https:\/\/docs.nvidia.com\/tensorrt-llm\/index.html\" target=\"_blank\" rel=\"noreferrer noopener\">NVIDIA TensorRT-LLM<\/a> library on an NVIDIA DGX Blackwell
B200 system, delivered market-leading performance from the start, but NVIDIA's teams and the community have since significantly optimized TensorRT-LLM for open-source large language models.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"720\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia03.png\" alt=\"\" class=\"wp-image-125319\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia03.png 1280w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia03-300x169.png 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia03-1024x576.png 1024w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia03-768x432.png 768w\" sizes=\"(max-width: 1280px) 100vw, 1280px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>The release of <a href=\"https:\/\/developer.nvidia.com\/tensorrt-llm\" target=\"_blank\" rel=\"noreferrer noopener\">TensorRT-LLM v1.0<\/a> is a major breakthrough in making large AI models faster and more responsive.<\/p>\n\n\n\n<p>Through advanced parallelization techniques, it uses the B200 system and <a href=\"https:\/\/www.nvidia.com\/zh-tw\/data-center\/nvlink\/\" target=\"_blank\" rel=\"noreferrer noopener\">NVIDIA NVLink Switch<\/a>'s 1,800
GB\/s of bidirectional bandwidth to dramatically improve the performance of the gpt-oss-120B model.<\/p>\n\n\n\n<p>The innovation doesn't end there. The newly released gpt-oss-120b-Eagle3-v2 model introduces <a href=\"https:\/\/developer.nvidia.com\/blog\/an-introduction-to-speculative-decoding-for-reducing-latency-in-ai-inference\/\" target=\"_blank\" rel=\"noreferrer noopener\">speculative decoding<\/a>, a clever method that predicts multiple tokens at a time. This reduces latency and speeds up responses, tripling per-user throughput to 100 tokens per second (TPS\/user) and raising per-GPU speed from 6,000 to 30,000 tokens.<\/p>\n\n\n\n<p>For dense AI models such as Llama 3.3 70B, which demand significant compute because their large parameter counts are all used at once during inference, NVIDIA Blackwell B200 sets a new performance standard in the InferenceMAX v1 benchmarks.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"960\" height=\"540\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia04.png\" alt=\"\" class=\"wp-image-125320\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia04.png 960w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia04-300x169.png 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia04-768x432.png 768w\" sizes=\"(max-width:
960px) 100vw, 960px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>At 10,000 TPS per GPU and an interactivity of 50 TPS per user, Blackwell delivers 4x higher per-GPU throughput than the NVIDIA H200.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Performance Efficiency Drives Value<\/strong><\/h3>\n\n\n\n<p>Metrics such as tokens per watt, cost per million tokens and TPS per user matter as much as throughput. For power-limited AI factories, Blackwell delivers 10x more throughput per megawatt than the previous generation, which translates into higher token revenue.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1920\" height=\"1080\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05.png\" alt=\"\" class=\"wp-image-125322\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05.png 1920w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05-300x169.png 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05-1024x576.png 1024w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05-768x432.png 768w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia05-1536x864.png 1536w\" sizes=\"(max-width: 1920px) 100vw, 1920px\"
\/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>Cost per token is key to evaluating AI model efficiency and directly affects operating expenses. The NVIDIA Blackwell architecture lowers cost per million tokens by 15x compared with the previous generation, delivering substantial savings and enabling broader AI deployment.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1491\" height=\"839\" src=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia06.png\" alt=\"\" class=\"wp-image-125323\" title=\"\" srcset=\"https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia06.png 1491w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia06-300x169.png 300w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia06-1024x576.png 1024w, https:\/\/www.ioiotimes.com\/wordpress\/wp-content\/uploads\/2025\/10\/20251010-nvidia06-768x432.png 768w\" sizes=\"(max-width: 1491px) 100vw, 1491px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Multidimensional Performance<\/strong><\/h3>\n\n\n\n<p>InferenceMAX uses the Pareto frontier to map and compare performance, showing the best tradeoffs between factors such as data center throughput and responsiveness.<\/p>\n\n\n\n<p>But this is more than a chart. It shows how NVIDIA
Blackwell balances cost, energy efficiency, throughput and responsiveness to deliver the highest ROI across real-world workloads.<\/p>\n\n\n\n<p>Systems optimized for a single scenario may peak in isolated tests, but their economics do not scale. Blackwell's full-stack design delivers the efficiency and value that matter in production.<\/p>\n\n\n\n<p>For a deeper look at how these curves are built, and what they mean for total cost of ownership and service-level agreement planning, see the <a href=\"https:\/\/developer.nvidia.com\/blog\/nvidia-blackwell-leads-on-new-semianalysis-inferencemax-benchmarks\/\" target=\"_blank\" rel=\"noreferrer noopener\">technical deep dive<\/a> for full charts and methodology.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>What Makes It Possible<\/strong><\/h3>\n\n\n\n<p>Blackwell's leadership comes from extreme hardware-software codesign. It is a full-stack architecture built for speed, efficiency and scale:<\/p>\n\n\n\n<p>\u25cf <strong>Blackwell architecture features include<\/strong>:<\/p>\n\n\n\n<ul
class=\"wp-block-list\">\n<li><strong>NVFP4<\/strong> low-precision format, which improves efficiency without sacrificing accuracy<\/li>\n\n\n\n<li><strong>Fifth-generation NVIDIA NVLink<\/strong>, which connects 72 Blackwell GPUs so they act as one giant GPU<\/li>\n\n\n\n<li><strong>NVLink Switch<\/strong>, which enables high concurrency through advanced tensor, expert and data parallel attention algorithms<\/li>\n<\/ul>\n\n\n\n<p>\u25cf <strong>Annual hardware cadence<\/strong> plus continuous software optimization. NVIDIA has more than doubled Blackwell performance since launch through software alone<br>\u25cf <strong>NVIDIA TensorRT-LLM<\/strong>, <strong><a href=\"https:\/\/www.nvidia.com\/en-us\/ai\/dynamo\/\" target=\"_blank\" rel=\"noreferrer noopener\">NVIDIA Dynamo<\/a><\/strong>, <strong>SGLang<\/strong> and <strong>vLLM<\/strong>, open-source inference frameworks optimized for peak performance<br>\u25cf <strong>A massive ecosystem<\/strong>: millions of GPUs deployed, 7 million CUDA developers and contributions to more than 1,000 open-source projects<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h3
class=\"wp-block-heading\"><strong>The Bigger Picture<\/strong><\/h3>\n\n\n\n<p>AI is moving from pilots to the era of AI factories, infrastructure that turns data into tokens and decisions in real time.<\/p>\n\n\n\n<p>Open, regularly updated benchmarks help teams make the right platform choices across cost per token, latency service-level agreements and utilization under changing workloads.<\/p>\n\n\n\n<p><a href=\"https:\/\/blogs.nvidia.com\/blog\/think-smart-optimize-ai-factory-inference-performance\/\" target=\"_blank\" rel=\"noreferrer noopener\">NVIDIA's Think SMART framework helps enterprises navigate this shift<\/a>, explaining how NVIDIA's full-stack inference platform turns performance into real-world ROI, and results into revenue.<\/p>\n","protected":false},
"excerpt":{"rendered":"<p>\u25cf\u00a0\u00a0\u00a0NVIDIA Blackwe<\/p>\n","protected":false},"author":3,"featured_media":125292,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"rank_math_lock_modified_date":false,"footnotes":""},"categories":[13],"tags":[3988,580,15449,71],"class_list":["post-125290","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-blackwell","tag-focus","tag-inferencemax","tag-nvidia"],"_links":{"self":[{"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/posts\/125290"}],"collection":[{"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=125290"}],"version-history":[{"count":2,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/posts\/125290\/revisions"}],"predecessor-version":[{"id":125334,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/posts\/125290\/revisions\/125334"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=\/wp\/v2\/media\/125292"}],"wp:attachment":[{"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=125290"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=125290"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ioiotimes.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&p
ost=125290"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}