{"id":38339,"date":"2023-05-17T08:13:39","date_gmt":"2023-05-17T12:13:39","guid":{"rendered":"https:\/\/www.technewsday.com\/?p=38339"},"modified":"2023-05-18T09:35:46","modified_gmt":"2023-05-18T13:35:46","slug":"goltsev-discuss-cost-effective-ai-deployment-at-data-center-world-2023","status":"publish","type":"post","link":"https:\/\/technewsday.com\/staging\/goltsev-discuss-cost-effective-ai-deployment-at-data-center-world-2023\/","title":{"rendered":"Goltsev discuss cost effective AI deployment at Data Center World 2023"},"content":{"rendered":"<p data-ar-index=\"1\">At the Data Center World 2023 event in Austin, Texas, the high expenses of training huge language models were discussed. There, theMind&#8217;s Constantine Goltsev, an AI\/ML solutions business, stressed the expenses associated with training models like ChatGPT.<\/p>\n<p data-ar-index=\"2\">Goltsev stressed that training ChatGPT&#8217;s 175 billion parameters is extremely expensive, requiring 175 billion computations for each input and using a large amount of electricity. OpenAI&#8217;s future GPT-4 model conducted compute operations for months using 12,000 to 15,000 Nvidia A100 processors, which cost $10,000 apiece. These discoveries highlight the need for a more realistic approach to artificial intelligence deployment.<\/p>\n<p data-ar-index=\"3\">Goltsev advocated using smaller, open source models that can match or even outperform ChatGPT&#8217;s performance. Organizations may accomplish spectacular results by adopting the same fine-tuning approaches used to produce ChatGPT by using academic models or open source equivalents with parameters numbering in the billions (e.g., 6 billion or 3 billion). This method makes it possible to use AI technology at a reasonable cost.<\/p>\n<p data-ar-index=\"4\">Goltsev used Amazon Web Services (AWS) as an example to show how a major legal firm may use AWS to develop a semantic search engine. The company might achieve its goals for roughly $30 to $40 per hour by hiring a few big instances with around eight A100 cards, plenty of memory, and storage.<\/p>\n<p data-ar-index=\"5\">The sources for this piece include an <a href=\"https:\/\/www.datacenterknowledge.com\/artificial-intelligence\/dcw-23-deploying-generative-ai-without-gpus-or-supercomputers\" target=\"_blank\" rel=\"noopener\">article<\/a> in\u00a0DataCenterKnowledge.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>At the Data Center World 2023 event in Austin, Texas, the high expenses of training huge language models were discussed. There, theMind&#8217;s Constantine Goltsev, an AI\/ML solutions business, stressed the expenses associated with training models like ChatGPT. Goltsev stressed that training ChatGPT&#8217;s 175 billion parameters is extremely expensive, requiring 175 billion computations for each input [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[34],"tags":[525],"class_list":["post-38339","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","tag-ai"],"acf":[],"_links":{"self":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/38339","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/comments?post=38339"}],"version-history":[{"count":2,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/38339\/revisions"}],"predecessor-version":[{"id":38341,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/posts\/38339\/revisions\/38341"}],"wp:attachment":[{"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/media?parent=38339"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/categories?post=38339"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/technewsday.com\/staging\/wp-json\/wp\/v2\/tags?post=38339"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}