{"id":216170,"date":"2017-06-20T09:06:57","date_gmt":"2017-06-20T07:06:57","guid":{"rendered":"http:\/\/mybroadband.co.za\/news\/?p=216170"},"modified":"2017-06-20T09:08:16","modified_gmt":"2017-06-20T07:08:16","slug":"google-tensor2tensor-to-speed-up-deep-learning-research","status":"publish","type":"post","link":"https:\/\/mybroadband.co.za\/news\/software\/216170-google-tensor2tensor-to-speed-up-deep-learning-research.html","title":{"rendered":"Google Tensor2Tensor to speed up deep learning research"},"content":{"rendered":"<p>Google has launched Tensor2Tensor (T2T), a library that helps researchers train deep learning models in its TensorFlow framework.<\/p>\n<p>&#8220;T2T facilitates the creation of state-of-the-art models for a variety of ML applications, such as translation, parsing, image captioning, and more,&#8221; said Google.<\/p>\n<p>&#8220;This release also includes a library of datasets and models, including the best models from a few recent papers to help kick-start your own DL research.&#8221;<\/p>\n<p>The models available include those from the following papers:<\/p>\n<ul>\n<li>Attention Is All You Need<\/li>\n<li>Depthwise Separable Convolutions for Neural Machine Translation<\/li>\n<li>One Model to Learn Them All<\/li>\n<\/ul>\n<p>Google provided the following results for a standard WMT English-German translation task, comparing the Tensor2Tensor models with previous state-of-the-art models.<\/p>\n<p>Its Transformer and SliceNet models achieved higher BLEU scores than GNMT and GNMT with Mixture of Experts.<\/p>\n<div class=\"mybb_table\">\n<div class=\"table-responsive\"><table class=\"table\" border=\"0\" width=\"100%\" cellpadding=\"7\">\n<thead>\n<tr>\n<td bgcolor=\"#0A00C8\" width=\"32%\">\n<div><span style=\"color: #ffffff;\"><b>Translation Model<\/b><\/span><\/div>\n<\/td>\n<td bgcolor=\"#0A00C8\" width=\"34%\">\n<div style=\"text-align: left;\"><span style=\"color: #ffffff;\"><b>Training time<\/b><\/span><\/div>\n<\/td>\n<td bgcolor=\"#0A00C8\" width=\"34%\">\n<div style=\"text-align: left;\"><span 
style=\"color: #ffffff;\"><b>BLEU (difference from baseline)<\/b><\/span><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td><a href=\"https:\/\/arxiv.org\/abs\/1706.03762\">Transformer<\/a> (T2T)<\/td>\n<td>\n<div style=\"text-align: left;\">3 days on 8 GPUs<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><b>28.4 (+7.8)<\/b><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td><a href=\"https:\/\/arxiv.org\/abs\/1706.03059\">SliceNet<\/a> (T2T)<\/td>\n<td>\n<div style=\"text-align: left;\">6 days on 32 GPUs<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><strong>26.1 (+5.5)<\/strong><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<div style=\"text-align: left;\"><a href=\"https:\/\/arxiv.org\/abs\/1701.06538\">GNMT + Mixture of Experts <\/a><\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\">1 day on 64 GPUs<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><strong>26.0 (+5.4)<\/strong><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: left;\"><a href=\"https:\/\/arxiv.org\/abs\/1705.03122\">ConvS2S<\/a><\/td>\n<td style=\"text-align: left;\">\n<div>18 days on 1 GPU<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><strong>25.1 (+4.5)<\/strong><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: left;\"><a href=\"https:\/\/research.googleblog.com\/2016\/09\/a-neural-network-for-machine.html\">GNMT<\/a><\/td>\n<td style=\"text-align: left;\">\n<div>1 day on 96 GPUs<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><strong>24.6 (+4.0)<\/strong><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td>\n<div style=\"text-align: left;\"><a href=\"https:\/\/arxiv.org\/abs\/1610.10099\">ByteNet<\/a><\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\">8 days on 32 GPUs<\/div>\n<\/td>\n<td>\n<div style=\"text-align: left;\"><strong>23.8 (+3.2)<\/strong><\/div>\n<\/td>\n<\/tr>\n<tr>\n<td><a href=\"http:\/\/matrix.statmt.org\/matrix\/systems_list\/1745\">MOSES<\/a> (phrase-based baseline)<\/td>\n<td>\n<div style=\"text-align: left;\">N\/A<\/div>\n<\/td>\n<td>\n<div><strong>20.6 
(+0.0)<\/strong><\/div>\n<\/td>\n<\/tr>\n<\/thead>\n<\/table><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Google has launched Tensor2Tensor, a library that will help researchers train deep learning models.<\/p>\n","protected":false},"author":23,"featured_media":145583,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[16],"tags":[35793,167,22845,43982],"class_list":["post-216170","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-software","tag-artificial-intelligence-ai","tag-google","tag-machine-learning","tag-tensor2tensor"],"_links":{"self":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/216170"}],"collection":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/users\/23"}],"replies":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/comments?post=216170"}],"version-history":[{"count":2,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/216170\/revisions"}],"predecessor-version":[{"id":216186,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/216170\/revisions\/216186"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/media\/145583"}],"wp:attachment":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/media?parent=216170"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/categories?post=216170"},{"t
axonomy":"post_tag","embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/tags?post=216170"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}