Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
labmlai
GitHub Repository: labmlai/annotated_deep_learning_paper_implementations
Path: blob/master/translate_cache/neox/checkpoint.zh.json
4923 views
1
{
2
"<h1>GPT-NeoX Checkpoints</h1>\n": "<h1>GPT-neox \u68c0\u67e5\u70b9</h1>\n",
3
"<h2>Download all checkpoint files</h2>\n": "<h2>\u4e0b\u8f7d\u6240\u6709\u68c0\u67e5\u70b9\u6587\u4ef6</h2>\n",
4
"<h3>Get files to download</h3>\n<ul><p><em>Returns</em> a list of files to be downloaded</p></ul>\n": "<h3>\u83b7\u53d6\u8981\u4e0b\u8f7d\u7684\u6587\u4ef6</h3>\n<ul><p><em>\u8fd4\u56de</em>\u8981\u4e0b\u8f7d\u7684\u6587\u4ef6\u5217\u8868</p></ul>\n",
5
"<h3>Load a pair of checkpoint files</h3>\n<ul><li><span translate=no>_^_0_^_</span> pair of files to load </li>\n<p><em>Returns</em> the loaded parameter tensors</p></ul>\n": "<h3>\u52a0\u8f7d\u4e00\u5bf9\u68c0\u67e5\u70b9\u6587\u4ef6</h3>\n<ul><li><span translate=no>_^_0_^_</span>\u4e00\u5bf9\u8981\u52a0\u8f7d\u7684\u6587\u4ef6</li>\n<p><em>\u8fd4\u56de</em>\u52a0\u8f7d\u7684\u53c2\u6570\u5f20\u91cf</p></ul>\n",
6
"<h3>Load a parameter by merging the partitions along first dimension</h3>\n<ul><li><span translate=no>_^_0_^_</span> is the parameter </li>\n<li><span translate=no>_^_1_^_</span> is the name of the parameter </li>\n<li><span translate=no>_^_2_^_</span> first partition dictionary </li>\n<li><span translate=no>_^_3_^_</span> second partition dictionary</li></ul>\n": "<h3>\u901a\u8fc7\u5408\u5e76\u6cbf\u7b2c\u4e00\u7ef4\u5ea6\u7684\u5206\u533a\u6765\u52a0\u8f7d\u53c2\u6570</h3>\n<ul><li><span translate=no>_^_0_^_</span>\u662f\u53c2\u6570</li>\n<li><span translate=no>_^_1_^_</span>\u662f\u53c2\u6570\u7684\u540d\u79f0</li>\n<li><span translate=no>_^_2_^_</span>\u7b2c\u4e00\u4e2a\u5206\u533a\u5b57\u5178</li>\n<li><span translate=no>_^_3_^_</span>\u7b2c\u4e8c\u4e2a\u5206\u533a\u5b57\u5178</li></ul>\n",
7
"<h3>Load a parameter by merging the partitions along second dimension</h3>\n<ul><li><span translate=no>_^_0_^_</span> is the parameter </li>\n<li><span translate=no>_^_1_^_</span> is the name of the parameter </li>\n<li><span translate=no>_^_2_^_</span> first partition dictionary </li>\n<li><span translate=no>_^_3_^_</span> second partition dictionary</li></ul>\n": "<h3>\u901a\u8fc7\u5408\u5e76\u7b2c\u4e8c\u7ef4\u5ea6\u7684\u5206\u533a\u6765\u52a0\u8f7d\u53c2\u6570</h3>\n<ul><li><span translate=no>_^_0_^_</span>\u662f\u53c2\u6570</li>\n<li><span translate=no>_^_1_^_</span>\u662f\u53c2\u6570\u7684\u540d\u79f0</li>\n<li><span translate=no>_^_2_^_</span>\u7b2c\u4e00\u4e2a\u5206\u533a\u5b57\u5178</li>\n<li><span translate=no>_^_3_^_</span>\u7b2c\u4e8c\u4e2a\u5206\u533a\u5b57\u5178</li></ul>\n",
8
"<h3>Load an un-partitioned parameter</h3>\n<p>This does a sanity check to make use both partitions are the same</p>\n<ul><li><span translate=no>_^_0_^_</span> is the parameter </li>\n<li><span translate=no>_^_1_^_</span> is the name of the parameter </li>\n<li><span translate=no>_^_2_^_</span> first partition dictionary </li>\n<li><span translate=no>_^_3_^_</span> second partition dictionary</li></ul>\n": "<h3>\u52a0\u8f7d\u672a\u5206\u533a\u7684\u53c2\u6570</h3>\n<p>\u8fd9\u4f1a\u8fdb\u884c\u5065\u5168\u6027\u68c0\u67e5\uff0c\u4ee5\u4f7f\u7528\u4e24\u4e2a\u5206\u533a\u662f\u76f8\u540c\u7684</p>\n<ul><li><span translate=no>_^_0_^_</span>\u662f\u53c2\u6570</li>\n<li><span translate=no>_^_1_^_</span>\u662f\u53c2\u6570\u7684\u540d\u79f0</li>\n<li><span translate=no>_^_2_^_</span>\u7b2c\u4e00\u4e2a\u5206\u533a\u5b57\u5178</li>\n<li><span translate=no>_^_3_^_</span>\u7b2c\u4e8c\u4e2a\u5206\u533a\u5b57\u5178</li></ul>\n",
9
"<h3>Load biases that are partitioned which gets added on reduce</h3>\n<ul><li><span translate=no>_^_0_^_</span> is the parameter </li>\n<li><span translate=no>_^_1_^_</span> is the name of the parameter </li>\n<li><span translate=no>_^_2_^_</span> first partition dictionary </li>\n<li><span translate=no>_^_3_^_</span> second partition dictionary</li></ul>\n": "<h3>\u5206\u533a\u7684\u8d1f\u8f7d\u504f\u5dee\u5728 reduce \u65f6\u88ab\u6dfb\u52a0</h3>\n<ul><li><span translate=no>_^_0_^_</span>\u662f\u53c2\u6570</li>\n<li><span translate=no>_^_1_^_</span>\u662f\u53c2\u6570\u7684\u540d\u79f0</li>\n<li><span translate=no>_^_2_^_</span>\u7b2c\u4e00\u4e2a\u5206\u533a\u5b57\u5178</li>\n<li><span translate=no>_^_3_^_</span>\u7b2c\u4e8c\u4e2a\u5206\u533a\u5b57\u5178</li></ul>\n",
10
"<p> </p>\n": "<p></p>\n",
11
"<p>Download </p>\n": "<p>\u4e0b\u8f7d</p>\n",
12
"<p>Download path </p>\n": "<p>\u4e0b\u8f7d\u8def\u5f84</p>\n",
13
"<p>Embedding layer </p>\n": "<p>\u5d4c\u5165\u5c42</p>\n",
14
"<p>Empty states (not used) </p>\n": "<p>\u7a7a\u72b6\u6001\uff08\u672a\u4f7f\u7528\uff09</p>\n",
15
"<p>Final normalization layer and readout layer </p>\n": "<p>\u6700\u7ec8\u5f52\u4e00\u5316\u5c42\u548c\u8bfb\u51fa\u5c42</p>\n",
16
"<p>Get files to download </p>\n": "<p>\u83b7\u53d6\u8981\u4e0b\u8f7d\u7684\u6587\u4ef6</p>\n",
17
"<p>Iterate </p>\n": "<p>\u8fed\u4ee3</p>\n",
18
"<p>Layer checkpoints </p>\n": "<p>\u56fe\u5c42\u68c0\u67e5\u70b9</p>\n",
19
"<p>Log </p>\n": "<p>\u65e5\u5fd7</p>\n",
20
"<p>Parent url </p>\n": "<p>\u5bb6\u957f\u7f51\u5740</p>\n",
21
"<p>Transformer layers </p>\n": "<p>\u53d8\u538b\u5668\u5c42</p>\n",
22
"<p>Vocabulary and configs </p>\n": "<p>\u8bcd\u6c47\u548c\u914d\u7f6e</p>\n",
23
"Code to download checkpoints and helpers to load them.": "\u4e0b\u8f7d\u68c0\u67e5\u70b9\u7684\u4ee3\u7801\u548c\u52a0\u8f7d\u5b83\u4eec\u7684\u52a9\u624b\u3002",
24
"GPT-NeoX Checkpoints": "GPT-neox \u68c0\u67e5\u70b9"
25
}
26