Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
labmlai
GitHub Repository: labmlai/annotated_deep_learning_paper_implementations
Path: blob/master/translate_cache/experiments/arithmetic_dataset.zh.json
4923 views
1
{
2
"<h2>Arithmetic Dataset</h2>\n<p>This creates arithmetic addition problems and solutions with workings. We&#x27;ve only implemented addition so far.</p>\n<p>It&#x27;s based on a character level tokenization.</p>\n": "<h2>\u7b97\u672f\u6570\u636e\u96c6</h2>\n<p>\u8fd9\u4f1a\u4ea7\u751f\u7b97\u672f\u52a0\u6cd5\u95ee\u9898\u548c\u8fd0\u4f5c\u89e3\u51b3\u65b9\u6848\u3002\u5230\u76ee\u524d\u4e3a\u6b62\uff0c\u6211\u4eec\u53ea\u5b9e\u65bd\u4e86\u52a0\u6cd5\u3002</p>\n<p>\u5b83\u57fa\u4e8e\u89d2\u8272\u7ea7\u522b\u7684\u6807\u8bb0\u5316\u3002</p>\n",
3
"<h2>Arithmetic Task Experiment Configurations</h2>\n": "<h2>\u7b97\u672f\u4efb\u52a1\u5b9e\u9a8c\u914d\u7f6e</h2>\n",
4
"<h3>Evaluation</h3>\n<p>We use the sampling function to evaluate the model on a set of problems</p>\n": "<h3>\u8bc4\u4f30</h3>\n<p>\u6211\u4eec\u4f7f\u7528\u91c7\u6837\u51fd\u6570\u6765\u8bc4\u4f30\u4e00\u7ec4\u95ee\u9898\u7684\u6a21\u578b</p>\n",
5
"<p> </p>\n": "<p></p>\n",
6
"<p> Code to test generated problems</p>\n": "<p>\u7528\u4e8e\u6d4b\u8bd5\u751f\u6210\u7684\u95ee\u9898\u7684\u4ee3\u7801</p>\n",
7
"<p> Decode a list of token ids</p>\n": "<p>\u89e3\u7801\u4ee4\u724c ID \u5217\u8868</p>\n",
8
"<p> Encode a given string</p>\n": "<p>\u5bf9\u7ed9\u5b9a\u5b57\u7b26\u4e32\u8fdb\u884c\u7f16\u7801</p>\n",
9
"<p> Generate multiple problems and pack them into a sequence.</p>\n": "<p>\u751f\u6210\u591a\u4e2a\u95ee\u9898\u5e76\u5c06\u5b83\u4eec\u6253\u5305\u6210\u4e00\u4e2a\u5e8f\u5217\u3002</p>\n",
10
"<p> Generates an integer with <span translate=no>_^_0_^_</span> number of digits</p>\n": "<p>\u751f\u6210\u4e00\u4e2a\u5305\u542b\u4f4d<span translate=no>_^_0_^_</span>\u6570\u7684\u6574\u6570</p>\n",
11
"<p> Generates the workings for <span translate=no>_^_0_^_</span>. For example for <span translate=no>_^_1_^_</span> it generates <span translate=no>_^_2_^_</span>.</p>\n": "<p>\u751f\u6210\u7684\u5de5\u4f5c\u539f\u7406<span translate=no>_^_0_^_</span>\u3002\u4f8b\u5982\uff0c<span translate=no>_^_1_^_</span>\u5b83\u4f1a\u751f\u6210<span translate=no>_^_2_^_</span>\u3002</p>\n",
12
"<p> Get a input and target pair for auto-regressive modelling</p>\n": "<p>\u83b7\u53d6\u81ea\u52a8\u56de\u5f52\u5efa\u6a21\u7684\u8f93\u5165\u548c\u76ee\u6807\u5bf9</p>\n",
13
"<p> Get arithmetic problem and answer. This is used for evaluation.</p>\n": "<p>\u83b7\u53d6\u7b97\u672f\u95ee\u9898\u548c\u7b54\u6848\u3002\u8fd9\u7528\u4e8e\u8bc4\u4f30\u3002</p>\n",
14
"<p> Number of sequences per epoch</p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u7684\u5e8f\u5217\u6570</p>\n",
15
"<p> Training data loader</p>\n": "<p>\u8bad\u7ec3\u6570\u636e\u52a0\u8f7d\u5668</p>\n",
16
"<p><em>This is based on code by <a href=\"https://twitter.com/gharik\">Georges Harik (@gharik)</a>.</em></p>\n": "<p><em>\u8fd9\u662f\u57fa\u4e8e<a href=\"https://twitter.com/gharik\">\u4e54\u6cbb\u00b7\u54c8\u91cc\u514b\uff08@gharik\uff09</a>\u7684\u4ee3\u7801\u3002</em></p>\n",
17
"<p>Add the next token to the input </p>\n": "<p>\u5c06\u4e0b\u4e00\u4e2a\u4ee4\u724c\u6dfb\u52a0\u5230\u8f93\u5165\u4e2d</p>\n",
18
"<p>Character to token id </p>\n": "<p>\u5b57\u7b26\u5230\u4ee4\u724c ID</p>\n",
19
"<p>Collect the problems only </p>\n": "<p>\u53ea\u6536\u96c6\u95ee\u9898</p>\n",
20
"<p>Count the number of correct answers </p>\n": "<p>\u8ba1\u7b97\u6b63\u786e\u7b54\u6848\u7684\u6570\u91cf</p>\n",
21
"<p>Create a dataset to generate problems </p>\n": "<p>\u521b\u5efa\u6570\u636e\u96c6\u4ee5\u751f\u6210\u95ee\u9898</p>\n",
22
"<p>Create a tensor with only the initial token </p>\n": "<p>\u4ec5\u4f7f\u7528\u521d\u59cb\u4ee4\u724c\u521b\u5efa\u5f20\u91cf</p>\n",
23
"<p>Discard everything after the answer in the results </p>\n": "<p>\u4e22\u5f03\u7ed3\u679c\u4e2d\u7b54\u6848\u540e\u7684\u6240\u6709\u5185\u5bb9</p>\n",
24
"<p>Find which sequences have finished </p>\n": "<p>\u627e\u51fa\u54ea\u4e9b\u5e8f\u5217\u5df2\u5b8c\u6210</p>\n",
25
"<p>Get a set of problems and answers </p>\n": "<p>\u83b7\u53d6\u4e00\u7cfb\u5217\u95ee\u9898\u548c\u7b54\u6848</p>\n",
26
"<p>Get the answers </p>\n": "<p>\u5f97\u5230\u7b54\u6848</p>\n",
27
"<p>Get the model output </p>\n": "<p>\u83b7\u53d6\u6a21\u578b\u8f93\u51fa</p>\n",
28
"<p>Get the model prediction (greedy) </p>\n": "<p>\u83b7\u53d6\u6a21\u578b\u9884\u6d4b\uff08\u8d2a\u5a6a\uff09</p>\n",
29
"<p>Get the sampled results </p>\n": "<p>\u83b7\u53d6\u62bd\u6837\u7ed3\u679c</p>\n",
30
"<p>If all the sequences have completed we skip this </p>\n": "<p>\u5982\u679c\u6240\u6709\u7684\u5e8f\u5217\u90fd\u5b8c\u6210\u4e86\uff0c\u6211\u4eec\u5c31\u8df3\u8fc7\u8fd9\u4e2a</p>\n",
31
"<p>Log a sample </p>\n": "<p>\u8bb0\u5f55\u6837\u672c</p>\n",
32
"<p>Log the score </p>\n": "<p>\u8bb0\u5f55\u5206\u6570</p>\n",
33
"<p>Make a problem with a pre_explanation or not</p>\n<p>Creates an arithmetic addition problem with workings and answer.</p>\n": "<p>\u4e0d\u7ba1\u662f\u5426\u7528 pre_explansion \u95ee\u95ee\u9898</p>\n<p>\u7528\u8fd0\u4f5c\u548c\u7b54\u6848\u521b\u5efa\u7b97\u672f\u52a0\u6cd5\u95ee\u9898\u3002</p>\n",
34
"<p>Maximum number of digits per operand integer </p>\n": "<p>\u6bcf\u4e2a\u64cd\u4f5c\u6570\u6574\u6570\u7684\u6700\u5927\u4f4d\u6570</p>\n",
35
"<p>Move to device </p>\n": "<p>\u79fb\u81f3\u8bbe\u5907</p>\n",
36
"<p>No need of a validation dataset </p>\n": "<p>\u4e0d\u9700\u8981\u9a8c\u8bc1\u6570\u636e\u96c6</p>\n",
37
"<p>Number of problems in evaluation </p>\n": "<p>\u8bc4\u4f30\u4e2d\u7684\u95ee\u9898\u6570\u91cf</p>\n",
38
"<p>Number of sequences that have completed </p>\n": "<p>\u5df2\u5b8c\u6210\u7684\u5e8f\u5217\u6570</p>\n",
39
"<p>Number of times to run evaluations per epoch </p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u8fd0\u884c\u8bc4\u4f30\u7684\u6b21\u6570</p>\n",
40
"<p>Number of tokens in the vocabulary </p>\n": "<p>\u8bcd\u6c47\u8868\u4e2d\u7684\u4ee3\u5e01\u6570\u91cf</p>\n",
41
"<p>Number of training sequences per epoch </p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u7684\u8bad\u7ec3\u5e8f\u5217\u6570</p>\n",
42
"<p>Override with the question </p>\n": "<p>\u7528\u95ee\u9898\u8986\u76d6</p>\n",
43
"<p>Sample upto sequence length </p>\n": "<p>\u6837\u672c\u76f4\u81f3\u5e8f\u5217\u957f\u5ea6</p>\n",
44
"<p>Sampled results </p>\n": "<p>\u62bd\u6837\u7ed3\u679c</p>\n",
45
"<p>Skip if all have finished </p>\n": "<p>\u5982\u679c\u5168\u90e8\u5b8c\u6210\uff0c\u5219\u8df3\u8fc7</p>\n",
46
"<p>Skip in the first epoch </p>\n": "<p>\u8df3\u8fc7\u7b2c\u4e00\u4e2a\u7eaa\u5143</p>\n",
47
"<p>Token id of the new line character - this marks end of the answer </p>\n": "<p>\u6362\u884c\u7b26\u7684\u6807\u8bb0 ID-\u8fd9\u6807\u5fd7\u7740\u7b54\u6848\u7684\u7ed3\u675f</p>\n",
48
"<p>Token id to string </p>\n": "<p>\u4ee4\u724c ID \u8f6c\u6362\u4e3a\u5b57\u7b26\u4e32</p>\n",
49
"<p>Training data loader </p>\n": "<p>\u8bad\u7ec3\u6570\u636e\u52a0\u8f7d\u5668</p>\n",
50
"<ul><li><span translate=no>_^_0_^_</span> is the sequence length of generated math problems. We fill as many problems as possible upto this length :max_digits: is the maximum number of digits in the operand integers :n_sequences: is the number of sequences per epoch</li></ul>\n": "<ul><li><span translate=no>_^_0_^_</span>\u662f\u751f\u6210\u7684\u6570\u5b66\u95ee\u9898\u7684\u5e8f\u5217\u957f\u5ea6\u3002\u6211\u4eec\u5c3d\u53ef\u80fd\u591a\u5730\u586b\u5199\u95ee\u9898\uff0c\u76f4\u5230\u8fd9\u4e2a\u957f\u5ea6\uff1amax_digits: \u662f\u64cd\u4f5c\u6570\u4e2d\u7684\u6700\u5927\u4f4d\u6570\u6574\u6570:n_sequences: \u662f\u6bcf\u4e2a\u7eaa\u5143\u7684\u5e8f\u5217\u6570</li></ul>\n",
51
"Arithmetic Dataset": "\u7b97\u672f\u6570\u636e\u96c6",
52
"This creates arithmetic problems.": "\u8fd9\u4f1a\u4ea7\u751f\u7b97\u672f\u95ee\u9898\u3002"
53
}
54