Path: blob/master/translate_cache/experiments/arithmetic_dataset.zh.json
4923 views
{1"<h2>Arithmetic Dataset</h2>\n<p>This creates arithmetic addition problems and solutions with workings. We've only implemented addition so far.</p>\n<p>It's based on a character level tokenization.</p>\n": "<h2>\u7b97\u672f\u6570\u636e\u96c6</h2>\n<p>\u8fd9\u4f1a\u4ea7\u751f\u7b97\u672f\u52a0\u6cd5\u95ee\u9898\u548c\u8fd0\u4f5c\u89e3\u51b3\u65b9\u6848\u3002\u5230\u76ee\u524d\u4e3a\u6b62\uff0c\u6211\u4eec\u53ea\u5b9e\u65bd\u4e86\u52a0\u6cd5\u3002</p>\n<p>\u5b83\u57fa\u4e8e\u89d2\u8272\u7ea7\u522b\u7684\u6807\u8bb0\u5316\u3002</p>\n",2"<h2>Arithmetic Task Experiment Configurations</h2>\n": "<h2>\u7b97\u672f\u4efb\u52a1\u5b9e\u9a8c\u914d\u7f6e</h2>\n",3"<h3>Evaluation</h3>\n<p>We use the sampling function to evaluate the model on a set of problems</p>\n": "<h3>\u8bc4\u4f30</h3>\n<p>\u6211\u4eec\u4f7f\u7528\u91c7\u6837\u51fd\u6570\u6765\u8bc4\u4f30\u4e00\u7ec4\u95ee\u9898\u7684\u6a21\u578b</p>\n",4"<p> </p>\n": "<p></p>\n",5"<p> Code to test generated problems</p>\n": "<p>\u7528\u4e8e\u6d4b\u8bd5\u751f\u6210\u7684\u95ee\u9898\u7684\u4ee3\u7801</p>\n",6"<p> Decode a list of token ids</p>\n": "<p>\u89e3\u7801\u4ee4\u724c ID \u5217\u8868</p>\n",7"<p> Encode a given string</p>\n": "<p>\u5bf9\u7ed9\u5b9a\u5b57\u7b26\u4e32\u8fdb\u884c\u7f16\u7801</p>\n",8"<p> Generate multiple problems and pack them into a sequence.</p>\n": "<p>\u751f\u6210\u591a\u4e2a\u95ee\u9898\u5e76\u5c06\u5b83\u4eec\u6253\u5305\u6210\u4e00\u4e2a\u5e8f\u5217\u3002</p>\n",9"<p> Generates an integer with <span translate=no>_^_0_^_</span> number of digits</p>\n": "<p>\u751f\u6210\u4e00\u4e2a\u5305\u542b\u4f4d<span translate=no>_^_0_^_</span>\u6570\u7684\u6574\u6570</p>\n",10"<p> Generates the workings for <span translate=no>_^_0_^_</span>. For example for <span translate=no>_^_1_^_</span> it generates <span translate=no>_^_2_^_</span>.</p>\n": "<p>\u751f\u6210\u7684\u5de5\u4f5c\u539f\u7406<span translate=no>_^_0_^_</span>\u3002\u4f8b\u5982\uff0c<span translate=no>_^_1_^_</span>\u5b83\u4f1a\u751f\u6210<span translate=no>_^_2_^_</span>\u3002</p>\n",11"<p> Get a input and target pair for auto-regressive modelling</p>\n": "<p>\u83b7\u53d6\u81ea\u52a8\u56de\u5f52\u5efa\u6a21\u7684\u8f93\u5165\u548c\u76ee\u6807\u5bf9</p>\n",12"<p> Get arithmetic problem and answer. This is used for evaluation.</p>\n": "<p>\u83b7\u53d6\u7b97\u672f\u95ee\u9898\u548c\u7b54\u6848\u3002\u8fd9\u7528\u4e8e\u8bc4\u4f30\u3002</p>\n",13"<p> Number of sequences per epoch</p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u7684\u5e8f\u5217\u6570</p>\n",14"<p> Training data loader</p>\n": "<p>\u8bad\u7ec3\u6570\u636e\u52a0\u8f7d\u5668</p>\n",15"<p><em>This is based on code by <a href=\"https://twitter.com/gharik\">Georges Harik (@gharik)</a>.</em></p>\n": "<p><em>\u8fd9\u662f\u57fa\u4e8e<a href=\"https://twitter.com/gharik\">\u4e54\u6cbb\u00b7\u54c8\u91cc\u514b\uff08@gharik\uff09</a>\u7684\u4ee3\u7801\u3002</em></p>\n",16"<p>Add the next token to the input </p>\n": "<p>\u5c06\u4e0b\u4e00\u4e2a\u4ee4\u724c\u6dfb\u52a0\u5230\u8f93\u5165\u4e2d</p>\n",17"<p>Character to token id </p>\n": "<p>\u5b57\u7b26\u5230\u4ee4\u724c ID</p>\n",18"<p>Collect the problems only </p>\n": "<p>\u53ea\u6536\u96c6\u95ee\u9898</p>\n",19"<p>Count the number of correct answers </p>\n": "<p>\u8ba1\u7b97\u6b63\u786e\u7b54\u6848\u7684\u6570\u91cf</p>\n",20"<p>Create a dataset to generate problems </p>\n": "<p>\u521b\u5efa\u6570\u636e\u96c6\u4ee5\u751f\u6210\u95ee\u9898</p>\n",21"<p>Create a tensor with only the initial token </p>\n": "<p>\u4ec5\u4f7f\u7528\u521d\u59cb\u4ee4\u724c\u521b\u5efa\u5f20\u91cf</p>\n",22"<p>Discard everything after the answer in the results </p>\n": "<p>\u4e22\u5f03\u7ed3\u679c\u4e2d\u7b54\u6848\u540e\u7684\u6240\u6709\u5185\u5bb9</p>\n",23"<p>Find which sequences have finished </p>\n": "<p>\u627e\u51fa\u54ea\u4e9b\u5e8f\u5217\u5df2\u5b8c\u6210</p>\n",24"<p>Get a set of problems and answers </p>\n": "<p>\u83b7\u53d6\u4e00\u7cfb\u5217\u95ee\u9898\u548c\u7b54\u6848</p>\n",25"<p>Get the answers </p>\n": "<p>\u5f97\u5230\u7b54\u6848</p>\n",26"<p>Get the model output </p>\n": "<p>\u83b7\u53d6\u6a21\u578b\u8f93\u51fa</p>\n",27"<p>Get the model prediction (greedy) </p>\n": "<p>\u83b7\u53d6\u6a21\u578b\u9884\u6d4b\uff08\u8d2a\u5a6a\uff09</p>\n",28"<p>Get the sampled results </p>\n": "<p>\u83b7\u53d6\u62bd\u6837\u7ed3\u679c</p>\n",29"<p>If all the sequences have completed we skip this </p>\n": "<p>\u5982\u679c\u6240\u6709\u7684\u5e8f\u5217\u90fd\u5b8c\u6210\u4e86\uff0c\u6211\u4eec\u5c31\u8df3\u8fc7\u8fd9\u4e2a</p>\n",30"<p>Log a sample </p>\n": "<p>\u8bb0\u5f55\u6837\u672c</p>\n",31"<p>Log the score </p>\n": "<p>\u8bb0\u5f55\u5206\u6570</p>\n",32"<p>Make a problem with a pre_explanation or not</p>\n<p>Creates an arithmetic addition problem with workings and answer.</p>\n": "<p>\u4e0d\u7ba1\u662f\u5426\u7528 pre_explansion \u95ee\u95ee\u9898</p>\n<p>\u7528\u8fd0\u4f5c\u548c\u7b54\u6848\u521b\u5efa\u7b97\u672f\u52a0\u6cd5\u95ee\u9898\u3002</p>\n",33"<p>Maximum number of digits per operand integer </p>\n": "<p>\u6bcf\u4e2a\u64cd\u4f5c\u6570\u6574\u6570\u7684\u6700\u5927\u4f4d\u6570</p>\n",34"<p>Move to device </p>\n": "<p>\u79fb\u81f3\u8bbe\u5907</p>\n",35"<p>No need of a validation dataset </p>\n": "<p>\u4e0d\u9700\u8981\u9a8c\u8bc1\u6570\u636e\u96c6</p>\n",36"<p>Number of problems in evaluation </p>\n": "<p>\u8bc4\u4f30\u4e2d\u7684\u95ee\u9898\u6570\u91cf</p>\n",37"<p>Number of sequences that have completed </p>\n": "<p>\u5df2\u5b8c\u6210\u7684\u5e8f\u5217\u6570</p>\n",38"<p>Number of times to run evaluations per epoch </p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u8fd0\u884c\u8bc4\u4f30\u7684\u6b21\u6570</p>\n",39"<p>Number of tokens in the vocabulary </p>\n": "<p>\u8bcd\u6c47\u8868\u4e2d\u7684\u4ee3\u5e01\u6570\u91cf</p>\n",40"<p>Number of training sequences per epoch </p>\n": "<p>\u6bcf\u4e2a\u7eaa\u5143\u7684\u8bad\u7ec3\u5e8f\u5217\u6570</p>\n",41"<p>Override with the question </p>\n": "<p>\u7528\u95ee\u9898\u8986\u76d6</p>\n",42"<p>Sample upto sequence length </p>\n": "<p>\u6837\u672c\u76f4\u81f3\u5e8f\u5217\u957f\u5ea6</p>\n",43"<p>Sampled results </p>\n": "<p>\u62bd\u6837\u7ed3\u679c</p>\n",44"<p>Skip if all have finished </p>\n": "<p>\u5982\u679c\u5168\u90e8\u5b8c\u6210\uff0c\u5219\u8df3\u8fc7</p>\n",45"<p>Skip in the first epoch </p>\n": "<p>\u8df3\u8fc7\u7b2c\u4e00\u4e2a\u7eaa\u5143</p>\n",46"<p>Token id of the new line character - this marks end of the answer </p>\n": "<p>\u6362\u884c\u7b26\u7684\u6807\u8bb0 ID-\u8fd9\u6807\u5fd7\u7740\u7b54\u6848\u7684\u7ed3\u675f</p>\n",47"<p>Token id to string </p>\n": "<p>\u4ee4\u724c ID \u8f6c\u6362\u4e3a\u5b57\u7b26\u4e32</p>\n",48"<p>Training data loader </p>\n": "<p>\u8bad\u7ec3\u6570\u636e\u52a0\u8f7d\u5668</p>\n",49"<ul><li><span translate=no>_^_0_^_</span> is the sequence length of generated math problems. We fill as many problems as possible upto this length :max_digits: is the maximum number of digits in the operand integers :n_sequences: is the number of sequences per epoch</li></ul>\n": "<ul><li><span translate=no>_^_0_^_</span>\u662f\u751f\u6210\u7684\u6570\u5b66\u95ee\u9898\u7684\u5e8f\u5217\u957f\u5ea6\u3002\u6211\u4eec\u5c3d\u53ef\u80fd\u591a\u5730\u586b\u5199\u95ee\u9898\uff0c\u76f4\u5230\u8fd9\u4e2a\u957f\u5ea6\uff1amax_digits: \u662f\u64cd\u4f5c\u6570\u4e2d\u7684\u6700\u5927\u4f4d\u6570\u6574\u6570:n_sequences: \u662f\u6bcf\u4e2a\u7eaa\u5143\u7684\u5e8f\u5217\u6570</li></ul>\n",50"Arithmetic Dataset": "\u7b97\u672f\u6570\u636e\u96c6",51"This creates arithmetic problems.": "\u8fd9\u4f1a\u4ea7\u751f\u7b97\u672f\u95ee\u9898\u3002"52}5354