Path: blob/master/translate_cache/experiments/arithmetic_dataset.ja.json
{
    "<h2>Arithmetic Dataset</h2>\n<p>This creates arithmetic addition problems and solutions with workings. We've only implemented addition so far.</p>\n<p>It's based on a character level tokenization.</p>\n": "<h2>\u7b97\u8853\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8</h2>\n<p>\u3053\u308c\u306b\u3088\u308a\u3001\u7b97\u8853\u52a0\u7b97\u306e\u554f\u984c\u3068\u89e3\u6cd5\u304c\u751f\u6210\u3055\u308c\u307e\u3059\u3002\u4eca\u306e\u3068\u3053\u308d\u3001\u8ffd\u52a0\u3092\u5b9f\u88c5\u3057\u305f\u3060\u3051\u3067\u3059\u3002</p>\n<p>\u30ad\u30e3\u30e9\u30af\u30bf\u30fc\u30ec\u30d9\u30eb\u306e\u30c8\u30fc\u30af\u30f3\u5316\u306b\u57fa\u3065\u3044\u3066\u3044\u307e\u3059\u3002</p>\n",
    "<h2>Arithmetic Task Experiment Configurations</h2>\n": "<h2>\u7b97\u8853\u30bf\u30b9\u30af\u5b9f\u9a13\u69cb\u6210</h2>\n",
    "<h3>Evaluation</h3>\n<p>We use the sampling function to evaluate the model on a set of problems</p>\n": "<h3>\u8a55\u4fa1</h3>\n<p>\u30b5\u30f3\u30d7\u30ea\u30f3\u30b0\u95a2\u6570\u3092\u4f7f\u7528\u3057\u3066\u3001\u4e00\u9023\u306e\u554f\u984c\u306b\u3064\u3044\u3066\u30e2\u30c7\u30eb\u3092\u8a55\u4fa1\u3057\u307e\u3059\u3002</p>\n",
    "<p> </p>\n": "<p></p>\n",
    "<p> Code to test generated problems</p>\n": "<p>\u751f\u6210\u3055\u308c\u305f\u554f\u984c\u3092\u30c6\u30b9\u30c8\u3059\u308b\u30b3\u30fc\u30c9</p>\n",
    "<p> Decode a list of token ids</p>\n": "<p>\u30c8\u30fc\u30af\u30f3 ID \u306e\u30ea\u30b9\u30c8\u3092\u30c7\u30b3\u30fc\u30c9\u3059\u308b</p>\n",
    "<p> Encode a given string</p>\n": "<p>\u4e0e\u3048\u3089\u308c\u305f\u6587\u5b57\u5217\u3092\u30a8\u30f3\u30b3\u30fc\u30c9\u3059\u308b</p>\n",
    "<p> Generate multiple problems and pack them into a sequence.</p>\n": "<p>\u8907\u6570\u306e\u554f\u984c\u3092\u751f\u6210\u3057\u3001\u305d\u308c\u3089\u3092\u30b7\u30fc\u30b1\u30f3\u30b9\u306b\u307e\u3068\u3081\u307e\u3059\u3002</p>\n",
    "<p> Generates an integer with <span translate=no>_^_0_^_</span> number of digits</p>\n": "<p><span translate=no>_^_0_^_</span>\u6841\u6570\u306e\u6574\u6570\u3092\u751f\u6210\u3057\u307e\u3059</p>\n",
    "<p> Generates the workings for <span translate=no>_^_0_^_</span>. For example for <span translate=no>_^_1_^_</span> it generates <span translate=no>_^_2_^_</span>.</p>\n": "<p>\u306e\u4f5c\u696d\u3092\u751f\u6210\u3057\u307e\u3059\u3002<span translate=no>_^_0_^_</span>\u305f\u3068\u3048\u3070<span translate=no>_^_1_^_</span>\u3001\u751f\u6210\u3059\u308b\u5834\u5408\u306a\u3069\u3067\u3059<span translate=no>_^_2_^_</span>\u3002</p>\n",
    "<p> Get a input and target pair for auto-regressive modelling</p>\n": "<p>\u81ea\u5df1\u56de\u5e30\u30e2\u30c7\u30ea\u30f3\u30b0\u306e\u5165\u529b\u3068\u30bf\u30fc\u30b2\u30c3\u30c8\u306e\u30da\u30a2\u3092\u53d6\u5f97</p>\n",
    "<p> Get arithmetic problem and answer. This is used for evaluation.</p>\n": "<p>\u7b97\u8853\u554f\u984c\u3092\u51fa\u3057\u3066\u3001\u7b54\u3048\u3092\u51fa\u3057\u3066\u304f\u3060\u3055\u3044\u3002\u3053\u308c\u306f\u8a55\u4fa1\u306b\u4f7f\u7528\u3055\u308c\u307e\u3059\u3002</p>\n",
    "<p> Number of sequences per epoch</p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u6570</p>\n",
    "<p> Training data loader</p>\n": "<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u30ed\u30fc\u30c0\u30fc</p>\n",
    "<p><em>This is based on code by <a href=\"https://twitter.com/gharik\">Georges Harik (@gharik)</a>.</em></p>\n": "<p><em><a href=\"https://twitter.com/gharik\">\u3053\u308c\u306f\u30b8\u30e7\u30eb\u30b8\u30e5\u30fb\u30cf\u30ea\u30af</a> (@gharik) \u306e\u30b3\u30fc\u30c9\u306b\u57fa\u3065\u3044\u3066\u3044\u307e\u3059\u3002</em></p>\n",
    "<p>Add the next token to the input </p>\n": "<p>\u6b21\u306e\u30c8\u30fc\u30af\u30f3\u3092\u5165\u529b\u306b\u8ffd\u52a0\u3057\u307e\u3059</p>\n",
    "<p>Character to token id </p>\n": "<p>\u6587\u5b57\u304b\u3089\u30c8\u30fc\u30af\u30f3 ID \u3078</p>\n",
    "<p>Collect the problems only </p>\n": "<p>\u554f\u984c\u3060\u3051\u96c6\u3081\u3088\u3046</p>\n",
    "<p>Count the number of correct answers </p>\n": "<p>\u6b63\u89e3\u306e\u6570\u3092\u6570\u3048\u308b</p>\n",
    "<p>Create a dataset to generate problems </p>\n": "<p>\u554f\u984c\u3092\u751f\u6210\u3059\u308b\u305f\u3081\u306e\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306e\u4f5c\u6210</p>\n",
    "<p>Create a tensor with only the initial token </p>\n": "<p>\u6700\u521d\u306e\u30c8\u30fc\u30af\u30f3\u306e\u307f\u3067\u30c6\u30f3\u30bd\u30eb\u3092\u4f5c\u6210</p>\n",
    "<p>Discard everything after the answer in the results </p>\n": "<p>\u7d50\u679c\u306e\u56de\u7b54\u306e\u5f8c\u306b\u7d9a\u304f\u3082\u306e\u306f\u3059\u3079\u3066\u7834\u68c4\u3057\u3066\u304f\u3060\u3055\u3044</p>\n",
    "<p>Find which sequences have finished </p>\n": "<p>\u3069\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u304c\u7d42\u4e86\u3057\u305f\u304b\u8abf\u3079\u308b</p>\n",
    "<p>Get a set of problems and answers </p>\n": "<p>\u4e00\u9023\u306e\u554f\u984c\u3068\u56de\u7b54\u3092\u5165\u624b</p>\n",
    "<p>Get the answers </p>\n": "<p>\u7b54\u3048\u3092\u30b2\u30c3\u30c8</p>\n",
    "<p>Get the model output </p>\n": "<p>\u30e2\u30c7\u30eb\u51fa\u529b\u3092\u53d6\u5f97</p>\n",
    "<p>Get the model prediction (greedy) </p>\n": "<p>\u30e2\u30c7\u30eb\u4e88\u6e2c\u3092\u53d6\u5f97 (\u6b32\u5f35\u308a)</p>\n",
    "<p>Get the sampled results </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u7d50\u679c\u3092\u53d6\u5f97</p>\n",
    "<p>If all the sequences have completed we skip this </p>\n": "<p>\u3059\u3079\u3066\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u304c\u5b8c\u4e86\u3057\u305f\u3089\u3053\u308c\u3092\u30b9\u30ad\u30c3\u30d7\u3057\u307e\u3059\u3002</p>\n",
    "<p>Log a sample </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u3092\u30ed\u30b0\u306b\u8a18\u9332\u3059\u308b</p>\n",
    "<p>Log the score </p>\n": "<p>\u30b9\u30b3\u30a2\u3092\u8a18\u9332\u3059\u308b</p>\n",
    "<p>Make a problem with a pre_explanation or not</p>\n<p>Creates an arithmetic addition problem with workings and answer.</p>\n": "<p>pre_explanation \u3067\u554f\u984c\u3092\u8d77\u3053\u3059\u304b\u3057\u306a\u3044\u304b</p>\n<p>\u8a08\u7b97\u3068\u89e3\u3092\u542b\u3080\u7b97\u8853\u52a0\u7b97\u554f\u984c\u3092\u4f5c\u6210\u3057\u307e\u3059\u3002</p>\n",
    "<p>Maximum number of digits per operand integer </p>\n": "<p>\u30aa\u30da\u30e9\u30f3\u30c9\u6574\u6570\u3042\u305f\u308a\u306e\u6700\u5927\u6841\u6570</p>\n",
    "<p>Move to device </p>\n": "<p>\u30c7\u30d0\u30a4\u30b9\u306b\u79fb\u52d5</p>\n",
    "<p>No need of a validation dataset </p>\n": "<p>\u691c\u8a3c\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306f\u4e0d\u8981</p>\n",
    "<p>Number of problems in evaluation </p>\n": "<p>\u8a55\u4fa1\u4e2d\u306e\u554f\u984c\u306e\u6570</p>\n",
    "<p>Number of sequences that have completed </p>\n": "<p>\u5b8c\u4e86\u3057\u305f\u30b7\u30fc\u30b1\u30f3\u30b9\u306e\u6570</p>\n",
    "<p>Number of times to run evaluations per epoch </p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3054\u3068\u306b\u8a55\u4fa1\u3092\u5b9f\u884c\u3059\u308b\u56de\u6570</p>\n",
    "<p>Number of tokens in the vocabulary </p>\n": "<p>\u30dc\u30ad\u30e3\u30d6\u30e9\u30ea\u30fc\u306e\u30c8\u30fc\u30af\u30f3\u306e\u6570</p>\n",
    "<p>Number of training sequences per epoch </p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30b7\u30fc\u30b1\u30f3\u30b9\u306e\u6570</p>\n",
    "<p>Override with the question </p>\n": "<p>\u8cea\u554f\u3067\u4e0a\u66f8\u304d</p>\n",
    "<p>Sample upto sequence length </p>\n": "<p>\u30b7\u30fc\u30b1\u30f3\u30b9\u9577\u307e\u3067\u306e\u30b5\u30f3\u30d7\u30eb</p>\n",
    "<p>Sampled results </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u7d50\u679c</p>\n",
    "<p>Skip if all have finished </p>\n": "<p>\u3059\u3079\u3066\u7d42\u4e86\u3057\u305f\u3089\u30b9\u30ad\u30c3\u30d7</p>\n",
    "<p>Skip in the first epoch </p>\n": "<p>\u6700\u521d\u306e\u30a8\u30dd\u30c3\u30af\u3092\u30b9\u30ad\u30c3\u30d7</p>\n",
    "<p>Token id of the new line character - this marks end of the answer </p>\n": "<p>\u6539\u884c\u6587\u5b57\u306e\u30c8\u30fc\u30af\u30f3ID-\u3053\u308c\u3067\u56de\u7b54\u306e\u6700\u5f8c\u306b\u306a\u308a\u307e\u3059</p>\n",
    "<p>Token id to string </p>\n": "<p>\u30c8\u30fc\u30af\u30f3 ID \u3092\u6587\u5b57\u5217\u306b</p>\n",
    "<p>Training data loader </p>\n": "<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u30ed\u30fc\u30c0\u30fc</p>\n",
    "<ul><li><span translate=no>_^_0_^_</span> is the sequence length of generated math problems. We fill as many problems as possible upto this length :max_digits: is the maximum number of digits in the operand integers :n_sequences: is the number of sequences per epoch</li></ul>\n": "<ul><li><span translate=no>_^_0_^_</span>\u751f\u6210\u3055\u308c\u305f\u6570\u5b66\u554f\u984c\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u9577\u3067\u3059\u3002\u3053\u306e\u9577\u3055\u307e\u3067\u3067\u304d\u308b\u3060\u3051\u591a\u304f\u306e\u554f\u984c\u3092\u89e3\u304d\u307e\u3059\u3002max_digits: \u306f\u30aa\u30da\u30e9\u30f3\u30c9\u6574\u6570\u306e\u6700\u5927\u6841\u6570:n_sequences: \u306f\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u6570</li></ul>\n",
    "Arithmetic Dataset": "\u7b97\u8853\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8",
    "This creates arithmetic problems.": "\u3053\u308c\u306f\u7b97\u8853\u4e0a\u306e\u554f\u984c\u3092\u5f15\u304d\u8d77\u3053\u3057\u307e\u3059\u3002"
}