CoCalc -- arithmetic_dataset.ja.json

GitHub Repository: labmlai/annotated_deep_learning_paper_implementations
Path: blob/master/translate_cache/experiments/arithmetic_dataset.ja.json
⁴⁹²³ views
1
{
2
 "<h2>Arithmetic Dataset</h2>\n<p>This creates arithmetic addition problems and solutions with workings. We&#x27;ve only implemented addition so far.</p>\n<p>It&#x27;s based on a character level tokenization.</p>\n": "<h2>\u7b97\u8853\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8</h2>\n<p>\u3053\u308c\u306b\u3088\u308a\u3001\u7b97\u8853\u52a0\u7b97\u306e\u554f\u984c\u3068\u89e3\u6cd5\u304c\u751f\u6210\u3055\u308c\u307e\u3059\u3002\u4eca\u306e\u3068\u3053\u308d\u3001\u8ffd\u52a0\u3092\u5b9f\u88c5\u3057\u305f\u3060\u3051\u3067\u3059\u3002</p>\n<p>\u30ad\u30e3\u30e9\u30af\u30bf\u30fc\u30ec\u30d9\u30eb\u306e\u30c8\u30fc\u30af\u30f3\u5316\u306b\u57fa\u3065\u3044\u3066\u3044\u307e\u3059\u3002</p>\n",
3
 "<h2>Arithmetic Task Experiment Configurations</h2>\n": "<h2>\u7b97\u8853\u30bf\u30b9\u30af\u5b9f\u9a13\u69cb\u6210</h2>\n",
4
 "<h3>Evaluation</h3>\n<p>We use the sampling function to evaluate the model on a set of problems</p>\n": "<h3>\u8a55\u4fa1</h3>\n<p>\u30b5\u30f3\u30d7\u30ea\u30f3\u30b0\u95a2\u6570\u3092\u4f7f\u7528\u3057\u3066\u3001\u4e00\u9023\u306e\u554f\u984c\u306b\u3064\u3044\u3066\u30e2\u30c7\u30eb\u3092\u8a55\u4fa1\u3057\u307e\u3059\u3002</p>\n",
5
 "<p> </p>\n": "<p></p>\n",
6
 "<p> Code to test generated problems</p>\n": "<p>\u751f\u6210\u3055\u308c\u305f\u554f\u984c\u3092\u30c6\u30b9\u30c8\u3059\u308b\u30b3\u30fc\u30c9</p>\n",
7
 "<p> Decode a list of token ids</p>\n": "<p>\u30c8\u30fc\u30af\u30f3 ID \u306e\u30ea\u30b9\u30c8\u3092\u30c7\u30b3\u30fc\u30c9\u3059\u308b</p>\n",
8
 "<p> Encode a given string</p>\n": "<p>\u4e0e\u3048\u3089\u308c\u305f\u6587\u5b57\u5217\u3092\u30a8\u30f3\u30b3\u30fc\u30c9\u3059\u308b</p>\n",
9
 "<p> Generate multiple problems and pack them into a sequence.</p>\n": "<p>\u8907\u6570\u306e\u554f\u984c\u3092\u751f\u6210\u3057\u3001\u305d\u308c\u3089\u3092\u30b7\u30fc\u30b1\u30f3\u30b9\u306b\u307e\u3068\u3081\u307e\u3059\u3002</p>\n",
10
 "<p> Generates an integer with <span translate=no>_^_0_^_</span> number of digits</p>\n": "<p><span translate=no>_^_0_^_</span>\u6841\u6570\u306e\u6574\u6570\u3092\u751f\u6210\u3057\u307e\u3059</p>\n",
11
 "<p> Generates the workings for <span translate=no>_^_0_^_</span>. For example for <span translate=no>_^_1_^_</span> it generates <span translate=no>_^_2_^_</span>.</p>\n": "<p>\u306e\u4f5c\u696d\u3092\u751f\u6210\u3057\u307e\u3059\u3002<span translate=no>_^_0_^_</span>\u305f\u3068\u3048\u3070<span translate=no>_^_1_^_</span>\u3001\u751f\u6210\u3059\u308b\u5834\u5408\u306a\u3069\u3067\u3059<span translate=no>_^_2_^_</span>\u3002</p>\n",
12
 "<p> Get a input and target pair for auto-regressive modelling</p>\n": "<p>\u81ea\u5df1\u56de\u5e30\u30e2\u30c7\u30ea\u30f3\u30b0\u306e\u5165\u529b\u3068\u30bf\u30fc\u30b2\u30c3\u30c8\u306e\u30da\u30a2\u3092\u53d6\u5f97</p>\n",
13
 "<p> Get arithmetic problem and answer. This is used for evaluation.</p>\n": "<p>\u7b97\u8853\u554f\u984c\u3092\u51fa\u3057\u3066\u3001\u7b54\u3048\u3092\u51fa\u3057\u3066\u304f\u3060\u3055\u3044\u3002\u3053\u308c\u306f\u8a55\u4fa1\u306b\u4f7f\u7528\u3055\u308c\u307e\u3059\u3002</p>\n",
14
 "<p> Number of sequences per epoch</p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u6570</p>\n",
15
 "<p> Training data loader</p>\n": "<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u30ed\u30fc\u30c0\u30fc</p>\n",
16
 "<p><em>This is based on code by <a href=\"https://twitter.com/gharik\">Georges Harik (@gharik)</a>.</em></p>\n": "<p><em><a href=\"https://twitter.com/gharik\">\u3053\u308c\u306f\u30b8\u30e7\u30eb\u30b8\u30e5\u30fb\u30cf\u30ea\u30af</a> (@gharik) \u306e\u30b3\u30fc\u30c9\u306b\u57fa\u3065\u3044\u3066\u3044\u307e\u3059\u3002</em></p>\n",
17
 "<p>Add the next token to the input </p>\n": "<p>\u6b21\u306e\u30c8\u30fc\u30af\u30f3\u3092\u5165\u529b\u306b\u8ffd\u52a0\u3057\u307e\u3059</p>\n",
18
 "<p>Character to token id </p>\n": "<p>\u6587\u5b57\u304b\u3089\u30c8\u30fc\u30af\u30f3 ID \u3078</p>\n",
19
 "<p>Collect the problems only </p>\n": "<p>\u554f\u984c\u3060\u3051\u96c6\u3081\u3088\u3046</p>\n",
20
 "<p>Count the number of correct answers </p>\n": "<p>\u6b63\u89e3\u306e\u6570\u3092\u6570\u3048\u308b</p>\n",
21
 "<p>Create a dataset to generate problems </p>\n": "<p>\u554f\u984c\u3092\u751f\u6210\u3059\u308b\u305f\u3081\u306e\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306e\u4f5c\u6210</p>\n",
22
 "<p>Create a tensor with only the initial token </p>\n": "<p>\u6700\u521d\u306e\u30c8\u30fc\u30af\u30f3\u306e\u307f\u3067\u30c6\u30f3\u30bd\u30eb\u3092\u4f5c\u6210</p>\n",
23
 "<p>Discard everything after the answer in the results </p>\n": "<p>\u7d50\u679c\u306e\u56de\u7b54\u306e\u5f8c\u306b\u7d9a\u304f\u3082\u306e\u306f\u3059\u3079\u3066\u7834\u68c4\u3057\u3066\u304f\u3060\u3055\u3044</p>\n",
24
 "<p>Find which sequences have finished </p>\n": "<p>\u3069\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u304c\u7d42\u4e86\u3057\u305f\u304b\u8abf\u3079\u308b</p>\n",
25
 "<p>Get a set of problems and answers </p>\n": "<p>\u4e00\u9023\u306e\u554f\u984c\u3068\u56de\u7b54\u3092\u5165\u624b</p>\n",
26
 "<p>Get the answers </p>\n": "<p>\u7b54\u3048\u3092\u30b2\u30c3\u30c8</p>\n",
27
 "<p>Get the model output </p>\n": "<p>\u30e2\u30c7\u30eb\u51fa\u529b\u3092\u53d6\u5f97</p>\n",
28
 "<p>Get the model prediction (greedy) </p>\n": "<p>\u30e2\u30c7\u30eb\u4e88\u6e2c\u3092\u53d6\u5f97 (\u6b32\u5f35\u308a)</p>\n",
29
 "<p>Get the sampled results </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u7d50\u679c\u3092\u53d6\u5f97</p>\n",
30
 "<p>If all the sequences have completed we skip this </p>\n": "<p>\u3059\u3079\u3066\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u304c\u5b8c\u4e86\u3057\u305f\u3089\u3053\u308c\u3092\u30b9\u30ad\u30c3\u30d7\u3057\u307e\u3059\u3002</p>\n",
31
 "<p>Log a sample </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u3092\u30ed\u30b0\u306b\u8a18\u9332\u3059\u308b</p>\n",
32
 "<p>Log the score </p>\n": "<p>\u30b9\u30b3\u30a2\u3092\u8a18\u9332\u3059\u308b</p>\n",
33
 "<p>Make a problem with a pre_explanation or not</p>\n<p>Creates an arithmetic addition problem with workings and answer.</p>\n": "<p>pre_explanation \u3067\u554f\u984c\u3092\u8d77\u3053\u3059\u304b\u3057\u306a\u3044\u304b</p>\n<p>\u8a08\u7b97\u3068\u89e3\u3092\u542b\u3080\u7b97\u8853\u52a0\u7b97\u554f\u984c\u3092\u4f5c\u6210\u3057\u307e\u3059\u3002</p>\n",
34
 "<p>Maximum number of digits per operand integer </p>\n": "<p>\u30aa\u30da\u30e9\u30f3\u30c9\u6574\u6570\u3042\u305f\u308a\u306e\u6700\u5927\u6841\u6570</p>\n",
35
 "<p>Move to device </p>\n": "<p>\u30c7\u30d0\u30a4\u30b9\u306b\u79fb\u52d5</p>\n",
36
 "<p>No need of a validation dataset </p>\n": "<p>\u691c\u8a3c\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8\u306f\u4e0d\u8981</p>\n",
37
 "<p>Number of problems in evaluation </p>\n": "<p>\u8a55\u4fa1\u4e2d\u306e\u554f\u984c\u306e\u6570</p>\n",
38
 "<p>Number of sequences that have completed </p>\n": "<p>\u5b8c\u4e86\u3057\u305f\u30b7\u30fc\u30b1\u30f3\u30b9\u306e\u6570</p>\n",
39
 "<p>Number of times to run evaluations per epoch </p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3054\u3068\u306b\u8a55\u4fa1\u3092\u5b9f\u884c\u3059\u308b\u56de\u6570</p>\n",
40
 "<p>Number of tokens in the vocabulary </p>\n": "<p>\u30dc\u30ad\u30e3\u30d6\u30e9\u30ea\u30fc\u306e\u30c8\u30fc\u30af\u30f3\u306e\u6570</p>\n",
41
 "<p>Number of training sequences per epoch </p>\n": "<p>\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30b7\u30fc\u30b1\u30f3\u30b9\u306e\u6570</p>\n",
42
 "<p>Override with the question </p>\n": "<p>\u8cea\u554f\u3067\u4e0a\u66f8\u304d</p>\n",
43
 "<p>Sample upto sequence length </p>\n": "<p>\u30b7\u30fc\u30b1\u30f3\u30b9\u9577\u307e\u3067\u306e\u30b5\u30f3\u30d7\u30eb</p>\n",
44
 "<p>Sampled results </p>\n": "<p>\u30b5\u30f3\u30d7\u30eb\u7d50\u679c</p>\n",
45
 "<p>Skip if all have finished </p>\n": "<p>\u3059\u3079\u3066\u7d42\u4e86\u3057\u305f\u3089\u30b9\u30ad\u30c3\u30d7</p>\n",
46
 "<p>Skip in the first epoch </p>\n": "<p>\u6700\u521d\u306e\u30a8\u30dd\u30c3\u30af\u3092\u30b9\u30ad\u30c3\u30d7</p>\n",
47
 "<p>Token id of the new line character - this marks end of the answer </p>\n": "<p>\u6539\u884c\u6587\u5b57\u306e\u30c8\u30fc\u30af\u30f3ID-\u3053\u308c\u3067\u56de\u7b54\u306e\u6700\u5f8c\u306b\u306a\u308a\u307e\u3059</p>\n",
48
 "<p>Token id to string </p>\n": "<p>\u30c8\u30fc\u30af\u30f3 ID \u3092\u6587\u5b57\u5217\u306b</p>\n",
49
 "<p>Training data loader </p>\n": "<p>\u30c8\u30ec\u30fc\u30cb\u30f3\u30b0\u30c7\u30fc\u30bf\u30ed\u30fc\u30c0\u30fc</p>\n",
50
 "<ul><li><span translate=no>_^_0_^_</span>  is the sequence length of generated math problems.  We fill as many problems as possible upto this length :max_digits: is the maximum number of digits in the operand integers :n_sequences: is the number of sequences per epoch</li></ul>\n": "<ul><li><span translate=no>_^_0_^_</span>\u751f\u6210\u3055\u308c\u305f\u6570\u5b66\u554f\u984c\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u9577\u3067\u3059\u3002\u3053\u306e\u9577\u3055\u307e\u3067\u3067\u304d\u308b\u3060\u3051\u591a\u304f\u306e\u554f\u984c\u3092\u89e3\u304d\u307e\u3059\u3002max_digits: \u306f\u30aa\u30da\u30e9\u30f3\u30c9\u6574\u6570\u306e\u6700\u5927\u6841\u6570:n_sequences: \u306f\u30a8\u30dd\u30c3\u30af\u3042\u305f\u308a\u306e\u30b7\u30fc\u30b1\u30f3\u30b9\u6570</li></ul>\n",
51
 "Arithmetic Dataset": "\u7b97\u8853\u30c7\u30fc\u30bf\u30bb\u30c3\u30c8",
52
 "This creates arithmetic problems.": "\u3053\u308c\u306f\u7b97\u8853\u4e0a\u306e\u554f\u984c\u3092\u5f15\u304d\u8d77\u3053\u3057\u307e\u3059\u3002"
53
}
54
Product

Resources

Company