You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
*[Suggested Datasets and Models](#suggested-datasets-and-models)
50
+
*[Mathematical Language Understanding](#mathematical-language-understanding)
50
51
*[Story, Question and Answer](#story-question-and-answer)
51
52
*[Image Classification](#image-classification)
52
53
*[Image Generation](#image-generation)
@@ -79,6 +80,24 @@ hyperparameters that we know works well in our setup. We usually
79
80
run either on Cloud TPUs or on 8-GPU machines; you might need
80
81
to modify the hyperparameters if you run on a different setup.
81
82
83
+
### Mathematical Language Understanding
84
+
85
+
For evaluating mathematical expressions at the character level involving addition, subtraction and multiplication of both positive and negative decimal numbers with variable digits assigned to symbolic variables, use
86
+
87
+
* the [MLU](https://art.wangperawong.com/mathematical_language_understanding_train.tar.gz) data-set:
88
+
`--problem=mathematical_language_understanding`
89
+
90
+
You can try solving the problem with different transformer models and hyperparameters as described in the [paper](https://arxiv.org/abs/1812.02825):
0 commit comments