Jen, Tien YiTien YiJenHuang, Hen HsenHen HsenHuangHSIN-HSI CHEN2023-03-102023-03-102021-12-149781450391153https://www.scopus.com/inward/record.uri?eid=2-s2.0-85128691088&doi=10.1145%2f3486622.3494000&partnerID=40&md5=ff4c58b0a9b53ccb9bf0f8f401ca2aa9Most studies of automatic math word problem solving rely on a dataset for training the model that transforms a question into the corresponding answer directly, or translates the question into a sequence of operations that form a program to derive the answer. The program serving as the intermediate symbolic form between the question and the answer provides more information for the model to learn arithmetic reasoning. However, manually composing the programs for numerous questions is a labor-intensive work, resulting in only one medium-sized dataset, MathQA, is available. This work proposes a novel recycling numeracy data augmentation (RNDA) approach that automatically generates high quality training instances in the MathQA style. Experimental results show that the model trained on the augmented data achieves the state-of-the-art performance. We will release the dataset as a resource for the research community.data augmentation | math word problem solving | recyclingStatistics; Data augmentation; High quality; Labour-intensive; Learn+; Math word problem solving; Quality training; Sequences of operations; State-of-the-art performance; Symbolic verification; Word problem solving; RecyclingRecycling Numeracy Data Augmentation with Symbolic Verification for Math Word Problem Solvingconference paper10.1145/3486622.34940002-s2.0-85128691088https://api.elsevier.com/content/abstract/scopus_id/85128691088