论文标题
数学单词问题的异构线图变压器
Heterogeneous Line Graph Transformer for Math Word Problems
论文作者
论文摘要
本文介绍了用于在线学习系统的新机器学习模型的设计和实施。我们旨在通过启用自动数学单词问题求解器来提高系统的智能水平,该求解器可以支持广泛的功能,例如家庭作业校正,难度估计和优先建议。我们最初计划采用现有模型,但意识到他们将数学单词问题处理为序列或代币的均匀图。多种类型的令牌(例如实体,单位,速率和数字)之间的关系被忽略。我们决定设计和实施一种新型模型,以使用这种关系数据来弥合人类可读语言和机器可读性逻辑形式之间的信息差距。我们提出了一个异质线图变压器(HLGT)模型,该模型通过在数学单词问题上通过语义角色标记构建异构线图,然后执行节点表示学习,从而了解Edge类型。我们添加数值比较作为一项辅助任务,以改善用于现实世界使用的模型培训。实验结果表明,所提出的模型比现有模型的性能更好,并表明它仍然远低于人类绩效。不断需要信息利用和知识发现来改善在线学习系统。
This paper describes the design and implementation of a new machine learning model for online learning systems. We aim at improving the intelligent level of the systems by enabling an automated math word problem solver which can support a wide range of functions such as homework correction, difficulty estimation, and priority recommendation. We originally planned to employ existing models but realized that they processed a math word problem as a sequence or a homogeneous graph of tokens. Relationships between the multiple types of tokens such as entity, unit, rate, and number were ignored. We decided to design and implement a novel model to use such relational data to bridge the information gap between human-readable language and machine-understandable logical form. We propose a heterogeneous line graph transformer (HLGT) model that constructs a heterogeneous line graph via semantic role labeling on math word problems and then perform node representation learning aware of edge types. We add numerical comparison as an auxiliary task to improve model training for real-world use. Experimental results show that the proposed model achieves a better performance than existing models and suggest that it is still far below human performance. Information utilization and knowledge discovery is continuously needed to improve the online learning systems.