基于异构感知和对比学习的图推理机制

黄如; 吴新宇

doi:10.14135/j.cnki.1006-3080.20250429001

基于异构感知和对比学习的图推理机制

黄如,
吴新宇

Research on Graph Reasoning Mechanism Based on Heterogeneous Perception and Contrastive Learning

HUANG Ru,
WU Xinyu

摘要

摘要: 传统图神经网络（GNN）通常局限于预设任务，依赖明确的标记数据和固定的训练机制，导致其在处理开放式任务时灵活性不足；现有图神经网络和大型语言模型的交叉领域（GNN-LLM）的研究多聚焦于同质化的文本属性图，未考虑到节点异质性问题。针对异构特征表征空间错位和开放域任务泛化的问题，本文提出了一个异构感知图学习联合对比学习的多跳图形推理机制：该模型基于元路径对称性解耦并重构异构子图，通过差异化注意力机制和层次化特征聚合算法，实现拓扑嵌入与语义表示的高效融合；针对模态对齐难题，采用渐进式阶段优化策略训练图形查询转换器，基于对比学习方法弥合模态差异，通过自监督的图文匹配建立细粒度特征关联，融合语言建模目标促使模型生成问题的有效回答。实验结果表明，该模型兼具预定义任务适配性与开放场景泛化性，在异构网络问答任务中展现出对未见问题的高质量推理能力。

Abstract: Heterogeneous graphs can mine complex correlations in graph data, which is of great significance in the real world. Traditional Graph Neural Network (GNN) is usually limited to preset tasks and relies on clear labeled data and fixed training mechanism, which leads to its lack of flexibility when dealing with open-ended tasks. The existing research of GNN-LLM mostly focuses on homogeneous text attribute graphs, and does not consider the node heterogeneity. Aiming at the problems of heterogeneous feature representation space misalignment and open-domain task generalization, a multi-hop graphical reasoning mechanism based on heterogeneous perceptual graph learning joint contrastive learning is proposed. The model decouples and reconstructs heterogeneous subgraphs based on meta-path symmetry, and realizes the efficient fusion of topological embedding and semantic representation through a differentiated attention mechanism and a hierarchical feature aggregation algorithm. Aiming at the problem of modal alignment, a progressive phase optimization strategy is used to train the graph query converter, and a contrastive learning method is used to bridge the modal differences. The fine-grained feature association is established through self-supervised image-text matching, and the language modeling goal is integrated to promote the model to generate effective answers to the question. Experimental results show that the model has both predefined task adaptation and open scene generalization, and shows high quality reasoning ability for unseen questions in heterogeneous network question answering tasks.

HTML全文

参考文献(29)

施引文献

资源附件(0)