THE AI TODAY

arXiv:2605.03514v1 Announce Type: cross
Abstract: The remarkable success of large language models (LLMs) has motivated researchers to adapt them as universal predictors for various graph tasks. As a widely recognized paradigm, Graph-Tokenizing LLMs (GTokenLLMs) compress complex graph data into graph tokens and treat them as prefix tokens for querying LLMs, leading many to believe that LLMs can understand graphs more effectively and efficiently. In this paper, we challenge this belief: Do GTokenLLMs fully understand graph tokens in the natural-language embedding space? Motivated by this question, we formalize a unified framework for GTokenLLMs and propose an evaluation pipeline, GTEval, to assess graph-token understanding via instruction transformations at the format and content levels. We conduct extensive experiments on 6 representative GTokenLLMs with GTEval. The primary findings are as follows: (1) Existing GTokenLLMs do not fully understand graph tokens. They exhibit over-sensitivity or over-insensitivity to instruction changes, and rely heavily on text for reasoning; (2) Although graph tokens preserve task-relevant graph information and receive attention across LLM layers, their utilization varies across models and instruction variants; (3) Additional instruction tuning can improve performance on the original and seen instructions, but it does not fully address the challenge of graph-token understanding, calling for further improvement.
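
For readers unfamiliar with the graph-tokenizing paradigm, the sketch below illustrates the general idea of projecting a graph representation into an LLM's embedding space and prepending it as prefix tokens before the instruction text. This is a minimal illustration under assumed shapes, not the paper's implementation: the class name GraphTokenPrefix, the single pooled graph embedding, the linear projector, and all dimensions are hypothetical choices for clarity.

import torch
import torch.nn as nn

class GraphTokenPrefix(nn.Module):
    """Sketch: map a pooled graph embedding to a fixed number of
    'graph tokens' in the LLM embedding space and prepend them to
    the embedded instruction. Names and shapes are illustrative."""

    def __init__(self, graph_dim: int, llm_dim: int, num_graph_tokens: int = 8):
        super().__init__()
        # One learned projection producing all graph-token slots at once.
        self.projector = nn.Linear(graph_dim, num_graph_tokens * llm_dim)
        self.num_graph_tokens = num_graph_tokens
        self.llm_dim = llm_dim

    def forward(self, graph_embedding: torch.Tensor,
                text_token_embeddings: torch.Tensor) -> torch.Tensor:
        # graph_embedding: (batch, graph_dim) pooled graph representation
        # text_token_embeddings: (batch, seq_len, llm_dim) embedded instruction
        batch = graph_embedding.size(0)
        graph_tokens = self.projector(graph_embedding).view(
            batch, self.num_graph_tokens, self.llm_dim)
        # Prepend graph tokens so the LLM attends to them as prefix tokens.
        return torch.cat([graph_tokens, text_token_embeddings], dim=1)

if __name__ == "__main__":
    # Toy shapes only; a real GTokenLLM would pair a graph encoder with an LLM.
    prefixer = GraphTokenPrefix(graph_dim=128, llm_dim=4096)
    g = torch.randn(2, 128)          # pooled graph embeddings
    t = torch.randn(2, 32, 4096)     # embedded instruction tokens
    inputs_embeds = prefixer(g, t)   # (2, 8 + 32, 4096), fed to the LLM
    print(inputs_embeds.shape)

The instruction transformations that GTEval applies would, in this setup, change only the text side of the input; whether the model's predictions shift appropriately (or remain stable) under those changes is what the paper uses to probe graph-token understanding.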

