Text-Guided Token Communication for Wireless Image Transmission

Bole Liu*, Li Qiao, Ye Wang, Zhen Gao, Yu Ma, Keke Ying, Tong Qin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

With the emergence of 6G networks and the proliferation of visual applications, efficient image transmission under adverse channel conditions is critical. We present a text-guided token communication system that leverages pre-trained foundation models for wireless image transmission. Our approach converts images to discrete tokens, applies a 5G NR polar codec on top of the tokenization, and uses text as a conditioning signal to regenerate lost tokens, mitigating the cliff effect at lower signal-to-noise ratios (SNRs). Evaluations on ImageNet show that our method outperforms a state-of-the-art deep joint source-channel coding scheme in perceptual quality and semantic preservation at an extremely low bandwidth ratio of 1/96. In addition, our system requires no scenario-specific retraining and exhibits superior cross-dataset generalization, establishing a new paradigm for efficient image transmission aligned with human perceptual priorities.
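
To make the 1/96 figure concrete, the sketch below works out the channel-use budget for one image. It assumes the convention common in the deep joint source-channel coding literature, where the bandwidth ratio is the number of channel uses divided by the source dimension (height x width x colour channels). The abstract does not state the definition, the image resolution, or the modulation, so the 256x256 RGB input and the 2 bits per symbol (QPSK) used here are illustrative assumptions, and the function names are hypothetical.

```python
from fractions import Fraction


def channel_uses(height: int, width: int, channels: int, ratio: Fraction) -> int:
    """Channel uses available for one image at the given bandwidth ratio.

    Assumes ratio = channel uses / (height * width * channels).
    """
    source_dim = height * width * channels
    return int(source_dim * ratio)


def coded_bit_budget(uses: int, bits_per_symbol: int) -> int:
    """Coded bits that fit into the budget for a given modulation order."""
    return uses * bits_per_symbol


# Assumed example: a 256x256 RGB image at bandwidth ratio 1/96.
uses = channel_uses(256, 256, 3, Fraction(1, 96))
# Assumed modulation: QPSK, i.e. 2 coded bits per channel use.
bits = coded_bit_budget(uses, 2)
print(uses, bits)
```

Under these assumptions the whole image, including channel-coding redundancy, must fit into a few thousand coded bits, which is why a compact discrete token representation is attractive at this ratio.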

Original language: English
Title of host publication: 2025 IEEE/CIC International Conference on Communications in China: Shaping the Future of Integrated Connectivity, ICCC 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798331544447
Publication status: Published - 2025
Externally published: Yes
Event: 2025 IEEE/CIC International Conference on Communications in China, ICCC 2025 - Shanghai, China
Duration: 10 Aug 2025 to 13 Aug 2025

Publication series

Name: 2025 IEEE/CIC International Conference on Communications in China: Shaping the Future of Integrated Connectivity, ICCC 2025

Conference

Conference: 2025 IEEE/CIC International Conference on Communications in China, ICCC 2025
Country/Territory: China
City: Shanghai
Period: 10/08/25 to 13/08/25

Keywords

  • 6G networks
  • cross modality
  • foundation models
  • generative semantic communications
  • Token communication
