Text-Guided Token Communication for Wireless Image Transmission

Bole Liu*, Li Qiao, Ye Wang, Zhen Gao, Yu Ma, Keke Ying, Tong Qin

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

With the emergence of 6G networks and proliferation of visual applications, efficient image transmission under adverse channel conditions is critical. We present a text-guided token communication system leveraging pre-trained foundation models for wireless image transmission. Our approach converts images to discrete tokens, applies 5G NR polar codec on top of the tokenizeation, and employs text as a conditioning signal to generate lost tokens to mitigate the cliff effect at lower signal-to-noise ratios (SNRs). Evaluations on ImageNet show our method outperforms state-of-the-art deep joint source-channel coding scheme in perceptual quality and semantic preservation at extremely low bandwidth ratio, i.e., 1/96. In addition, Our system requires no scenario-specific retraining and exhibits superior cross-dataset generalization, establishing a new paradigm for efficient image transmission aligned with human perceptual priorities.

源语言英语
主期刊名2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331544447
DOI
出版状态已出版 - 2025
已对外发布
活动2025 IEEE/CIC International Conference on Communications in China, ICCC 2025 - Shanghai, 中国
期限: 10 8月 202513 8月 2025

出版系列

姓名2025 IEEE/CIC International Conference on Communications in China:Shaping the Future of Integrated Connectivity, ICCC 2025

会议

会议2025 IEEE/CIC International Conference on Communications in China, ICCC 2025
国家/地区中国
Shanghai
时期10/08/2513/08/25

指纹

探究 'Text-Guided Token Communication for Wireless Image Transmission' 的科研主题。它们共同构成独一无二的指纹。

引用此