Research Article | | Peer-Reviewed

A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine

Received: 28 October 2025     Accepted: 12 November 2025     Published: 11 December 2025
Views:       Downloads:
Abstract

Integrating artificial intelligence with traditional Chinese medicine (TCM) is a vital step towards the modernization and intelligent development of TCM. The aim of this article is to explore the current state of research and development in the annotation of multimodal data for TCM. First, the paper elucidates the background and core significance of multimodal data annotation. Then, it provides a systematic analysis of the various sources of TCM data, including textual materials such as classical texts and modern medical case records; imaging data such as facial, tongue, and ocular diagnoses; and signal data such as pulse and auscultation diagnoses. It also covers the technical characteristics of the annotation of these data types. Subsequently, the article explores detailed annotation methodologies tailored to various application scenarios. These include knowledge representation for classical Chinese medicine texts, in-depth annotation to capture the expertise of renowned traditional Chinese medicine practitioners, and standardized annotation for modern clinical information systems. Finally, this paper focuses on exploring the cross-disciplinary integration of global workspace theory (GWT) in cognitive neuroscience with multi-granularity computing and multimodal annotation in computer science. GWT offers an approach to understanding how diverse diagnostic information is synthesized into a coherent awareness, mirroring the process of TCM pattern differentiation. Multi-granularity computation is a methodological approach to processing this information through attention mechanisms at different levels. By integrating these advanced concepts with a practical framework for multimodal annotation, this paper aims to provide theoretical support and methodological insights for constructing intelligent diagnostic and therapeutic systems tailored to the cognitive characteristics of TCM.

Published in American Journal of Health Research (Volume 13, Issue 6)
DOI 10.11648/j.ajhr.20251306.17
Page(s) 343-351
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2025. Published by Science Publishing Group

Keywords

Traditional Chinese Medicine, Multimodal Fusion, Data Annotation, Review

References
[1] Yang, T., Ren, H. Y., Zhou, Z. J., et al. Issues and Challenges in the AI-Empowered High-Quality Development of Traditional Chinese Medicine. Journal of Nanjing University of Traditional Chinese Medicine. 2024, 40(12), 1285-1290. [In Chinese]
[2] China Information Association of Chinese Medicine. Specification for Annotation of Multimedia Resources in Chinese Medicine (T/CIATCM 114—2024) [Z].2024. [In Chinese].
[3] Lin, S. Y., Huang, H. W., Liu, C., et al. Cognitive mechanisms and multimodal research methodologies of traditional Chinese medicine diagnostics under the large language model perspective. Chinese Journal of Traditional Chinese Medicine (formerly China Journal of Traditional Chinese Medicine and Pharmacy). 2025, 40(1), 98-102. [In Chinese]
[4] Wang, P., Research on Text Classification for Chinese Medicine Knowledge. Master's Thesis, North China University of Science and Technology. Tangshan, 2021. [In Chinese]
[5] Wang, X. P., Study on clinical thinking of “causes tracing, diseases differentiation, syndromes differentiation” and the decision model of spleen and stomach diseases under the guidance of Traditional Chinese Medicine master Yu Ying'ao's Tongzhi prescriptions. Ph. D. Thesis, China Academy of Chinese Medical Sciences. Beijing, 2025. [In Chinese]
[6] Wang, S. Y., Yang, J., Tian, Y. Research Progress of Key Technologies of TCM Clinical Information Collection Based on TCM Data Science. Zhonghua Zhongyiyao Xuekan. 2024, 42(3), 10-18. [In Chinese]
[7] Li, H. Y., Li, C., Lang, X. F., et al. Analysis of the Research Status and Hot Spot of Intelligent Four-Diagnosis in TCM. Journal of Nanjing University of Traditional Chinese Medicine. 2022, 38(2), 180-186. [In Chinese]
[8] He, J., Application Value of Traditional Chinese Medicine Syndrome Differentiation Artificial Intelligence System Based on Bayesian Network in Tongue Diagnosis and Pulse Diagnosis. Journal of Qiqihar Medical University. 2019, 40(22), 2872-2873. [In Chinese]
[9] Song, X. Design and Implementation of a TCM Interrogation System Integrating Tongue Image Annotation. Master's Thesis, Dalian University of Technology, Dalian, 2020. [In Chinese]
[10] Yi, K., Zhang, M., Guo, S., et al. Research review on data annotation for intelligent pulse diagnosis devices in traditional Chinese medicine. Shanghai Journal of Traditional Chinese Medicine. 2024, 58(10), 5-10. [In Chinese]
[11] Xu, R., Lai, Y., Niu, T., Niu, X. Simulation of Digital TCM Auscultation Based on Deep Learning. Computer Simulation. 2025, 42(7), 525-528. [In Chinese]
[12] Xu, Z., Fu, Q., Jiang, Y., et al. Textual Research on the Origin and Evolution of Olfactory Diagnosis in Traditional Chinese Medicine. Modernization of Traditional Chinese Medicine and Materia Medica-World Science and Technology. 2025, 27(10), 2790-2799. [In Chinese]
[13] Sun, H., Li, B., Wang, R., Fu, Y. Discussion on Several Issues in the Annotation of Traditional Chinese Medicine Ancient Book Bibliographic Databases. Chinese Journal of Information on Traditional Chinese Medicine. 2007, 14(10), 103-104. [In Chinese]
[14] Wang, G., Du, J., Hao, Z., et al. POS Tagging and Feature Recombination for Ancient Prose of TCM Diagnosis. Computer Engineering and Design. 2015, 36(3), 835-841. [In Chinese]
[15] Liu, B., Du, J., Nie, B., et al. Part-of-speech Tagging of Traditional Chinese Medicine Diagnosis Ancient Prose Based on Second order HMM. Computer Engineering. 2017, 43(7), 211-216. [In Chinese]
[16] Lin, R. Knowledge Graph of Tang-Dynasty-version Treatise on Febrile Diseases Constructed Based on Ontology Method. Master's Thesis, China Academy of Chinese Medical Sciences. Beijing, 2021. [In Chinese]
[17] Lu, Y. M. Extracting and analyzing the fine-grained relationship of clinical experiences in ancient literature of traditional Chinese Medicine. Master's Thesis, Sichuan University. Chengdu, 2022. [In Chinese]
[18] Shi, W. Y. Clinical Multi-Modal Data for Famous Doctor Inheritance Integrated Acquisition and Analysis. M.S. Thesis, China Academy of Chinese Medical Sciences, Beijing, 2024. [In Chinese]
[19] Jiao, X. Y., Wang, Y., Yao, L., et al. Analysis of the Disease-Symptom-Syndrome-Medicine Association Rules in Medical Cases of Primary Dysmenorrhea Treated by National Traditional Chinese Medicine Master GUO Chengjie Based on Complex Networks. Journal of Traditional Chinese Medicine. 2025, 31(7), 150-157. [In Chinese]
[20] Li, J., Li, X. D., Mao, S. S., et al. Research on big data knowledge engineering of TCM clinical liver diseases. Chinese Journal of Integrated Traditional and Western Medicine on Liver Diseases. 2023, 33(5). [In Chinese]
[21] Wei, Z., Shu, H., Wang, Y. Research on Traditional Chinese Medicine Symptom Name Recognition Technology Based on Sequence Labeling. International Journal of Materials Science and Applications. 2022, 11(3), 66-75. [In Chinese]
[22] You, Z., Wang, Y., Shu, H. Construction of a Part-of-Speech Tagged Corpus for Traditional Chinese Medicine Syndrome Names. International Journal of Materials Science and Applications. 2017, 6(21), 177-178. [In Chinese]
[23] Baars B J. Global workspace theory of consciousness: toward a cognitive neuroscience of human experience [M]//LAUREYS S. Progress in Brain Research. Amsterdam: Elsevier, 2005, 150: 45-53.
[24] Baars B. J., Geld N., & Kozma, R. (2021). Global Workspace Theory (GWT) and Prefrontal Cortex: Recent Developments. Frontiers in Psychology, 12, 749868.
[25] VANRULLEN R, KANAI R. Deep learning and the Global Workspace Theory [J]. Trends in Neurosciences, 2021, 44(9): 692-702.
[26] DOSSA R F J, ARULKUMARAN K, JULIANI A, et al. Design and evaluation of a global workspace agent embodied in a realistic multimodal environment [J]. Frontiers in Computational Neuroscience, 2024, 18: 1352685.
[27] Marblestone A H, Wayne G, Kording K P. Toward an Integration of Deep Learning and Neuroscience [J]. Frontiers in Computational Neuroscience, 2016, 10: 94.
[28] VanRullen R. Perception Science in the Age of Deep Neural Networks [J]. Frontiers in Psychology, 2017, 8: 142.
[29] Jaegle A., Gimeno F., Brock A., Vinyals O., Zisserman A., Carreira J., et al. (2021b). “Perceiver: general perception with iterative attention,” in Proceedings of the 38th International Conference on Machine Learning (PMLR:), 4651-4664.
[30] Butlin P., Long R., Elmoznino E., Bengio Y., Birch J., Constant A., et al. (2023). Consciousness in artificial intelligence: insights from the science of consciousness. arXiv [preprint].
[31] Zhang, Q. H., Wang, G., Hu, J. Multi-granularity Knowledge Acquisition and Uncertainty Measure. Beijing: Science Press; Beijing, 2013.
[32] Wang, Y. Research on TCM Text Relation Extraction Based on Multi-granularity Information. Master's Thesis, North China University of Science and Technology, Tangshan, China, 2021. [In Chinese]
[33] Wang H., Pan H., Zhang K., He S., Chen C. M2FNet: multi-granularity feature fusion network for medical visual question answering Pacific rim international conference on artificial intelligence, Springer (2022), pp. 141-154.
[34] Xia H., Lu L., Song S. Feature fusion of multi-granularity and multi-scale for facial expression recognition Vis Comput (2023), pp. 1-13.
[35] LIU B, HUANG Y, LI S, et al. Bone age assessment by multi-granularity and multi-attention feature encoding [J]. Quantitative Imaging in Medicine and Surgery, 2024, 14(8): 5902-5914.
[36] Zhang, P., Zeng, P., Wang, et al. Intelligent Integration Empowers Objectification of "Four Diagnostic Methods Integration" in Chinese Medicine. Chinese Journal of Integrated Traditional and Western Medicine. 2025. [In Chinese]
[37] Qi, X., Ding, W., Ju, H., et al. Epilepsy Identification Method Based on Multi-modal Multi-grained Fusion Network. Journal of Data Acquisition and Processing. 2024, 39(3), 710-723. [In Chinese]
[38] Liu, X., Xiang, A., Du, Y. Multimodal Rumor Detection Method Based on Multi-granularity Emotional Features of Image-Text. Journal of Frontiers of Computer Science and Technology. 2025, 19(4), 1021-1035. [In Chinese]
[39] Ying, X., Zhu, Y., Chen, H., Man, F., Zhang, Z. A Multimodal Named Entity Recognition Method Based on Multi-Granularity Progressive Fusion. Application Research of Computers. 2025. [In Chinese]
[40] HUANG J S, QI X Y, CHENG X Y, et al. MMF-NNs: Multi-modal Multi-granularity Fusion Neural Networks for brain networks and its application to epilepsy identification [J]. Artificial Intelligence in Medicine, 2024, 157: 102990.
[41] YU Q S, SHAN J Y, MA J, et al. Multi-modal and Multi-view Cervical Spondylosis Imaging Dataset [J]. Scientific Data, 2025, 12: 1080.
[42] PARK H, YOUM C, CHEON S M, et al. Using machine learning to identify Parkinson’s disease severity subtypes with multimodal data. Journal of Neuro Engineering and Rehabilitation, 2025, 22: 126.
[43] Kang, P., Kang, Y., Bi, S., Zhao, F. Current Status of Unimodal and Multimodal Research on Identifying Depression Based on Inspection and Listening and Smelling Diagnosis. Information on Traditional Chinese Medicine. 2025. [In Chinese] [Online] Available from:
[44] Li, R., Xu, Q., Wu, Y., Liu, X. Construction of Multimodal Knowledge Graph Based on Materia Medica. Modern Computer. 2022, 28(13), 10-17. [In Chinese]
[45] Ma, L., Cai, H., Zhang, X., Fan, Y. Transformation and Upgrading of Chinese Medicine Industry in the Context of Digital Economy: Take Anguo Digital Chinese Medicine Capital as an Example. Journal of Hebei University of Technology (Social Sciences Edition). 2024, 16(3), 1-9.
[46] State Administration for Market Regulation, Standardization Administration of the People's Republic of China. Artificial intelligence—Code of practice for data labeling of machine learning. GB/T 42755-2023. Beijing, China: Standards Press of China; 2023.
[47] State Administration for Market Regulation, Standardization Administration of the People's Republic of China. Health informatics—Semantic network framework of traditional Chinese medicine language system. GB/T 38324-2019. Beijing, China: Standards Press of China; 2019.
[48] Alsaad R, Abd alrazaq A, Boughorbel S, et al. Multimodal Large Language Models in Health Care: Applications, Challenges, and Future Outlook [J]. JMIR Preprints, 2024, 26: e59505.
[49] Li, H., Wang, Y., Liu, X., et al. Large Language Models in Chinese Medicine: Potential Limitations. Modernization of Traditional Chinese Medicine and Materia Medica-World Science and Technology. 2025, 27(10).
Cite This Article
  • APA Style

    Yutong, D., Yuan, Y., Hongwei, Z. (2025). A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine. American Journal of Health Research, 13(6), 343-351. https://doi.org/10.11648/j.ajhr.20251306.17

    Copy | Download

    ACS Style

    Yutong, D.; Yuan, Y.; Hongwei, Z. A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine. Am. J. Health Res. 2025, 13(6), 343-351. doi: 10.11648/j.ajhr.20251306.17

    Copy | Download

    AMA Style

    Yutong D, Yuan Y, Hongwei Z. A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine. Am J Health Res. 2025;13(6):343-351. doi: 10.11648/j.ajhr.20251306.17

    Copy | Download

  • @article{10.11648/j.ajhr.20251306.17,
      author = {Du Yutong and Yao Yuan and Zhou Hongwei},
      title = {A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine},
      journal = {American Journal of Health Research},
      volume = {13},
      number = {6},
      pages = {343-351},
      doi = {10.11648/j.ajhr.20251306.17},
      url = {https://doi.org/10.11648/j.ajhr.20251306.17},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ajhr.20251306.17},
      abstract = {Integrating artificial intelligence with traditional Chinese medicine (TCM) is a vital step towards the modernization and intelligent development of TCM. The aim of this article is to explore the current state of research and development in the annotation of multimodal data for TCM. First, the paper elucidates the background and core significance of multimodal data annotation. Then, it provides a systematic analysis of the various sources of TCM data, including textual materials such as classical texts and modern medical case records; imaging data such as facial, tongue, and ocular diagnoses; and signal data such as pulse and auscultation diagnoses. It also covers the technical characteristics of the annotation of these data types. Subsequently, the article explores detailed annotation methodologies tailored to various application scenarios. These include knowledge representation for classical Chinese medicine texts, in-depth annotation to capture the expertise of renowned traditional Chinese medicine practitioners, and standardized annotation for modern clinical information systems. Finally, this paper focuses on exploring the cross-disciplinary integration of global workspace theory (GWT) in cognitive neuroscience with multi-granularity computing and multimodal annotation in computer science. GWT offers an approach to understanding how diverse diagnostic information is synthesized into a coherent awareness, mirroring the process of TCM pattern differentiation. Multi-granularity computation is a methodological approach to processing this information through attention mechanisms at different levels. By integrating these advanced concepts with a practical framework for multimodal annotation, this paper aims to provide theoretical support and methodological insights for constructing intelligent diagnostic and therapeutic systems tailored to the cognitive characteristics of TCM.},
     year = {2025}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - A Review of the Current State of Multimodal Data Annotation in Traditional Chinese Medicine
    AU  - Du Yutong
    AU  - Yao Yuan
    AU  - Zhou Hongwei
    Y1  - 2025/12/11
    PY  - 2025
    N1  - https://doi.org/10.11648/j.ajhr.20251306.17
    DO  - 10.11648/j.ajhr.20251306.17
    T2  - American Journal of Health Research
    JF  - American Journal of Health Research
    JO  - American Journal of Health Research
    SP  - 343
    EP  - 351
    PB  - Science Publishing Group
    SN  - 2330-8796
    UR  - https://doi.org/10.11648/j.ajhr.20251306.17
    AB  - Integrating artificial intelligence with traditional Chinese medicine (TCM) is a vital step towards the modernization and intelligent development of TCM. The aim of this article is to explore the current state of research and development in the annotation of multimodal data for TCM. First, the paper elucidates the background and core significance of multimodal data annotation. Then, it provides a systematic analysis of the various sources of TCM data, including textual materials such as classical texts and modern medical case records; imaging data such as facial, tongue, and ocular diagnoses; and signal data such as pulse and auscultation diagnoses. It also covers the technical characteristics of the annotation of these data types. Subsequently, the article explores detailed annotation methodologies tailored to various application scenarios. These include knowledge representation for classical Chinese medicine texts, in-depth annotation to capture the expertise of renowned traditional Chinese medicine practitioners, and standardized annotation for modern clinical information systems. Finally, this paper focuses on exploring the cross-disciplinary integration of global workspace theory (GWT) in cognitive neuroscience with multi-granularity computing and multimodal annotation in computer science. GWT offers an approach to understanding how diverse diagnostic information is synthesized into a coherent awareness, mirroring the process of TCM pattern differentiation. Multi-granularity computation is a methodological approach to processing this information through attention mechanisms at different levels. By integrating these advanced concepts with a practical framework for multimodal annotation, this paper aims to provide theoretical support and methodological insights for constructing intelligent diagnostic and therapeutic systems tailored to the cognitive characteristics of TCM.
    VL  - 13
    IS  - 6
    ER  - 

    Copy | Download

Author Information
  • Sections