Premium
Data processing and analysis in real‐world traditional Chinese medicine clinical data: challenges and approaches
Author(s) -
Liu Baoyan,
Zhou Xuezhong,
Wang Yinhui,
Hu Jingqing,
He Liyun,
Zhang Runshun,
Chen Shibo,
Guo Yufeng
Publication year - 2011
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.4417
Subject(s) - terminology , clinical trial , data science , data collection , computer science , data pre processing , data warehouse , data management , scale (ratio) , data quality , big data , data mining , medicine , metric (unit) , philosophy , linguistics , statistics , physics , mathematics , operations management , pathology , quantum mechanics , economics
Traditional Chinese medicine (TCM) is a clinical‐based discipline in which real‐world clinical practice plays a significant role for both the development of clinical therapy and theoretical research. The large‐scale clinical data generated during the daily clinical operations of TCM provide a highly valuable knowledge source for clinical decision making. Secondary analysis of these data would be a vital task for TCM clinical studies before the randomised controlled trials are conducted. In this article, we discuss the challenges and issues, such as structured data curation, data preprocessing and quality, large‐scale data management and complex data analysis requirements, in the data processing and analysis of real‐world TCM clinical data. Furthermore, we also discuss related state‐of‐the‐art research and solutions in China. We have shown that the clinical data warehouse based on the collection of structured electronic medical record data and clinical terminology would be a promising approach for generating clinical hypotheses and helping the discovery of clinical knowledge from large‐scale real‐world TCM clinical data. Copyright © 2011 John Wiley & Sons, Ltd.