Human health status can be measured on the basis of many different parameters. Statistical relationships among these different health parameters will enable several possible health care applications and an approximation of the current health status of individuals, which will allow for more personalized and preventive health care by informing the potential risks and developing personalized interventions. Furthermore, a better understanding of the modifiable risk factors related to lifestyle, diet, and physical activity will facilitate the design of optimal treatment approaches for individuals.
This study aims to provide a high-dimensional, cross-sectional data set of comprehensive health care information to construct a combined statistical model as a single joint probability distribution and enable further studies on individual relationships among the multidimensional data obtained.
In this cross-sectional observational study, data were collected from a population of 1000 adult men and women (aged ≥20 years) matching the age ratio of the typical adult Japanese population. Data include biochemical and metabolic profiles from blood, urine, saliva, and oral glucose tolerance tests; bacterial profiles from feces, facial skin, scalp skin, and saliva; messenger RNA, proteome, and metabolite analyses of facial and scalp skin surface lipids; lifestyle surveys and questionnaires; physical, motor, cognitive, and vascular function analyses; alopecia analysis; and comprehensive analyses of body odor components. Statistical analyses will be performed in 2 modes: one to train a joint probability distribution by combining a commercially available health care data set containing large amounts of relatively low-dimensional data with the cross-sectional data set described in this paper and another to individually investigate the relationships among the variables obtained in this study.
Recruitment for this study started in October 2021 and ended in February 2022, with a total of 997 participants enrolled. The collected data will be used to build a joint probability distribution called a Virtual Human Generative Model. Both the model and the collected data are expected to provide information on the relationships between various health statuses.