Health octo tool matches personalized health with rate of aging

Within each organ system, we defined disease states, and their severities based on accepted medically pre-defined disease criteria (Table S1), using combinations of medical history, medical laboratory and physical examinations. The related health questions are listed in the Table S2, and the disease definitions and their levels are also summarized in Table S1. The organ systems considered were cardiovascular (CV), renal (Re), metabolic (Me), gastrointestinal and liver (GL), respiratory (Res), Thyroid (Th), hematopoietic (He), oral health [i.e., periodontitis (Pe)], musculoskeletal (MS), sensory (Se), and central nervous system (CNS).
Body organ disease number (BODN)
BODN was determined as the number of organs with at least one disease, serving as a new measure of system morbidity. Pathology at the specific organ level was established based on predefined disease/impairment criteria, either deviating from normal or as used for disease diagnoses in clinical practice (Table S1). For instance, the cardiovascular system encompasses diseases such as hypertension [defined using systolic and diastolic blood pressure above the cut-points], ischemic heart disease [clinically defined as developing chest pain when exercising and other related symptoms identified on the Electrocardiogram (ECG)], peripheral artery disease (measured by ankle-brachial index), different arrhythmias (based on ECG), and congestive heart failure (based on signs and symptoms such as shortness of breath, low ejection fraction value [<40], and other medical signs and symptoms), each varying in severity. We considered an organ system to have morbidity if it exhibited at least one of the organ-specific diseases or deviation from normal health (subclinical states in some of diseases like sub-clinical hypothyroidism). It is important to note that disease diagnosis in the medical field relies on a combination of factors including medical history, physical examination, organ-specific laboratory tests, and ongoing prescribed treatments. The questions and medical examinations are summarized in Tables S1 and S2. Therefore, BODN is derived by combining all this clinical information. The definition of BODN, reflecting progressive system morbidities, using the 11 organ systems was then extended to include cerebrovascular accidents (CA), and cancer (Can) (Fig. 1, S1, Tables S1, S2), because CA involves multiple mechanisms, such as hypertension, coagulopathy and genetics, and cancers have both shared and distinct mechanisms across organs. Therefore, we considered them as separate system entities from other organ systems.
Since organ systems operate as interconnected parts of a whole, their pathologies do not occur in isolation but progressively, with uneven transitions from one to the another, often across different systems. Considering these transitional states, BODN was considered as a progressive arithmetic measure, characterized by variable intervals between consecutive values. This design allows for an ordinal scale with cumulative family28. In Bayesian statistical models, cumulative family allows cut-points to be introduced to the continuous latent BODN using probability information from the data, where differing disease severities or stages contribute to these irregularly spaced cut-points, enabling quantification of variable contributions of disease severity or levels to its non-equidistant values. We employed the concepts of ordinal outcome with cumulative family in a Bayesian framework to model BODN28. Moreover, ordinal numbers were assigned to the disease levels as covariates of the models, representing the severity of each organ-specific pathology, serving as lagged predictors of BODN (e.g., no hypertension, hypertension with no treatment, and hypertension with treatment would be coded 1, 2, and 3, respectively; Table S1). A monotonic effect function described in Bayesian statistics was used to capture the maximum effect of disease as well as level-specific effects. This approach allows for detailed and nuanced quantification of disease effects on BODN29.
Health octo tool metrics
Through post hoc analyses using the posterior prediction function, we derived predicted values of BODN from the models that included organ-specific diseases as covariates and created Bodily System-Specific Clocks (BSCs). By considering all diseases collectively, we developed the Body Clock, a measure of health entropy that is obtained from prediction of the model, where all disease burdens were covariates to predict BODN over time without adding chronological age. Using Gamma distribution models with a log link function, we regressed chronological age against BSCs and the Body Clock to obtain Bodily System-Specific Age and Body Age, as proxies for organ-specific and whole-body systems rates of aging, respectively (Table S3). Walking speed is one of the phenotypes of aging most related to health in older persons. To understand the effect of whole bodily systems’ health on walking speed, we estimated the effect of Body Clock on walking speed and the predicted value is called Speed-Body Clock. Capturing function and whole system aging heterogeneity, the chronological age was regressed over Speed-Body Clock to obtain Speed-Body Age.
We developed a Disability Index (DI) by applying the Zero Inflated Beta Binomial (ZIBB) approach in a probability theory framework30 (Formula [S1] and [S2] in the Supplemental Methods), combining information on physical and cognitive function, falls, and urinary or bowel incontinence (Table S4). Regressing DI against the Body Clock allowed us to understand how body health entropy predicts DI and obtained the Disability-Body Clock from the predictive value of the model. Regressing chronological age over Disability-Body Clock using Gamma distribution family, its age estimator is Disability Body Age—a proxy for the rate of aging that incorporates bodily system health and disability states. Together, the four sets of clocks and their corresponding rates of aging comprise the Health Octo Tool (Fig.1). The equations and description of each model are summarized in Table S3.
We performed longitudinal data analyses in the Baltimore Longitudinal Study of Aging (BLSA)31,32 and replicated the analyses using the Invecchiare in Chianti (InCHIANTI) study33. Cumulative multilevel ordinal regression was employed to estimate the posterior coefficient values for lagged diseases contributing to BODN28,30,34,35. To manifest increasing weight of health entropy compared to chronological age, various models were developed, including those for chronological age, single diseases, single system diseases, multiple-system diseases, and global-system entropy, including all systems as co-variates. Model performance was assessed using leave-one-out cross-validation (LOO-CV)36, and the model weights were compared with the chronological age model as a reference using “stacking” in the Bayesian framework to evaluate model performance37.
Validation assessment using BLSA parameters as a training set applied to InCHIANTI and NHANES data as test sets
In the Bayesian framework, employing parameters from the training data (BLSA) and applying them to new data provides out-of-sample predictive accuracy. The parameters of the BLSA entropy model, fitted with BLSA data, were tested by making predictions for individuals in the InCHIANTI study(n = 986, women = 551) and subsequently for 40,700 participants in the National Health and Nutrition Examination Survey (NHANES) data from 2003 to 201838. We compared the observed values with the predicted Body Clock using the visualization function pp_check from the brms package. Additionally, we examined the relationship between the Body Clock derived from the replicated analysis of InCHIANTI and the validated version obtained by applying BLSA data parameters to the InCHIANTI data.
The characteristics of the BLSA study by age group are summarized in Table S5. The schematic representation of BODN systems and combinations of systems’ morbidities are shown in online interactive graphs (Body Organ Disease Patterns (shinyapps.io), S1, and S2 (Supplemental Results). The graphs illustrate a wide heterogeneity in system combinations within BODN, which increases with age. This pattern aligns with health entropy, characterized by the random expansion of health deterioration.
To demonstrate that entropy increases with the addition of diseases and affected systems, we initiated the models with longitudinal BODN as the ordinal outcome and lagged single diseases as ordinal covariates (predictors/features). We then sequentially added diseases within a single organ system (organ-specific diseases are defined in Table S1), multi-system diseases, and, ultimately, whole entropy, incorporating all organ-system diseases as covariates.
Analyses of the BLSA data and then InCHIANTI as replication revealed significant and varied effects of single diseases and organ system-specific diseases on the accumulation of system multimorbidity, as expressed by the longitudinal BODN (Table S6). Comparing the effects of organ system-specific diseases to those of individual diseases in the regression models, we found that, in most cases, single diseases had a greater contribution to BODN than the aggregated organ systems’ diseases. For example, in the BLSA study, peripheral artery disease (PAD) contributed to BODN with posterior estimate [b]=3.43, 95% CI: 2.05–4.8, while the cardiovascular (CV) system’s contribution was b = 1.21, 95% CI: 0.03–2.46 (Table S6A–B). Similarly, in the InCHIANTI data, the effect size of PAD as a single disease was b = 1.56, 95% CI: 1.06–2.06 (Table S6G–H), which was attenuated within the organ system (b = 0.8, 95% CI: 0.8–1.26) (Table S6G–H), indicating the link between diseases within the same organ system and potential shared mechanisms. Separately, we analyzed the effect of chronological age on BODN, and it emerged as a robust predictor of BODN (BLSA, time-1: b = 0.20 ± 0.01, 95% CI = 0.18–0.22; time-2: b = 0.24 ± 0.01, 95% CI = 0.21–0.26; InCHIANTI: b = 0.14, 95% CI = 0.12–0.14). Comparing model performance weights, called average model weight stacking (Supplemental Methods), indicated that the model including the chronological age as only predictor, the age-only model, outperformed individual disease or single organ-system models (Fig. 2A–D; Table S7), highlighting the necessity of developing models that explain the variability of health entropy with aging beyond what is explained by a single disease. On the other hand, these findings underscore that chronological age still strongly affects the deterioration of whole body health in a way that cannot be simply captured by single pathological or physiological measures.

Model weights for single diseases in the BLSA at time 1 (A) and time 2 (B), single systems (C, D), and multisystem (E, F) are compared to the corresponding age-only model weights. The data illustrate that the weight of single systems surpasses that of single diseases. As the number of systems increases, indicating a rise in entropy, the weight of multisystem models significantly surpasses that of chronological age, enabling a more optimal model for the prediction of longitudinal BODN. Sample size n = 907, Women = 451. The detailed weights are reported in Table S7.
Multiple systems with morbidity outperform chronological age in predicting longitudinal BODN
Stepwise analyses that included multiple-systems diseases with increasing entropy to predict longitudinal BODN compared to chronological age alone substantially and significantly maximized and explained higher levels of entropy. This was evident in the superior model performance (larger ELPD: expected log pointwise predictive density) and greater model weights compared to chronological age-only models (Fig. 2E, F, Table S7). The higher ELPD indicates better predictive performance, as the model is more accurate in predicting observations. Furthermore, the full models, including all systems, outperformed chronological age in predicting longitudinal BODN in both the BLSA and InCHIANTI datasets (Fig. 2E, F, Table 1, Table S7). This suggests that capturing the health state of all organ systems as a comprehensive measure, referred to as whole-body entropy, can serve as a proxy for intrinsic aging.
In the BLSA dataset, incorporating all body disease burdens at time-2 provided a better prediction of BODN compared to chronological age (Fig. 2F). The full entropy model at time-2 exhibited higher model weights and ELPD than the time-1 model, indicating that the predictive power of intrinsic biological age increases with advancing chronological age (Table 1).
Bodily System-Specific Clocks (BSC) and Bodily System-Specific Ages (BSA)
Given the diverse contribution of different organ systems to the BODN for each person, our study further explored heterogeneous organ-specific intrinsic aging as clocks, when organ-specific diseases are predictors of BODN, and rates of aging, when chronological age is regressed over such clocks. We propose the concept of Bodily System-Specific Clocks (BSC) and Bodily System-Specific Age (BSA) to underline the idea that human health is shaped by the entropic accumulation of damage in cellular structures and functions that act at the overall body organ system level (entropy that drives the rate of organ system aging, captured by BSA), as well as predisposition to specific organ pathology due to genetic susceptibility or specific environmental exposures (captured by BSC). Indeed, examination of correlations among the 13 organ-system BSAs revealed varying degrees of correlation between each pair of BSA values, indicating disparate aging rates among organ systems and suggesting that organs do not age synchronously (Fig. S3A, B).
The ordinal cumulative regression model was built using weak priors (in the Bayesian formula, the posterior coefficient is quantified via multiplication of prior knowledge on the coefficient by the likelihood of the coefficients, Supplemental Methods). The Bayesian posterior inference model was sampled using a Markov chain Monte Carlo approach (Supplemental Methods)30. In the BLSA study, BSA of several systems, including kidney, sensory, cardiovascular, musculoskeletal, metabolic, stroke, CNS, GI, liver, and thyroid, were consistently older than chronological age and even exceeded the reported maximum chronological age (Fig. 3). These results emphasize a biological aging process at the organ level that extends beyond 125 years, further suggesting that the emergence of pathology in human life results from the accelerated aging of specific organ systems. Interventions to delay aging may focus on reestablishing harmonious aging across different systems (Fig. 3). Notably, a similar trend emerged in the InCHIANTI dataset, confirming the replication of this approach (Fig. S3C).

The graph reveals that BSA for cardiovascular (CV), renal (Re), Res, CNS, and MS systems can exceed 120, surpassing the maximum reported human chronological age. System abbreviations same as Fig. 1. Sample size n = 907, Women = 451.
To assess the degree to which each disease level contributes to BODN, we quantified posterior coefficient estimates of each disease level and 95% credible intervals using a monotonic effect in the Bayesian approach. The detailed monotonic effect function is described in the Supplemental Methods29. In both BLSA and InCHIANTI datasets, collectively including all diseases of organ systems led to a heterogeneous and significant contribution to BODN. However, for some organ-system diseases, such as peripheral artery disease, hyperthyroidism, thrombocytopenia, adult-onset asthma, and Parkinson’s disease, there was substantial uncertainty in the estimate, reflected by wider 95% credible intervals, possibly because the prevalence of these conditions in the population is relatively low (Fig. 4A, B, Fig. S4 A, B, Table S6C, S6F, I). Interestingly, milder states of organ system diseases (e.g., transitional ischemic attack, impaired glucose tolerance, mild anemia, subclinical hypothyroidism, cataract, mild liver disease, gingivitis without edentulous, osteopenia, mild osteoarthritis, and poor hearing that improves with a hearing aid) had larger estimates contributing to longitudinal BODN compared to more severe states of these diseases (Figs. 4A, B, S4 A&B, Table S6C, F, I). Similarly, diseases without pharmacological treatment (e.g., hypertension [HTN], type-2 diabetes mellitus, hyperlipidemia, chronic bronchitis, gastrointestinal disease, depression, and Parkinson’s) had larger posterior coefficient estimates for predicting BODN (Fig. S4A, B Table S6). For example, HTN without treatment was a stronger contributor to BODN (b = 0.32, 95% CI: 0.23–0.39) than HTN with treatment (b = 0.17, 95% CI:0.13–0.22). Similarly, congestive heart failure with preserved ejection fraction, a common type of heart failure in older adults, had a larger posterior coefficient estimate (b = 0.22, 95% CI:0.12–0.23) compared to the congestive heart failure with a low ejection fraction (b = 0.09, 95% CI: 0.05–0.13). Arrhythmias, such as sinus bradycardia, elongated QTc, and atrial fibrillation, also significantly affected BODN (Fig. S4A, B; Table S6C, F, I). Stage-1 age-related chronic kidney disease (CKD), decoupled from diabetic kidney failure, exhibited stronger incorporation into BODN (b = 0.53, 95% CI: 0.38–0.71) compared to stage-2 (b = 0.07, 95% CI:0.05–0.10) and stage-3 CKD (b = 0.1,95% CI:0.05–0.10), as well as end-stage renal disease (b = 0.20, 95% CI:0.15–0.28). Similar results were observed in the InCHIANTI study for stage-1 CKD (Table S6). These patterns were also observed in the time-2 full model BLSA, where hyperthyroidism and Parkinson’s significantly incorporated into BODN (Fig. S4A, B, Table S6). Overall, as pointed out above, model assessments using LOO-CV approach indicated that full entropy models, incorporating all disease severities, with larger ELPD (Table 1) had the best performance in predicting longitudinal BODN (Fig. S5). These findings suggest that the overall health assessment of an individual should consider pathology across a wide range of severities in all organ systems.

The lagged full-model estimates reveal heterogenous contributions of diseases into BODN. A Maximum contribution of diseases into longitudinal BODN at time 1. B Maximum contributions of diseases into longitudinal BODN at time 2. Larger posterior estimates, accompanied by narrower 95% credible intervals (CI), yield more accurate predictions. The maximum effects of most diseases increase with time. Significance is determined when the 95% CI does not include 0. Disease Abbreviations: HTN hypertension, IHD ischemic heart disease, CHF congestive heart failure, Arr arrhythmia, PAD peripheral artery disease, CKD chronic kidney disease, DM diabetes mellitus, GID gastrointestinal disease, COPD chronic obstructive pulmonary disease, Hypoth hypothyroidism, Hyperth hyperthyroidism, OA osteoarthritis, Osteop osteoporosis, Periodon periodontal disease, Dep depression, Park Parkinson’s disease, CI cognitive impairment. System abbreviations: same as for Fig. 1. For the values of mean posterior estimates and 95% CI, please see the values in Table S6C, F for full models. Sample size n = 907, Women = 451.
Interestingly, there was a slight discrepancy between the BLSA and InCHIANTI results. In the BLSA, peripheral artery disease (PAD) showed significant incorporation into BODN either as a single disease or as part of a single-organ system, but its significance was not retained in the full model; while in the InCHIANTI study, PAD retained its significance in the full model, albeit with a smaller magnitude (Fig. S4A, B, Table S6C, F, I). These findings suggest a potential shared pathophysiology among diseases within the cardiovascular system. This conclusion was further supported by comparing model weights using Bayesian Stacking37 (Supplemental Method), which revealed that model weights of 49.7%, 48.3%, and 2% for HTN, congestive heart failure (CHF), and arrhythmia, respectively, while the weights for PAD and ischemic heart disease (IHD) models were zero, indicating a higher degree of shared pathophysiology among certain cardiovascular-related diseases. In the InCHIANTI study, the weights for HTN, CHF, and arrhythmia were 58%, 41%, and 1%, respectively.
Personalized Body Clock and Body Age
We developed the individualized Body Clock by extracting the predicted value of BODN from the post hoc analysis of the multilevel ordinal regression model that included all organ systems diseases and their severities for each individual. Briefly, the Eqs. (1) and (2) in the brms R package are as follows, where mo stands for the monotonic function for the ordinal predictor:
$$trial\_id= (trial\sim ({{\rmtrial}})+{{\rmid}}({{\rm}}) \\ +{{\rm}}({{\rm{IschemicHeartDisease}}})+{{\rm{mo}}}({{\rm{Arrhythmia}}})+{{\rm{mo}}}({{\rm{Kidney}}}) \\ +{{\rm{mo}}}({{\rm{Diabetes}}})+{{\rm{mo}}}({{\rm{Hyperlipidemia}}})+{{\rm{PrepheralArteryDisease}}} \\ +{{\rm{mo}}}({{\rm{Stroke}}})+{{\rm{mo}}}({{\rm{Anemia}}})+{{\rm{Thrombocytopnia}}} \\ +{{\rm{mo}}}({{\rm{GastrointestinalDisease}}})+{{\rm{mo}}}({{\rm{Liver}}})+{{\rm{mo}}}({{\rm{COPD}}})+{{\rm{Asthma}}} \\ +{{\rm{mo}}}({{\rm{OralHealth}}})+{{\rm{mo}}}({{\rm{Hypothyroisism}}})+{{\rm{Hyperthyroisism}}} \\ +{{\rm{mo}}}({{\rm{OsteoArtheristis}}})+{{\rm{mo}}}({{\rm{Osteoporosis}}})+{{\rm{mo}}}({{\rm{Hearing}}})+{{\rm{mo}}}({{\rm{Eye}}}) \\ +{{\rm{mo}}}({{\rm{Depression}}})+{{\rm{mo}}}({{\rm{sParkinsons}}})+{{\rm{mo}}}({{\rm{Cognition}}})+{{\rm{Cancer}}} \\ +{{\rm{yrs}}}+(1+1|{{\rm{id}}}),{{\rm{data}}}={{\rm{BLSA}}})$$
(1)
$${{\rm{Body}}}\; {{\rm{Clock}}}\_{{\rm{BLSA}}}={{\rm{posterior}}}\_{{\rm{predict}}}({{\rm{fit}}}\_{{\rm{BLSA}}},{{\rm{data}}}={{\rm{BLSA}}})$$
The above equation was applied to the InCHIANTI data for replication, and the predicted model Eq. (2) is:
$${{\rm{Body}}}\; {{\rm{Clock}}}\_{{\rm{InCHIANTI}}}={{\rm{posterior}}}\_{{\rm{predict}}}({{\rm{fit}}}\_{{\rm{InCHIANTI}}},{{\rm{data}}}={{\rm{InCHIANTI}}})$$
(2)
Overall, the median Body Clock in the BLSA dataset was 6.1 (range: 2.1–11.5) and for InCHIANTI was 6.6 (range: 2.4–11.1). It is worth noting that the BLSA enrolls individuals who are very healthy at baseline, whereas the INCHIANTI dataset is population-based and primarily includes individuals over the age of 50. In the InCHIANTI study, women 65 to 85 years old exhibited a higher Body Clock value than men, with measurements of 7.1 ± 1.1 compared to 6.8 ± 1.2, respectively. However, in the BLSA study, there were no sex differences in Body Clock values.
We used the monotonic effect to assess severity-specific coefficients and derive the latent maximum disease severity effects, which form a linear scale from minimum to maximum effect, to obtain the Body Clock at the individual level (Supplemental Methods). Furthermore, individualized trajectory plots for Body Clock vividly demonstrate diverse trajectories among individuals of the same chronological age. These trajectories exhibit varying slopes and magnitudes, underscoring the unique maximum intrinsic aging experienced by different individuals (Fig. S6A, B). Additionally, age was categorized into the following groups: <45, 45–54, 55–64, 65–74, 75–84, and 85 and older. The Body Clock increases with each successive 10-year age interval, and its values are heterogeneously distributed within each age group, as depicted in both the BLSA and InCHIANTI studies (Fig. S7A, B). These findings emphasize the diversity of the aging process. Subsequently, the chronological age was regressed over the Body Clock using a Gamma distribution family and a log link function, allowing the fitting of non-linear, exponential points into the models (equations [3] and [4]).
$${{\rm{Fi}}}{{{\rm{t}}}}_{{{\rm{Age}}}}={{\rm{Chronological\; Age}}} \sim {{\rm{Body\; Clock}}},{{\rm{data}}}={{\rm{BLSA}}},\,$$
(3)
$${{\rm{Body}}}\; {{\rm{Age}}}={{\rm{posterio}}}\; {{\rm{predict}}}({{\rm{Fi}}}{{{\rm{t}}}}_{{{\rm{Age}}}})$$
(4)
The same formula was applied to the InCHIANTI data.
We obtained the post hoc prediction for each individual using posterior predict, termed Body Age (Eq. 4, Fig. S7C, D). At the population level, given the constant value of 3.78 and the mean posterior estimate of 0.12 obtained from the Bayesian Gamma distribution model with BLSA, where chronological age was regressed over Body Clock, if an individual has a Body Clock value of 9, the Body Age would be 129 (calculated as e (3.78+0.12×9) = 129.02, e is the natural logarithm). However, it is crucial to avoid generalizing the population mean posterior estimate, rather Body Age should be quantified for an individual, obtained from the prediction model using individual-specific posterior estimates, to ensure accurate predictions for precision-medicine decisions.
Speed-Body Clock and Speed-Body Age
Walking speed serves as a fundamental measure of health commonly utilized as an important risk factor, biomarker, and outcome at the old age. Walking speed was recorded (in meters per second) over 6 meters in the BLSA and 7 meters in the InCHIANTI studies. To assess the influence of the Body Clock on walking speed, we predicted longitudinal changes of walking speed by the Body Clock, adjusting for height using a Gaussian regression and modeled the walking speed variance (sigma) over the Body Clock to develop another health metric that we called Speed-Body Clock (equation [5] and [6]; Fig. S8 A, B).
$${{\rm{Fit}}}\_{{\rm{BLSA}}}={{\rm{Walking}}}\; {{\rm{Speed}}} \sim {{\rm{Body}}}\; {{\rm{Clock}}}+{{\rm{height}}},{{\rm{data}}}={{\rm{BLSA}}}$$
(5)
$${{\rm{Speed}}}-{{\rm{Body}}}\; {{\rm{Clock}}}={{\rm{Postrior}}}\; {{\rm{predict}}}({{\rm{Fit}}}\_{{\rm{BLSA}}})$$
(6)
The same formula was applied to the InCHIANTI study for replication.
Because of the inverse relationship between the Body Clock and walking speed, a smaller Speed-Body Clock indicates a negative impact of system’s health on physical function. In the BLSA study, the Speed-Body Clock was lower in women compared to men, even after adjustment for height, with values of 1.09 ± 0.1 for women and 1.15 ± 0.1 for men. At the population level, for each unit increase in the Body Clock, the posterior coefficient of walking speed decreased by 0.062 m/s (0.065–0.062) in men and 0.064 m/s (0.063–0.061) in women (Fig. S8A). In the InCHIANTI study, a one-unit increase in Body Clock corresponded to a decrease in the posterior coefficient of walking speed by -0.03 m/s in both sexes. Again, women exhibited a smaller Speed-Body Clock than men, with values of 1.29 ± 0.15 for women and 1.56 ± 0.14 for men (Fig. S8B). For BLSA, the mean Speed-Body Clock was significantly different between age groups so that the youngest group (<45 years old) with a 1.26 value had the largest, and the oldest group with 1.01 value had the smallest Speed-Body Clock. Similar results were observed in the InCHIANTI study with 1.55 vs. 1.29 values for the youngest and oldest groups, respectively. These findings indicate that bodily system health entropy (Body Clock) can contribute to a decline in walking speed, which accelerates after age 65.
Additionally, individuals of the same chronological age may have different rates of aging in terms of how the Body Clock affects walking speed. This heterogeneity in the aging rate is captured by regressing chronological age on the Speed-Body Clock using a Gamma distribution and the predicted value was denoted as Speed-Body Age (equation [7] and [8]; Fig. S8 C, D).
$${{\rm{Fit}}}\_{{\rm{BLSA}}}={{\rm{Age}}} \sim {{\rm{Spreed}}}-{{\rm{Body}}}\; {{\rm{Clock}}}+{{\rm{Height}}},{data}={BLSA}$$
(7)
$${{\rm{Speed}}}-{{\rm{Body}}}\; {{\rm{Age}}}={{\rm{psoterior}}}\; {{\rm{predict}}}({{\rm{Fit}}}\_{{\rm{BLSA}}})$$
(8)
The same formula was applied to the InCHIANTI data for replication.
With a one unit increase in the Speed-Body Clock, the Speed-Body Age decreases in both BLSA (intercept=5.34, b = −0.99, 95% CI (−1.06 to −0.95) and InCHIANTI (intercept = 4.8, b = −0.43, 95% CI = −0.48 to −0.39). That is, using a Gamma distribution interpretation, we predicted that a Speed-Body Clock of 0.8 resulted in a speed Speed-Body Age of e(4.8-0.43*0.8) = 86.14 in InCHIANTI and 94.44 in BLSA. The difference between these estimates may be explained by the healthier people enrolled in BLSA, while InCHIANTI is a population-based sample. A larger Speed-Body Age represents resilience to functional or system function decline and can reflect aging heterogeneity. Note that we relied on individual-based metrics using a Bayesian approach that can be obtained from prediction models rather than the population level metrics and thus reduced the chance of bias due to population variability. In the InCHIANTI study, the Speed-Body Age could extend up to 105 units, while in the BLSA, the maximum Speed-Body Age did not surpass 97 units. Despite the smaller average population state within InCHIANTI, some individuals manifested resilience indicated by a higher Speed-Body Age. The data revealed increasing heterogeneity in Speed-Body Age after age 70 as depicted in Fig. S8C, D.
Disability index (DI), Disability-Body Clock, and Disability-Body Age
To capture late-onset outcomes experienced by individuals, we devised a new Bayesian DI, in which each individual could have multiple components of physical and cognitive disability. We incorporated a total of 47 measured components as the total number of trials in the DI (Table S4). The components representing these outcomes were modeled using the Zero Inflated Beta Binomial (ZIBB) distribution, estimating the probability distribution of the number of observed components (events) given the total number of measured components (trials) for each individual. This approach allowed for flexible probabilities that could vary from person to person. Moreover, ZIBB handled model instability resulting from zero values in young populations without age-related disability or in the older adults exhibiting resilience to late-onset outcomes. The detailed ZIBB formula is described in the Supplemental Methods (equations [S1], [S2], and below, equations [9] and [10]).
$${{\rm{Fit}}}\_{{\rm{BLSA}}}={{\rm{probability}}}\; {{\rm{of}}}({{\rm{events}}}){{\rm{|trial}}}({{\rm{total}}}\; {{\rm{events}}}) \sim 1+(1{{\rm{|id}}}),{{\rm{data}}}={{\rm{BLSA}}}$$
(9)
$${{\rm{Disability}}}\; {{\rm{Index}}}={{\rm{posterior}}}\; {{\rm{predict}}}({{\rm{Fit}}}\_{{\rm{BLSA}}})$$
(10)
The same formula was applied to the InCHIANTI data for replication. This approach enabled us to explore how the Body Clock predicts the model-based physical and cognitive DI, based on events within the total 47 measured (trials) components (Table S3).
Not everyone with an elevated Body Clock experiences disability. This heterogeneous process can be captured using the Disability-Body Clock (defined as the sum of positive events conditioned on the total number of measured events and predicted by the Body Clock). This approach enabled us to explore how the Body Clock predicts the model-based physical and cognitive DI, based on events within the total 47 measured components (Table S4; equations [11 and [12]).
$${{Fit}}_{{BLSA}}= {probability}\; {of}\left(\left({events}\right)\right){|trial}\left({total}\; {events}\right) \sim {Body}\; {Clock} \\ +\left(1{|id}\right),{data}={BLSA}$$
(11)
$${Disability}-{Body}\; {Clock}={posterior}\; {predict}\left({Fit}{{\_}}{BLSA}\right)$$
(12)
The same formula was applied to the InCHIANTI data for replication.
In the BLSA data, the median DI was 0.03 (range 0.007–0.71). The median Disability-Body Clock was 1.2 (range 0.12–9.48) and not different between men and women (women: 1.5 ± 1.04; men: 1.52 ± 1.08) (Fig. S9.A). In the InCHIANTI study, the median DI was 0.22 (range 0.05–0.95) with a larger mean DI in women (0.3 ± 0.16) than men (0.22 ± 0.13). The median Disability-Body Clock was 6.3 (range: 1.45–12.8) with a larger mean in women (6.5 ± 1.82) than men (6.17 ± 1.74) (Fig. S9B). Additionally, individuals of varying ages may exhibit different effects of the Body Clock on disability. To account for this heterogeneity in such aging rate, we regressed the chronological age against the Disability-Body Clock. The predicted value from this model is referred to as the Disability-Body Age (equations [13] and [14]).
$${Fi}{t}_{{BLSA}}={Age} \sim {Disability}-{Body}\; {Clock},{data}={BLSA}$$
(13)
$${Disability}-{Body}\; {Age} \sim {posterior}\; {predict}\left({Fit}{{\_}}{BLSA}\right)$$
(14)
The same formula was applied to the InCHIANTI data for replication.
In the BLSA data, the median Disability-Body Age was 62.71 (range 60.4–135.67) with high heterogeneity after age 60. Moreover, in some individuals of either sex, the biological age surpassed 120 (Fig. 5). In the InCHIANTI data, the median Disability-Body Age was 67.3 (range 43.5–120.2) with women exhibiting a higher average biological age compared to men (69.56 ± 11.55 vs. 67.2 ± 10.7) (Fig. S9C).

Disability-Body Age surpasses the reported chronological age in humans. There is increasing heterogeneity in Disability-Body Age with increases in chronological age. Sample size n = 907, Women=451.
Body Clock predicts binary short physical performance battery (SPPB), disability, geriatric syndrome, and mortality
To evaluate the predictive capability of the Body Clock for age-related late-life binary outcomes, we utilized Multilevel Negative Binomial Regression models within the Bayesian framework, accounting for time. We focused on outcomes including SPPB < 9, disability (Activities of Daily Living scale [ADL] >1 and/or Instrumental Activities of Daily Living scale [IADL] >1), geriatric syndrome (injurious fall or urinary/bowel incontinence as dichotomous variables), and mortality. Our analysis revealed that individuals with higher Body Clock scores had an elevated risk of SPPB < 9 (BLSA: hazard ratio [HR] = 2.7, 95% CI = 1.7–10.6; InCHIANTI: HR = 2.0, 95% CI = 1.7–2.3), geriatric syndrome (BLSA: HR = 1.6, 95% CI = 1.5–1.7; InCHIANTI: HR = 1.2, 95% CI = 1.2–1.3), disability (BLSA: HR = 1.7, 95% CI = 1.5–1.8; InCHIANTI: HR = 1.6, 95% CI = 1.5–1.7), and mortality (BLSA: HR = 1.8, 95% CI = 1.5–2.2; InCHIANTI: HR = 1.5, 95% CI: 1.4–1.6). These findings indicate that the Body Clock could significantly predict age-related late-life outcomes.
The Body Clock superseded Frailty Index to predict late-onset binary outcomes. We used the same list of diseases described in Table S1 to manually develop disease-based FI scores, which is a common metric used to capture age-related deficits11,39,40. Rather than considering different levels of disease severity, each condition was treated as a separate binary variable. For example, in the case of ischemic heart disease, angina pectoris and acute myocardial infarction are considered distinct deficits in the FI, whereas the Body Clock treats them as different levels of the same condition. Similarly, for eye diseases, the Body Clock treats cataract, glaucoma, and macular degeneration as varying levels of eye disease, while in the FI, each condition is treated as a separate binary variable (e.g., having cataracts vs. no cataract). Overall, the same diseases were considered in both the FI and Body Clock. The primary difference lies in the methods (Bayesian-based Body Clock vs. manually developed FI) used to develop these health metrics.
It is important to note that while the FI and Body Clock exhibit strong correlation (r = 0.83, 95% CI: 0.816–0.84), the Body Clock more effectively captures health heterogeneity, as evidenced by instances where identical FI values correspond to varying Body Clock values (Figs. 6, S10A, B). We utilized Receiver Operating Characteristic (ROC) and Area Under the Curve (AUC) analyses to compare the predictive performance of the FI and Body Clock for various binary outcomes including SPPB, geriatric syndrome, disability, and mortality. The AUC values consistently indicate that the Body Clock provided stronger predictions for all these outcomes across both the BLSA and InCHIANTI studies (Figs. 7, S11A).

Body Clock also captures heterogeneity better than FI: for heterogeneous values of the Body Clock—serving as a proxy for the intrinsic age and health entropy—FI values remain largely unchanged. Sample size n = 907, Women = 451.

The Body Clock predicts binary SPPB < 9, geriatric syndrome, and disability, superseding the FI score with more than 90% accuracy. Sample size n = 907, Women=451.
We used a Bayesian model for the FI and Body Clock to predict Bayesian DI. To assess DI predictions, we compared the Body Clock’s performance comparing ELPD, which averages the logarithm of predictive densities for each data point and serves as a metric for comparing models, indicating how effectively they predict observed or new data points. Higher ELPD values signify better predictive performance and discern which model offers the most accurate predictions for the data. Our results demonstrated that the Body Clock exhibited better model performance compared to FI, with differences in ELPD favoring the Body Clock, indicated by the larger ELPD with both BLSA and InCHIANTI data. The ELPD difference presented as negative values in differences of ELPD (ELPD_DIFF) from the FI and Body Clock models where the Body Clock with the higher ELPD is the reference model (FI in BLSA: ELPD_DIFF = −27.9 ± 7.5; FI in InCHIANTI: ELPD_DIFF = −26.6 ± 6).
To gain deeper insights into the relationship between the Body Clock and SPPB, we explored the association between SPPB < 9 at time-1 and the Body Clock. This analysis revealed a reciprocal relationship between lagged functional impairment. With BLSA, we observed that SPPB < 9 was associated with a higher Body Clock value (b = 1.9, 95% CI = 1.6–2.4). Similarly, in the InCHIANTI dataset, SPPB < 9 predicted an increased Body Clock value (b = 2.1, 95% CI = 1.6–2.8). These results underscore the reciprocal nature of the association between the Body Clock and SPPB, suggesting when impaired physical performance is established it can significantly contribute to health deterioration as reflected in the Body Clock.
Replication of Body Clock using NHANES data
We applied the same statistical pipeline described for BLSA and InCHIANTI to predict BODN in 40,700 individuals in the NHANES data, spanning the years 2003–2018, and derived personalized Body Clocks. As chronological age increased, the Body Clock demonstrated heightened heterogeneity after age 60, mirroring the pattern observed in the BLSA and InCHIANTI studies (Fig. S12A). The distribution of the BODN number suggests that morbidity in more than 5 organs becomes dominant after age 50 and older. However, some older adults with low Body Clock values exhibited bodily system resilience. In the NHANES data, women exhibited larger Body Clock values than men, suggesting that in general, women experienced poorer health compared to men. In addition, in NHANES data, Body Clock outperformed FIs specifically in predicting mortality, with an accuracy of 80% (Fig. S13).
The chronological age was then regressed over the Body Clock. Similar to the BLSA and InCHIANTI data, the Disability-Body Age surpassed the reported ceiling in humans (Fig. S12B). The Body Clock predicted SPPB, disability, geriatric syndrome, and all-cause mortality (Table 2). Of note, in the NHANES data, there was only a small sample size for adults older than 85 years and, thus, it is likely that our estimates were affected by information censoring.
Applying the BLSA parameters (training set) to the NHANES and InCHIANTI data (validation sets) to develop the Body Clock
Using out-of-data input, the validation is the same concept as applying training data parameters (BLSA model) to a new data (InCHIANTI or NHANES) as test sets to obtain the Body Clock. We used the BLSA full entropy model to predict out-of-data input in the NHANES and the InCHIANTI data (test sets). The predicted and observed values in both studies were matched as depicted using posterior predicted check in the Bayesian framework (Fig. S14 A, B).
$${Fi}{t}_{{BLSA}}={BODN} \sim {mo}\left({Disease}1\right)\cdots+{mo}\left({DiseasesN}\right)+{time}+\left(1|{id}\right),{data}={BLSA}$$
(15)
$${{\rm{Predict}}}\; {{\rm{in}}}\; {{\rm{InCHIANTI}}}({{\rm{test}}}\; {{\rm{set}}})={{\rm{posterior}}}\; {{\rm{predict}}}({{\rm{Fit}}}\_{{\rm{BLSA}}},{{\rm{data}}}={{\rm{INCHIANTI}}})$$
(16)
$${{\rm{Predict}}}\; {{\rm{in}}}\; {{\rm{NHANES}}}({{\rm{test}}}\; {{\rm{set}}})={{\rm{posterior}}}\; {{\rm{predict}}}({{\rm{Fit}}}\_{{\rm{BLSA}}},{{\rm{data}}}={{\rm{NHANES}}})$$
(17)
There is a strong correlation between the Body Clock derived directly from the analysis of the InCHIANTI dataset (as a replication) and the Body Clock obtained by applying the BLSA parameters to the InCHIANTI dataset (as a validation test set) (Fig. 8). These results suggest that a valid model can be used to estimate the Body Clock in new data as a test set.

With the in-data method the full model (all diseases predicting longitudinal BODN) was performed, and the Body Clock (replication) was obtained. Using the out-of-data method the parameters of the BLSA model (training set) were applied to the InCHIANTI data (test set). The graph depicts a high correlation between in-data and out-of-data methods of developing the Body Clock. The Pearson correlation for men was r = 0.76, 95% CI (0.73–0.78) and for women was r = 0.70, 95% CI (0.68–0.72). The parameters of a valid model can be used to predict new data. Sample size in InCHIANTI data n = 986, women = 551.
link