, Cheol Min Shin², Kyungtaek Park³, Jinyeon Jo⁴, Ah Ra Do¹, Sungkyoung Choi⁵, Jung Hun Ohn², Sejoon Lee⁶, Jeongseon Kim⁷, Sun Ha Jee⁸, Seung Joo Kang⁹, Nayoung Kim²,⁹, Sungho Won¹,⁴,¹⁰
¹Interdisciplinary Program of Bioinformatics, Seoul National University, Seoul, Korea
²Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Korea
³Institute of Health and Environment, Seoul National University, Seoul, Korea
⁴Department of Public Health Sciences, Seoul National University, Seoul, Korea
⁵Department of Mathematical Data Science, Hanyang University (ERICA), Ansan, Korea
⁶Precision Medicine Center, Seoul National University Bundang Hospital, Seongnam, Korea
⁷Center for Gastric Cancer, National Cancer Center Hospital, National Cancer Center, Goyang, Korea
⁸Department of Epidemiology and Health Promotion, Institute for Health Promotion, Graduate School of Public Health, Yonsei University, Seoul, Korea
⁹Department of Internal Medicine and Liver Research Institute, Seoul National University College of Medicine, Seoul, Korea
¹⁰RexSoft Inc., Seoul, Korea
Copyright © 2026 by the Korean Cancer Association
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Ethical Statement
The study protocol was approved by the Ethics Committee of Seoul National University Bundang Hospital (IRB No. B-1610-366-303 and B-2008777-301) and Seoul National University (E2308/001-020). Written informed consent was obtained from all subjects following ethical principles for medical research of the 64th World Medical Association Declaration of Helsinki.
Author Contributions
Conceived and designed the analysis: Kim N, Won S.
Collected the data: Jee SH, Kang SJ, Kim N, Won S.
Contributed data or analysis tools: Yie GE, Shin CM, Park K, Jo J, Do AR, Choi S, Ohn JH, Lee S, Kim J, Jee SH, Kang SJ, Kim N, Won S.
Performed the analysis: Yie GE, Shin CM, Park K, Jo J, Do AR, Kim N, Won S.
Wrote the paper: Yie GE, Shin CM, Kim N, Won S.
Writing - review & editing: Yie GE, Shin CM, Park K, Jo J, Do AR, Choi S, Ohn JH, Lee S, Kim J, Jee SH, Kang SJ, Kim N, Won S.
Conflict of Interest
Conflict of interest relevant to this article was not reported.
Funding
This study was conducted with bioresources from the National Biobank of Korea, the Korea Disease Control and Prevention Agency, Republic of Korea (KBN-2020-101). Statistical analyses were supported by the National Supercomputing Center with supercomputing resources including technical support (KSC-2022-CRE-0319). This work was supported by grant nos. 13-2019-001, 02-2020-041, and 02-2023-0012 from the Seoul National University Bundang Hospital Research Fund and by National Research Foundation of Korea (NRF) grants funded by the Korean government (MSIT) (Nos. RS-2024-00337453 and RS-2024-00346850).
Values are presented as number (%) or mean±SD. p < 0.05 indicates statistical significance. BMI, body mass index; GC, gastric cancer; GENIE, Gene-EnvironmeNtal IntEraction and phenotype; KoGES, Korean Genome and Epidemiology Study; NA, not applicable; SD, standard deviation; SNUBH, Seoul National University Bundang Hospital.
Model 1 included PRS-GC and PRS-Alcohol. Model 2 included conventional risk factors (age, sex, BMI, smoking, salted food intake, family history of GC, and alcohol consumption). Model 3 included the variables in Model 2 plus PRS-GC and PRS-Alcohol. Model 4 included the variables in Model 3 plus the interaction terms of alcohol consumption with PRS-GC and PRS-Alcohol. p < 0.05 indicates statistical significance. AIC, Akaike information criterion; BMI, body mass index; CI, confidence interval; GC, gastric cancer; PRS, polygenic risk score.
| Variable | Validation set (SNUBH-GENIE): GC case (n=531) | Validation set (SNUBH-GENIE): Control (n=8,315) | Validation set (SNUBH-GENIE): p-value | Test set (KoGES): Progressor to GC (n=313) | Test set (KoGES): Non-progressor (n=67,458) | Test set (KoGES): p-value |
|---|---|---|---|---|---|---|
| Male sex | 371 (69.9) | 5,064 (60.9) | < 0.001 | 205 (65.5) | 24,436 (36.2) | < 0.001 |
| Age (yr) | 59.9±11.4 | 48.8±9.9 | < 0.001 | 57.6±8.0 | 54.0±8.3 | < 0.001 |
| BMI (kg/m²) | 23.9±3.2 | 23.3±3.1 | < 0.001 | 24.2±2.8 | 24.0±2.9 | 0.426 |
| Salted food intake (g/day) | NA | NA | NA | 175.4±148.3 | 150.0±119.4 | 0.003 |
| Salt preference | | | | | | |
| Non-salty | 115 (22.7) | 3,252 (41.1) | < 0.001 | NA | NA | NA |
| Mild | 225 (44.5) | 3,784 (47.8) | | NA | NA | |
| Salty | 166 (32.8) | 884 (11.2) | | NA | NA | |
| Smoking status | | | | | | |
| Never | 190 (36.3) | 3,946 (51.6) | < 0.001 | 142 (45.4) | 48,475 (71.9) | < 0.001 |
| Past | 223 (42.6) | 2,283 (29.8) | | 100 (31.9) | 10,647 (15.8) | |
| Current | 111 (21.2) | 1,420 (18.6) | | 71 (22.7) | 8,306 (12.3) | |
| Alcohol consumption (g/day) | 10.5±14.2 | 8.9±12.5 | 0.012 | 13.0±25.4 | 7.2±19.0 | < 0.001 |
| Family history of GC | 117 (22.0) | 1,081 (13.0) | < 0.001 | 41 (13.1) | 6,596 (9.8) | 0.061 |
| Helicobacter pylori–positive | 515 (97.0) | 3,063 (36.8) | < 0.001 | NA | NA | NA |
| Lauren classification | | | | | | |
| Intestinal | 301 (58.1) | NA | NA | NA | NA | NA |
| Diffuse | 153 (29.5) | NA | | NA | NA | |
| NA | 64 (12.4) | NA | | NA | NA | |
| Variable | Model 1: β (95% CI) | Model 1: p-value | Model 2: β (95% CI) | Model 2: p-value | Model 3: β (95% CI) | Model 3: p-value | Model 4: β (95% CI) | Model 4: p-value |
|---|---|---|---|---|---|---|---|---|
| Age (yr) | - | - | 0.046 (0.032 to 0.060) | 3.7×10⁻¹¹ | 0.046 (0.033 to 0.060) | 3.0×10⁻¹¹ | 0.046 (0.033 to 0.060) | 3.0×10⁻¹¹ |
| Sex (female vs. male) | - | - | −0.715 (−1.056 to −0.366) | 4.8×10⁻⁵ | −0.714 (−1.055 to −0.364) | 5.0×10⁻⁵ | −0.725 (−1.067 to −0.375) | 4.0×10⁻⁵ |
| BMI (kg/m²) | - | - | −0.015 (−0.056 to 0.025) | 0.457 | −0.014 (−0.055 to 0.026) | 0.485 | −0.014 (−0.055 to 0.026) | 0.482 |
| Smoking | | | | | | | | |
| Past (vs. never) | - | - | 0.455 (0.106 to 0.814) | 0.012 | 0.451 (0.102 to 0.811) | 0.013 | 0.451 (0.101 to 0.811) | 0.013 |
| Current (vs. never) | - | - | 0.546 (0.174 to 0.920) | 0.004 | 0.545 (0.173 to 0.902) | 0.004 | 0.544 (0.171 to 0.920) | 0.004 |
| Salted food intake (per 20 g/day) | - | - | 0.021 (0.005 to 0.036) | 0.008 | 0.022 (0.005 to 0.037) | 0.007 | 0.022 (0.005 to 0.037) | 0.007 |
| Family history of GC | - | - | 0.366 (0.023 to 0.685) | 0.030 | 0.342 (−0.002 to 0.661) | 0.043 | 0.342 (−0.002 to 0.661) | 0.043 |
| Alcohol consumption (per 20 g/day) | - | - | 0.041 (−0.043 to 0.088) | 0.188 | 0.045 (−0.033 to 0.089) | 0.113 | 0.017 (−0.090 to 0.075) | 0.664 |
| PRS | | | | | | | | |
| PRS-GC | 0.289 (0.213 to 0.365) | 8.2×10⁻¹⁴ | - | - | 0.277 (0.165 to 0.390) | 1.4×10⁻⁶ | 0.224 (0.105 to 0.344) | 2.3×10⁻⁴ |
| PRS-Alcohol | −0.052 (−0.127 to 0.023) | 0.173 | - | - | −0.103 (−0.214 to 0.009) | 0.069 | −0.04 (−0.159 to 0.082) | 0.521 |
| Interaction terms | | | | | | | | |
| Alcohol×PRS-GC | - | - | - | - | - | - | 0.088 (0.024 to 0.161) | 0.008 |
| Alcohol×PRS-Alcohol | - | - | - | - | - | - | −0.142 (−0.235 to −0.047) | 0.003 |
| Model AIC | 7,566.9 | | 3,766.0 | | 3,743.3 | | 3,738.0 | |
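To make the Model 4 interaction term easier to read, the following minimal sketch shows how a PRS-GC coefficient that depends on alcohol intake could be derived from the reported estimates, and how the AIC values of the nested models compare. The coefficients and AIC values are copied from the regression table. The function names `prs_gc_effect` and `aic_drop` are hypothetical. The sketch assumes that alcohol enters the interaction on the same scale as its main effect (units of 20 g/day), which the footnotes do not confirm. Whether exp(β) is an odds ratio or a hazard ratio depends on the regression type, which this section does not state.

```python
import math

# Estimates copied from the Model 4 column of the regression table.
BETA_PRS_GC = 0.224            # main effect of PRS-GC
BETA_ALCOHOL_X_PRS_GC = 0.088  # Alcohol×PRS-GC interaction term

# AIC values copied from the table (lower indicates a better trade-off of
# fit and complexity). Model 1 is omitted: the footnotes do not say whether
# its AIC (7,566.9) was computed on the same sample as Models 2 to 4.
AIC = {"Model 2": 3766.0, "Model 3": 3743.3, "Model 4": 3738.0}


def prs_gc_effect(alcohol_units: float) -> float:
    """Return the PRS-GC coefficient conditional on alcohol intake.

    ASSUMPTION: `alcohol_units` is alcohol consumption in units of
    20 g/day, matching the scale of the alcohol main effect in the table.
    """
    return BETA_PRS_GC + BETA_ALCOHOL_X_PRS_GC * alcohol_units


def aic_drop(smaller_model: str, larger_model: str) -> float:
    """Return how much the AIC falls when moving to the larger model."""
    return AIC[smaller_model] - AIC[larger_model]


# Illustrative use: conditional PRS-GC effect at 0, 20, and 40 g/day.
for units in (0, 1, 2):
    beta = prs_gc_effect(units)
    print(f"{units * 20} g/day: beta = {beta:.3f}, exp(beta) = {math.exp(beta):.3f}")

print(f"Model 2 -> Model 3 AIC drop: {aic_drop('Model 2', 'Model 3'):.1f}")
print(f"Model 3 -> Model 4 AIC drop: {aic_drop('Model 3', 'Model 4'):.1f}")
```

Under these assumptions, the positive interaction estimate means the PRS-GC coefficient grows with alcohol intake, and each added block of terms lowers the AIC relative to the preceding model.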
