1
|
Kobayashi M, Yamashina A, Satomi K, Tezuka A, Duarte K, Ito S, Asakura M, Kitakaze M, Girerd N. Effect of eplerenone in acute heart failure using a win ratio approach. Clin Res Cardiol 2024:10.1007/s00392-024-02578-0. [PMID: 39565385 DOI: 10.1007/s00392-024-02578-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Accepted: 11/07/2024] [Indexed: 11/21/2024]
Affiliation(s)
- Masatake Kobayashi
- Department of Cardiology, Tokyo Medical University, Tokyo, Japan
- Université de Lorraine, INSERM, Centre d'Investigations Cliniques Plurithématique 1433, Inserm U1116, CHRU de Nancy and F-CRIN INI-CRCT, Nancy, France
| | - Akira Yamashina
- Department of Cardiology, Tokyo Medical University, Tokyo, Japan
| | - Kazuhiro Satomi
- Department of Cardiology, Tokyo Medical University, Tokyo, Japan
| | - Ayako Tezuka
- Department of Cardiology, Tokyo Medical University, Tokyo, Japan
| | - Kevin Duarte
- Université de Lorraine, INSERM, Centre d'Investigations Cliniques Plurithématique 1433, Inserm U1116, CHRU de Nancy and F-CRIN INI-CRCT, Nancy, France
| | - Shin Ito
- Department of Clinical Research and Development, National Cerebral and Cardiovascular Center, Osaka, Japan
| | - Masanori Asakura
- Department of Cardiovascular and Renal Medicine, Hyogo Medical University, Hyogo, Japan
| | - Masafumi Kitakaze
- Department of Clinical Research and Development, National Cerebral and Cardiovascular Center, Osaka, Japan.
- Hanwa Memorial Hospital, Osaka, Japan.
- Hanwa Daini Senboku Hospital, 3176 Fukaikitamach, i, Naka-Ku Sakai City, Osaka, Japan.
| | - Nicolas Girerd
- Université de Lorraine, INSERM, Centre d'Investigations Cliniques Plurithématique 1433, Inserm U1116, CHRU de Nancy and F-CRIN INI-CRCT, Nancy, France
| |
Collapse
|
2
|
Mu Y, Hu A, Kan H, Li Y, He Y, Fan W, Liu H, Li Q, Zheng Y. Preterm Prelabor Rupture of Membranes Linked to Vaginal Bacteriome of Pregnant Females in the Early Second Trimester: a Case-Cohort Design. Reprod Sci 2023; 30:2324-2335. [PMID: 36725814 PMCID: PMC9891760 DOI: 10.1007/s43032-022-01153-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 12/13/2022] [Indexed: 02/03/2023]
Abstract
Preterm prelabor rupture of membranes (PPROM) is a major cause of spontaneous preterm birth (sPTB), one of the greatest challenges facing obstetrics with complicated pathogenesis. This case-cohort study investigated the association between vaginal bacteriome of singleton pregnant females in the early second trimester and PPROM. The study included 35,255 and 180 pregnant females with PPROM as cases and term-birth without prelabor rupture of membranes (TWPROM) and term prelabor rupture of membranes (TPROM) pregnant females as controls, respectively. Using 16S rRNA sequencing, the vaginal microbiome traits were analyzed. Females with PPROM had higher alpha and beta diversity (P < 0.05) than TWPROM and TPROM. The presence of L. mulieris was associated with a decreased risk of PPROM (adjusted odds ratio [aOR] = 0.35; 95% confidence interval [CI]: 0.17-0.72) compared with TWPROM. Meanwhile, the presence of Megasphaera genus (aOR = 2.27; 95% CI: 1.09-4.70), Faecalibacterium genus (aOR = 3.29; 95% CI: 1.52-7.13), Bifidobacterium genus (aOR = 3.26; 95% CI: 1.47-7.24), Xanthomonadales genus (aOR = 2.76; 95% CI: 1.27-6.01), Gammaproteobacteria class (aOR = 2.36; 95% CI: 1.09-5.14), and Alphaproteobacteria class (aOR = 2.45; 95% CI: 1.14-5.26) was associated with an increased risk of PPROM compared with TWPROM. Our results indicated that the risk of PPROM can decrease with vaginal L. mulieris but increase with high alpha or beta diversity, and several vaginal bacteria in pregnant females may be involved in the occurrence of PPROM.
Collapse
Affiliation(s)
- Yutong Mu
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China
| | - Anqun Hu
- Department of Clinical Laboratory, Anqing Municipal Hospital, Anqing, 246003, China
| | - Hui Kan
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China
| | - Yijie Li
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China
| | - Yining He
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China
- Biostatistics Office, Clinical Research Unit, Shanghai Ninth People's Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, 200011, China
| | - Wei Fan
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China
| | - Haiyan Liu
- Biostatistics Office, Clinical Research Unit, Shanghai Ninth People's Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, 200011, China.
- Department of Blood Transfusion, Anqing Municipal Hospital, Anqing, 246003, China.
| | - Qing Li
- Department of Obstetrics and Gynecology, Anqing Municipal Hospital, Anqing, 246003, China.
| | - Yingjie Zheng
- Key Laboratory for Health Technology Assessment, National Commission of Health and Family Planning, Fudan University, Shanghai, 200032, China.
- Department of Epidemiology, School of Public Health, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
3
|
Buczak P, Chen JJ, Pauly M. Analyzing the Effect of Imputation on Classification Performance under MCAR and MAR Missing Mechanisms. ENTROPY (BASEL, SWITZERLAND) 2023; 25:521. [PMID: 36981409 PMCID: PMC10048089 DOI: 10.3390/e25030521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 03/10/2023] [Accepted: 03/14/2023] [Indexed: 06/18/2023]
Abstract
Many datasets in statistical analyses contain missing values. As omitting observations containing missing entries may lead to information loss or greatly reduce the sample size, imputation is usually preferable. However, imputation can also introduce bias and impact the quality and validity of subsequent analysis. Focusing on binary classification problems, we analyzed how missing value imputation under MCAR as well as MAR missingness with different missing patterns affects the predictive performance of subsequent classification. To this end, we compared imputation methods such as several MICE variants, missForest, Hot Deck as well as mean imputation with regard to the classification performance achieved with commonly used classifiers such as Random Forest, Extreme Gradient Boosting, Support Vector Machine and regularized logistic regression. Our simulation results showed that Random Forest based imputation (i.e., MICE Random Forest and missForest) performed particularly well in most scenarios studied. In addition to these two methods, simple mean imputation also proved to be useful, especially when many features (covariates) contained missing values.
Collapse
Affiliation(s)
- Philip Buczak
- Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany
| | - Jian-Jia Chen
- Department of Computer Science, TU Dortmund University, 44227 Dortmund, Germany
| | - Markus Pauly
- Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany
- UA Ruhr, Research Center Trustworthy Data Science and Security, 44227 Dortmund, Germany
| |
Collapse
|
4
|
Kertel M, Pauly M. Estimating Gaussian Copulas with Missing Data with and without Expert Knowledge. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1849. [PMID: 36554254 PMCID: PMC9778345 DOI: 10.3390/e24121849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 12/07/2022] [Accepted: 12/10/2022] [Indexed: 06/17/2023]
Abstract
In this work, we present a rigorous application of the Expectation Maximization algorithm to determine the marginal distributions and the dependence structure in a Gaussian copula model with missing data. We further show how to circumvent a priori assumptions on the marginals with semiparametric modeling. Further, we outline how expert knowledge on the marginals and the dependency structure can be included. A simulation study shows that the distribution learned through this algorithm is closer to the true distribution than that obtained with existing methods and that the incorporation of domain knowledge provides benefits.
Collapse
Affiliation(s)
- Maximilian Kertel
- BMW Group, Battery Cell Competence Centre, 80788 Munich, Germany
- Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany
| | - Markus Pauly
- Department of Statistics, TU Dortmund University, 44227 Dortmund, Germany
- Research Center Trustworthy Data Science and Security, UA Ruhr, 44227 Dortmund, Germany
| |
Collapse
|
5
|
Differential Effect of Vaginal Microbiota on Spontaneous Preterm Birth among Chinese Pregnant Women. BIOMED RESEARCH INTERNATIONAL 2022; 2022:3536108. [PMID: 36506912 PMCID: PMC9731763 DOI: 10.1155/2022/3536108] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 10/25/2022] [Accepted: 11/03/2022] [Indexed: 12/02/2022]
Abstract
Objective The effect of vaginal microbiota on spontaneous preterm birth (sPTB) has not been fully addressed, and few studies have explored the associations between vaginal taxa and sPTB in the gestational diabetes mellitus (GDM) and non-GDM groups, respectively. Study Design. To minimize external interference, a total of 41 pregnant women with sPTB and 308 controls (pregnant women without sPTB) from same regain were enrolled in this case-cohort study. Controls were randomly selected at baseline. With the exception of GDM, other characteristics were not significantly different between the two groups. Vaginal swabs were collected at early second trimester. Using 16S amplicon sequencing, the main bioinformatics analysis was performed on the platform of QIIME 2. Vaginal microbiota traits of the sPTB group were compared with controls. Finally, the effects of binary taxa on sPTB in the GDM group and the non-GDM group were analyzed, respectively. Results The proportion of GDM in the sPTB (19.51%) was higher than the controls (7.47%, P = 0.018). The vaginal microbiota of pregnant women with sPTB exhibited higher alpha diversity metrics (observed features, P = 0.001; Faith's phylogenetic diversity, P = 0.013) and different beta diversity metrics (unweighted UniFrac, P = 0.006; Jaccard's distance, P = 0.004), compared with controls. The presence of Lactobacillus paragasseri/gasseri (aOR: 3.12, 95% CI: 1.24-7.84), Streptococcus (aOR: 3.58, 95% CI: 1.68-7.65), or Proteobacteria (aOR: 3.39, 95% CI: 1.55-7.39) was associated with an increased risk of sPTB in the non-GDM group (P < 0.05). However, the relative abundance of novel L. mulieris (a new species of the L. delbrueckii group) was associated with a decreased risk of sPTB (false discovery rate, 0.10) in all pregnant women. Conclusion GDM may modify the association of vaginal taxa with sPTB, suggesting that maternal GDM should be considered when using vaginal taxa to identify pregnant women at high risk of sPTB.
Collapse
|
6
|
On the Relation between Prediction and Imputation Accuracy under Missing Covariates. ENTROPY 2022; 24:e24030386. [PMID: 35327897 PMCID: PMC8947649 DOI: 10.3390/e24030386] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Revised: 02/23/2022] [Accepted: 02/23/2022] [Indexed: 02/01/2023]
Abstract
Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favorable prediction accuracy in different learning problems. In this work, we analyze through simulation the interaction between imputation accuracy and prediction accuracy in regression learning problems with missing covariates when Machine-Learning-based methods for both imputation and prediction are used. We see that even a slight decrease in imputation accuracy can seriously affect the prediction accuracy. In addition, we explore imputation performance when using statistical inference procedures in prediction settings, such as the coverage rates of (valid) prediction intervals. Our analysis is based on empirical datasets provided by the UCI Machine Learning repository and an extensive simulation study.
Collapse
|
7
|
Rubarth K, Pauly M, Konietschke F. Ranking procedures for repeated measures designs with missing data: Estimation, testing and asymptotic theory. Stat Methods Med Res 2021; 31:105-118. [PMID: 34841991 PMCID: PMC8721540 DOI: 10.1177/09622802211046389] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
We develop purely nonparametric methods for the analysis of repeated measures designs with missing values. Hypotheses are formulated in terms of purely nonparametric treatment effects. In particular, data can have different shapes even under the null hypothesis and therefore, a solution to the nonparametric Behrens-Fisher problem in repeated measures designs will be presented. Moreover, global testing and multiple contrast test procedures as well as simultaneous confidence intervals for the treatment effects of interest will be developed. All methods can be applied for the analysis of metric, discrete, ordinal, and even binary data in a unified way. Extensive simulation studies indicate a satisfactory control of the nominal type-I error rate, even for small sample sizes and a high amount of missing data (up to 30%). We apply the newly developed methodology to a real data set, demonstrating its application and interpretation.
Collapse
Affiliation(s)
- Kerstin Rubarth
- 14903Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Biometry and Clinical Epidemiology, Charitéplatz 1, Berlin, Germany.,Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Straße 2, Berlin, Germany
| | - Markus Pauly
- Department of Statistics, TU Dortmund University, Dortmund, Germany
| | - Frank Konietschke
- 14903Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Institute of Biometry and Clinical Epidemiology, Charitéplatz 1, Berlin, Germany.,Berlin Institute of Health (BIH), Anna-Louisa-Karsch-Straße 2, Berlin, Germany
| |
Collapse
|
8
|
Friedrich S, Antes G, Behr S, Binder H, Brannath W, Dumpert F, Ickstadt K, Kestler HA, Lederer J, Leitgöb H, Pauly M, Steland A, Wilhelm A, Friede T. Is there a role for statistics in artificial intelligence? ADV DATA ANAL CLASSI 2021. [DOI: 10.1007/s11634-021-00455-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
AbstractThe research on and application of artificial intelligence (AI) has triggered a comprehensive scientific, economic, social and political discussion. Here we argue that statistics, as an interdisciplinary scientific field, plays a substantial role both for the theoretical and practical understanding of AI and for its future development. Statistics might even be considered a core element of AI. With its specialist knowledge of data evaluation, starting with the precise formulation of the research question and passing through a study design stage on to analysis and interpretation of the results, statistics is a natural partner for other disciplines in teaching, research and practice. This paper aims at highlighting the relevance of statistical methodology in the context of AI development. In particular, we discuss contributions of statistics to the field of artificial intelligence concerning methodological development, planning and design of studies, assessment of data quality and data collection, differentiation of causality and associations and assessment of uncertainty in results. Moreover, the paper also discusses the equally necessary and meaningful extensions of curricula in schools and universities to integrate statistical aspects into AI teaching.
Collapse
|
9
|
Amro L, Pauly M, Ramosaj B. Asymptotic-based bootstrap approach for matched pairs with missingness in a single arm. Biom J 2021; 63:1389-1405. [PMID: 34240446 DOI: 10.1002/bimj.202000051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 12/11/2020] [Accepted: 01/20/2021] [Indexed: 11/06/2022]
Abstract
The issue of missing values is an arising difficulty when dealing with paired data. Several test procedures are developed in the literature to tackle this problem. Some of them are even robust under deviations and control type-I error quite accurately. However, most of these methods are not applicable when missing values are present only in a single arm. For this case, we provide asymptotic correct resampling tests that are robust under heteroskedasticity and skewed distributions. The tests are based on a meaningful restructuring of all observed information in quadratic form-type test statistics. An extensive simulation study is conducted exemplifying the tests for finite sample sizes under different missingness mechanisms. In addition, illustrative data examples based on real life studies are analyzed.
Collapse
Affiliation(s)
- Lubna Amro
- Mathematical Statistics and Applications in Industry, Faculty of Statistics, Technical University of Dortmund, Dortmund, Germany
| | - Markus Pauly
- Mathematical Statistics and Applications in Industry, Faculty of Statistics, Technical University of Dortmund, Dortmund, Germany
| | - Burim Ramosaj
- Mathematical Statistics and Applications in Industry, Faculty of Statistics, Technical University of Dortmund, Dortmund, Germany
| |
Collapse
|
10
|
Peralta M, Jannin P, Haegelen C, Baxter JSH. Data imputation and compression for Parkinson's disease clinical questionnaires. Artif Intell Med 2021; 114:102051. [PMID: 33875162 DOI: 10.1016/j.artmed.2021.102051] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 01/27/2021] [Accepted: 02/21/2021] [Indexed: 10/22/2022]
Abstract
Medical questionnaires are a valuable source of information but are often difficult to analyse due to both their size and the high possibility of them having missing values. This is a problematic issue in biomedical data science as it may complicate how individual questionnaire data is represented for statistical or machine learning analysis. In this paper, we propose a deeply-learnt residual autoencoder to simultaneously perform non-linear data imputation and dimensionality reduction. We present an extensive analysis of the dynamics of the performance of this autoencoder regarding the compression rate and the proportion of missing values. This method is evaluated on motor and non-motor clinical questionnaires of the Parkinson's Progression Markers Initiative (PPMI) database and consistently outperforms linear coupled imputation and reduction approaches.
Collapse
Affiliation(s)
- Maxime Peralta
- Laboratoire Traitement du Signal et de l'Image - INSERM UMR 1099, Université de Rennes 1, F-35000 Rennes, France
| | - Pierre Jannin
- Laboratoire Traitement du Signal et de l'Image - INSERM UMR 1099, Université de Rennes 1, F-35000 Rennes, France
| | - Claire Haegelen
- Laboratoire Traitement du Signal et de l'Image - INSERM UMR 1099, Université de Rennes 1, F-35000 Rennes, France; Neurosurgery Department, Centre Hospitalier Universitaire de Rennes, F-35000 Rennes, France
| | - John S H Baxter
- Laboratoire Traitement du Signal et de l'Image - INSERM UMR 1099, Université de Rennes 1, F-35000 Rennes, France.
| |
Collapse
|