1
|
Ibrahimi E, Lopes MB, Dhamo X, Simeon A, Shigdel R, Hron K, Stres B, D’Elia D, Berland M, Marcos-Zambrano LJ. Overview of data preprocessing for machine learning applications in human microbiome research. Front Microbiol 2023; 14:1250909. [PMID: 37869650 PMCID: PMC10588656 DOI: 10.3389/fmicb.2023.1250909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/22/2023] [Indexed: 10/24/2023] Open
Abstract
Although metagenomic sequencing is now the preferred technique to study microbiome-host interactions, analyzing and interpreting microbiome sequencing data presents challenges primarily attributed to the statistical specificities of the data (e.g., sparse, over-dispersed, compositional, inter-variable dependency). This mini review explores preprocessing and transformation methods applied in recent human microbiome studies to address microbiome data analysis challenges. Our results indicate a limited adoption of transformation methods targeting the statistical characteristics of microbiome sequencing data. Instead, there is a prevalent usage of relative and normalization-based transformations that do not specifically account for the specific attributes of microbiome data. The information on preprocessing and transformations applied to the data before analysis was incomplete or missing in many publications, leading to reproducibility concerns, comparability issues, and questionable results. We hope this mini review will provide researchers and newcomers to the field of human microbiome research with an up-to-date point of reference for various data transformation tools and assist them in choosing the most suitable transformation method based on their research questions, objectives, and data characteristics.
Collapse
Affiliation(s)
- Eliana Ibrahimi
- Department of Biology, Faculty of Natural Sciences, University of Tirana, Tirana, Albania
| | - Marta B. Lopes
- Department of Mathematics, Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal
- UNIDEMI, Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Caparica, Portugal
| | - Xhilda Dhamo
- Department of Applied Mathematics, Faculty of Natural Sciences, University of Tirana, Tirana, Albania
| | - Andrea Simeon
- BioSense Institute, University of Novi Sad, Novi Sad, Serbia
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University Olomouc, Olomouc, Czechia
| | - Blaž Stres
- Department of Catalysis and Chemical Reaction Engineering, National Institute of Chemistry, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, Institute of Sanitary Engineering, Ljubljana, Slovenia
- Department of Automation, Biocybernetics and Robotics, Jožef Stefan Institute, Ljubljana, Slovenia
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Domenica D’Elia
- Department of Biomedical Sciences, National Research Council, Institute for Biomedical Technologies, Bari, Italy
| | - Magali Berland
- INRAE, MetaGenoPolis, Université Paris-Saclay, Jouy-en-Josas, France
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
| |
Collapse
|
2
|
Jašková P, Palarea-Albaladejo J, Gába A, Dumuid D, Pedišić Ž, Pelclová J, Hron K. Compositional functional regression and isotemporal substitution analysis: Methods and application in time-use epidemiology. Stat Methods Med Res 2023; 32:2064-2080. [PMID: 37590096 PMCID: PMC10563378 DOI: 10.1177/09622802231192949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/19/2023]
Abstract
The distribution of time that people spend in physical activity of various intensities has important health implications. Physical activity (commonly categorised by the intensity into light, moderate and vigorous physical activity), sedentary behaviour and sleep, should not be analysed separately, because they are parts of a time-use composition with a natural constraint of 24 h/day. To find out how are relative reallocations of time between physical activity of various intensities associated with health, herewith we describe compositional scalar-on-function regression and a newly developed compositional functional isotemporal substitution analysis. Physical activity intensity data can be considered as probability density functions, which better reflects the continuous character of their measurement using accelerometers. These probability density functions are characterised by specific properties, such as scale invariance and relative scale, and they are geometrically represented using Bayes spaces with the Hilbert space structure. This makes possible to process them using standard methods of functional data analysis in the L 2 space, via centred logratio (clr) transformation. The scalar-on-function regression with clr transformation of the explanatory probability density functions and compositional functional isotemporal substitution analysis were applied to a dataset from a cross-sectional study on adiposity conducted among school-aged children in the Czech Republic. Theoretical reallocations of time to physical activity of higher intensities were found to be associated with larger and more progressive expected decreases in adiposity. We obtained a detailed insight into the dose-response relationship between physical activity intensity and adiposity, which was enabled by using the compositional functional approach.
Collapse
Affiliation(s)
- Paulína Jašková
- Faculty of Science, Palacký University Olomouc, Olomoucký, Czech Republic
| | - Javier Palarea-Albaladejo
- Department of Computer Science, Applied Mathematics and Statistics, University of Girona, Catalunya, Spain
| | - Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomoucký, Czech Republic
| | - Dorothea Dumuid
- Alliance for Research in Exercice, Nutrition and Activity, Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
- Centre for Adolescent Health, Murdoch Children’s Research Institute, Parkville, VC, Australia
| | - Željko Pedišić
- Institute for Health and Sport, Victoria University, Melbourne, Australia
| | - Jana Pelclová
- Faculty of Physical Culture, Palacký University Olomouc, Olomoucký, Czech Republic
| | - Karel Hron
- Faculty of Science, Palacký University Olomouc, Olomoucký, Czech Republic
| |
Collapse
|
3
|
Lund Rasmussen C, Gába A, Stanford T, Dygrýn J, Dumuid D, Janda D, Hron K. The Goldilocks Day for healthy adiposity measures among children and adolescents. Front Public Health 2023; 11:1158634. [PMID: 37841713 PMCID: PMC10569221 DOI: 10.3389/fpubh.2023.1158634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 09/12/2023] [Indexed: 10/17/2023] Open
Abstract
Background The optimal balance of time spent on daily movement behaviors ("The Goldilocks Day") associated with childhood obesity remains unknown. Objective To estimate the optimal durations of sleep, sedentary behavior (SB), light physical activity (LPA), and moderate-to-vigorous physical activity (MPVA) associated with excess adiposity in a paediatric population. Methods Accelerometer-measured 24-h movement behaviors were obtained from 659 Czech children and adolescents (8-18-year-olds). Adiposity indicators were body mass index z-score, fat mass percentage, fat-free mass index, and visceral adipose tissue. Excess adiposity was defined as exceeding the 85th percentile for an adiposity indicator. Compositional regression analyses were used investigate the associations between movement behaviors and adiposity indicators and estimating "The Goldilocks Day." Results The movement behavior composition was associated with visceral adipose tissue (Fdf1 = 3,df2 = 317 = 3.672, p = 0.013) and fat mass percentage (Fdf1 = 3,df2 = 289 = 2.733, p = 0.044) among children and adolescents. The Goldilocks Day consisted of 8.5 h of sleep, 10.8 h of SB, 3.9 h of LPA, and 0.8 h of MVPA among children and 7.5 h of sleep, 12.4 h of SB, 3.6 h of LPA, and 0.5 h of MVPA among adolescents. Conclusion Optimizing the time spent sleeping, and in sedentary and physical activities appears to be important in the prevention of excess adiposity.
Collapse
Affiliation(s)
- Charlotte Lund Rasmussen
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
- Department of Public Health and Nursing, Norwegian University of Science and Technology, Trondheim, Norway
- School of Allied Health, Curtin University, Perth, WA, Australia
| | - Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Tyman Stanford
- Alliance for Research in Exercise, Nutrition, and Activity, Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
| | - Jan Dygrýn
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Dorothea Dumuid
- Alliance for Research in Exercise, Nutrition, and Activity, Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
| | - David Janda
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czechia
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czechia
| |
Collapse
|
4
|
D’Elia D, Truu J, Lahti L, Berland M, Papoutsoglou G, Ceci M, Zomer A, Lopes MB, Ibrahimi E, Gruca A, Nechyporenko A, Frohme M, Klammsteiner T, Pau ECDS, Marcos-Zambrano LJ, Hron K, Pio G, Simeon A, Suharoschi R, Moreno-Indias I, Temko A, Nedyalkova M, Apostol ES, Truică CO, Shigdel R, Telalović JH, Bongcam-Rudloff E, Przymus P, Jordamović NB, Falquet L, Tarazona S, Sampri A, Isola G, Pérez-Serrano D, Trajkovik V, Klucar L, Loncar-Turukalo T, Havulinna AS, Jansen C, Bertelsen RJ, Claesson MJ. Advancing microbiome research with machine learning: key findings from the ML4Microbiome COST action. Front Microbiol 2023; 14:1257002. [PMID: 37808321 PMCID: PMC10558209 DOI: 10.3389/fmicb.2023.1257002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/05/2023] [Indexed: 10/10/2023] Open
Abstract
The rapid development of machine learning (ML) techniques has opened up the data-dense field of microbiome research for novel therapeutic, diagnostic, and prognostic applications targeting a wide range of disorders, which could substantially improve healthcare practices in the era of precision medicine. However, several challenges must be addressed to exploit the benefits of ML in this field fully. In particular, there is a need to establish "gold standard" protocols for conducting ML analysis experiments and improve interactions between microbiome researchers and ML experts. The Machine Learning Techniques in Human Microbiome Studies (ML4Microbiome) COST Action CA18131 is a European network established in 2019 to promote collaboration between discovery-oriented microbiome researchers and data-driven ML experts to optimize and standardize ML approaches for microbiome analysis. This perspective paper presents the key achievements of ML4Microbiome, which include identifying predictive and discriminatory 'omics' features, improving repeatability and comparability, developing automation procedures, and defining priority areas for the novel development of ML methods targeting the microbiome. The insights gained from ML4Microbiome will help to maximize the potential of ML in microbiome research and pave the way for new and improved healthcare practices.
Collapse
Affiliation(s)
- Domenica D’Elia
- Department of Biomedical Sciences, National Research Council, Institute for Biomedical Technologies, Bari, Italy
| | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Magali Berland
- Université Paris-Saclay, INRAE, MetaGenoPolis, Jouy-en-Josas, France
| | - Georgios Papoutsoglou
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece
- Department of Computer Science, University of Crete, Heraklion, Greece
| | - Michelangelo Ceci
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Aldert Zomer
- Department of Biomolecular Health Sciences (Infectious Diseases and Immunology), Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands
| | - Marta B. Lopes
- Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal
- UNIDEMI, Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Caparica, Portugal
| | - Eliana Ibrahimi
- Department of Biology, University of Tirana, Tirana, Albania
| | - Aleksandra Gruca
- Department of Computer Networks and Systems, Silesian University of Technology, Gliwice, Poland
| | - Alina Nechyporenko
- Systems Engineering Department, Kharkiv National University of Radio Electronics, Kharkiv, Ukraine
- Department of Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Germany
| | - Marcus Frohme
- Department of Molecular Biotechnology and Functional Genomics, Technical University of Applied Sciences Wildau, Wildau, Germany
| | - Thomas Klammsteiner
- Department of Microbiology, Universität Innsbruck, Innsbruck, Austria
- Department of Ecology, Universität Innsbruck, Innsbruck, Austria
| | - Enrique Carrillo-de Santa Pau
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czechia
| | - Gianvito Pio
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Andrea Simeon
- BioSense Institute, University of Novi Sad, Novi Sad, Serbia
| | - Ramona Suharoschi
- Molecular Nutrition and Proteomics Research Laboratory, Department of Food Science, University of Agricultural Sciences and Veterinary Medicine of Cluj-Napoca, Cluj-Napoca, Romania
| | - Isabel Moreno-Indias
- Department of Endocrinology and Nutrition, Virgen de la Victoria University Hospital, the Biomedical Research Institute of Malaga and Platform in Nanomedicine (IBIMA-BIONAND Platform), University of Malaga, Malaga, Spain
| | - Andriy Temko
- Department of Electrical and Electronic Engineering, University College Cork, Cork, Ireland
| | | | - Elena-Simona Apostol
- Computer Science and Engineering Department, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Ciprian-Octavian Truică
- Computer Science and Engineering Department, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Jasminka Hasić Telalović
- Department of Computer Science, University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
| | - Erik Bongcam-Rudloff
- Swedish University of Agricultural Sciences, Department of Animal Breeding and Genetics, Uppsala, Sweden
| | | | - Naida Babić Jordamović
- Computational Biology, International Centre for Genetic Engineering and Biotechnology, Trieste, Italy
- Verlab Research Institute for BIomedical Engineering, Medical Devices and Artificial Intelligence, Sarajevo, Bosnia and Herzegovina
| | - Laurent Falquet
- University of Fribourg and Swiss Institute of Bioinformatics, Fribourg, Switzerland
| | - Sonia Tarazona
- Department of Applied Statistics and Operations Research and Quality, Universitat Politècnica de València, València, Spain
| | - Alexia Sampri
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, United Kingdom
| | - Gaetano Isola
- Department of General Surgery and Surgical-Medical Specialties, School of Dentistry, University of Catania, Catania, Italy
| | - David Pérez-Serrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, CEI UAM+CSIC, Madrid, Spain
| | | | - Lubos Klucar
- Institute of Molecular Biology, Slovak Academy of Sciences, Bratislava, Slovakia
| | | | - Aki S. Havulinna
- Finnish Institute for Health and Welfare, Helsinki, Finland
- Institute for Molecular Medicine Finland, FIMM-HiLIFE, Helsinki, Finland
| | - Christian Jansen
- Biome Diagnostics GmbH, Vienna, Austria
- Institute of Science and Technology Austria (ISTA), Klosterneuburg, Austria
| | | | | |
Collapse
|
5
|
Segura-Jiménez V, Pedišić Ž, Gába A, Dumuid D, Olds T, Štefelová N, Hron K, Gómez-Martínez S, Marcos A, Castro-Piñero J. Longitudinal reallocations of time between 24-h movement behaviours and their associations with inflammation in children and adolescents: the UP&DOWN study. Int J Behav Nutr Phys Act 2023; 20:72. [PMID: 37322451 DOI: 10.1186/s12966-023-01471-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 05/27/2023] [Indexed: 06/17/2023] Open
Abstract
BACKGROUND While there is evidence that physical activity, sedentary behaviour (SB) and sleep may all be associated with modified levels of inflammatory markers in adolescents and children, associations with one movement behaviour have not always been adjusted for other movement behaviours, and few studies have considered all movement behaviours in the 24-hour day as an exposure. PURPOSE The aim of the study was to explore how longitudinal reallocations of time between moderate-to-vigorous physical activity (MVPA), light physical activity (LPA), SB and sleep are associated with changes in inflammatory markers in children and adolescents. METHODS A total of 296 children/adolescents participated in a prospective cohort study with a 3-year follow-up. MVPA, LPA and SB were assessed by accelerometers. Sleep duration was assessed using the Health Behavior in School-aged Children questionnaire. Longitudinal compositional regression models were used to explore how reallocations of time between movement behaviours are associated with changes in inflammatory markers. RESULTS Reallocations of time from SB to sleep were associated with increases in C3 levels (difference for 60 min/d reallocation [d60] = 5.29 mg/dl; 95% confidence interval [CI] = 0.28, 10.29) and TNF-α (d60 = 1.81 mg/dl; 95% CI = 0.79, 15.41) levels. Reallocations from LPA to sleep were also associated with increases in C3 levels (d60 = 8.10 mg/dl; 95% CI = 0.79, 15.41). Reallocations from LPA to any of the remaining time-use components were associated with increases in C4 levels (d60 ranging from 2.54 to 3.63 mg/dl; p < 0.05), while any reallocation of time away from MVPA was associated with unfavourable changes in leptin (d60 ranging from 3088.44 to 3448.07 pg/ml; p < 0.05). CONCLUSIONS Reallocations of time between 24-h movement behaviours are prospectively associated with some inflammatory markers. Reallocating time away from LPA appears to be most consistently unfavourably associated with inflammatory markers. Given that higher levels of inflammation during childhood and adolescence are associated with an increased risk of chronic diseases in adulthood, children and adolescents should be encouraged to maintain or increase the level of LPA to preserve a healthy immune system.
Collapse
Affiliation(s)
- Víctor Segura-Jiménez
- UGC Medicina Física y Rehabilitación, Hospital de Neurotraumatología y Rehabilitación, Hospital Universitario Virgen de las Nieves, Granada, Spain.
- Instituto de Investigación Biosanitaria ibs.GRANADA, Avda. de Madrid, 15, Granada, 18012, Spain.
- GALENO research group, Department of Physical Education, Faculty of Education Sciences, University of Cádiz, Av. República Saharaui, 12, Puerto Real, Cádiz, 11519, Spain.
- Biomedical Research and Innovation Institute of Cádiz (INiBICA) Research Unit, Puerta del Mar University Hospital, Cádiz, Spain.
| | - Željko Pedišić
- Institute for Health and Sport, Victoria University, Melbourne, Australia
| | - Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Dorothea Dumuid
- Alliance for Research in Exercise, Nutrition and Activity, Allied Health & Human Performance, University of South Australia, Adelaide, Australia
- Centre for Adolescent Health, Murdoch Children's Research Institute, Melbourne, Australia
| | - Timothy Olds
- Alliance for Research in Exercise, Nutrition and Activity, Allied Health & Human Performance, University of South Australia, Adelaide, Australia
| | - Nikola Štefelová
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
- Czech Advanced Technology and Research Institute, Palacký University, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Sonia Gómez-Martínez
- Immunonutrition Group, Department of Metabolism and Nutrition, Institute of Food Science, Technology and Nutrition (ICTAN), Spanish National Research Council (CSIC), Madrid, Spain
| | - Ascensión Marcos
- Immunonutrition Group, Department of Metabolism and Nutrition, Institute of Food Science, Technology and Nutrition (ICTAN), Spanish National Research Council (CSIC), Madrid, Spain
| | - José Castro-Piñero
- GALENO research group, Department of Physical Education, Faculty of Education Sciences, University of Cádiz, Av. República Saharaui, 12, Puerto Real, Cádiz, 11519, Spain.
- Biomedical Research and Innovation Institute of Cádiz (INiBICA) Research Unit, Puerta del Mar University Hospital, Cádiz, Spain.
| |
Collapse
|
6
|
Lund Rasmussen C, Holtermann A, Hron K, Dumuid D, Nørregaard Rasmussen CD. The Use of Time Flow Analysis to Describe Changes in Physical Ergonomic Work Behaviours Following a Cluster-Randomized Controlled Participatory Ergonomic Intervention. Ann Work Expo Health 2022; 66:1199-1209. [PMID: 35975806 PMCID: PMC9664235 DOI: 10.1093/annweh/wxac058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 07/08/2022] [Accepted: 08/02/2022] [Indexed: 12/14/2022] Open
Abstract
AIM Evaluations of participatory ergonomic interventions are often challenging as these types of interventions are tailored to the context and need of the workplace in which they are implemented. We aimed to describe how time flow analysis can be used to describe changes in work behaviours following a participatory ergonomic intervention. METHOD This study was based on data from a two-arm cluster-randomized controlled trial with 29 childcare institutions and 116 workers (intervention: n = 60, control: n = 56). Physical behaviours at work were technically measured at baseline and 4-month follow-up. Physical behaviours were expressed in terms of relative work time spent forward bending of the back ≥30°, kneeling, active (i.e. walking, stair climbing and running) and sedentary. Average time flow from baseline to follow-up were calculated for both groups to investigate if work time was allocated differently at follow-up. RESULTS A total of 116 workers (60 in the intervention and 56 in the control group) had valid accelerometer at baseline and follow-up. The largest group difference in time flowing from baseline to follow-up was observed for forward bending of the back and time spent kneeling. Compared to the control, the intervention group had less time flowing from forward bending of the back to kneeling (intervention: +11 min day, control: +16 min day) and more time flowing from kneeling to sedentary behaviours (intervention: +15 min day, control: +10 min day). CONCLUSION The results of this study showed that time flow analysis can be used to reveal changes in work time-use following a participatory ergonomic intervention. For example, the analysis revealed that the intervention group had replaced more work time spent kneeling with sedentary behaviours compared to the control group. This type of information on group differences in time reallocations would not have been possible to obtain by comparing group differences in work time-use following the intervention, supporting the usefulness of time flow analysis as a tool to evaluate complex, context-specific interventions.
Collapse
Affiliation(s)
| | - Andreas Holtermann
- National Research Centre for the Working Environment, Lersø Parkalle 105, 2100 Copenhagen, Denmark
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Dorothea Dumuid
- Alliance for Research in Exercise, Nutrition, and Activity (ARENA), Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
| | | |
Collapse
|
7
|
Fačevicová K, Filzmoser P, Hron K. Compositional cubes: a new concept for multi-factorial compositions. Stat Pap (Berl) 2022; 64:955-985. [PMID: 35971537 PMCID: PMC9366844 DOI: 10.1007/s00362-022-01350-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 05/24/2022] [Accepted: 07/26/2022] [Indexed: 11/29/2022]
Abstract
Compositional data are commonly known as multivariate observations carrying relative information. Even though the case of vector or even two-factorial compositional data (compositional tables) is already well described in the literature, there is still a need for a comprehensive approach to the analysis of multi-factorial relative-valued data. Therefore, this contribution builds around the current knowledge about compositional data a general theoretical framework for k-factorial compositional data. As a main finding it turns out that, similar to the case of compositional tables, also the multi-factorial structures can be orthogonally decomposed into an independent and several interactive parts and, moreover, a coordinate representation allowing for their separate analysis by standard analytical methods can be constructed. For the sake of simplicity, these features are explained in detail for the case of three-factorial compositions (compositional cubes), followed by an outline covering the general case. The three-dimensional structure is analyzed in depth in two practical examples, dealing with systems of spatial and time dependent compositional cubes. The methodology is implemented in the R package robCompositions.
Collapse
Affiliation(s)
- Kamila Fačevicová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - Peter Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, TU Wien, Wiedner Hauptstraße 8-10, 1040 Vienna, Austria
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, 17. listopadu 12, 77146 Olomouc, Czech Republic
| |
Collapse
|
8
|
Rubín L, Gába A, Pelclová J, Štefelová N, Jakubec L, Dygrýn J, Hron K. Changes in sedentary behavior patterns during the transition from childhood to adolescence and their association with adiposity: a prospective study based on compositional data analysis. Arch Public Health 2022; 80:1. [PMID: 34983643 PMCID: PMC8725475 DOI: 10.1186/s13690-021-00755-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 12/08/2021] [Indexed: 01/04/2023] Open
Abstract
Background To date, no longitudinal study using a compositional approach has examined sedentary behavior (SB) patterns in relation to adiposity in the pediatric population. Therefore, our aims were to (1) investigate the changes in SB patterns and adiposity from childhood to adolescence, (2) analyze the prospective compositional associations between changes in SB patterns and adiposity, and (3) estimate the changes in adiposity associated with substituting SB with physical activity (PA) of different intensities. Methods The study presents a longitudinal design with a 5-year follow-up. A total of 88 participants (61% girls) were included in the analysis. PA and SB were monitored for seven consecutive days using a hip-worn accelerometer. Adiposity markers (fat mass percentage [FM%], fat mass index [FMI], and visceral adiposity tissue [VAT]) were assessed using the multi-frequency bioimpedance analysis. The prospective associations were examined using compositional data analysis. Results Over the follow-up period, the proportion of time spent in total SB increased by 154.8 min/day (p < 0.001). The increase in total SB was caused mainly by an increase in middle and long sedentary bouts, as these SB periods increased by 79.8 min/day and 62.0 min/day (p < 0.001 for both), respectively. FM%, FMI, and VAT increased by 2.4% points, 1.0 kg/m2, and 31.5 cm2 (p < 0.001 for all), respectively. Relative to the remaining movement behaviors, the increase in time spent in middle sedentary bouts was significantly associated with higher FM% (βilr1 = 0.27, 95% confidence interval [CI]: 0.02 to 0.53) at follow-up. Lower VAT by 3.3% (95% CI: 0.8 to 5.7), 3.8% (95% CI: 0.03 to 7.4), 3.9% (95% CI: 0.8 to 6.9), and 3.8% (95% CI: 0.7 to 6.9) was associated with substituting 15 min/week spent in total SB and in short, middle, and long sedentary bouts, respectively, with an equivalent amount of time spent in vigorous PA. Conclusions This study showed unfavorable changes in SB patterns and adiposity status in the transition from childhood to adolescence. Incorporating high-intensity PA at the expense of SB appears to be an appropriate approach to reduce the risk of excess adiposity in the pediatric population. Supplementary Information The online version contains supplementary material available at 10.1186/s13690-021-00755-5.
Collapse
Affiliation(s)
- Lukáš Rubín
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic.,Faculty of Science, Humanities and Education, Technical University of Liberec, Liberec, Czech Republic
| | - Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic.
| | - Jana Pelclová
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Nikola Štefelová
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Lukáš Jakubec
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Jan Dygrýn
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Karel Hron
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| |
Collapse
|
9
|
Štefelová N, Palarea‐Albaladejo J, Hron K. Weighted pivot coordinates for partial least squares‐based marker discovery in high‐throughput compositional data. Stat Anal Data Min 2021. [DOI: 10.1002/sam.11514] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
| | | | - Karel Hron
- Faculty of Science Palacký University Olomouc Czech Republic
| |
Collapse
|
10
|
Rasmussen CL, Dumuid D, Hron K, Gupta N, Jørgensen MB, Nabe-Nielsen K, Holtermann A. Day-to-day pattern of work and leisure time physical behaviours: are low socioeconomic status adults couch potatoes or work warriors? BMC Public Health 2021; 21:1342. [PMID: 34233666 PMCID: PMC8265073 DOI: 10.1186/s12889-021-11409-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 06/28/2021] [Indexed: 11/13/2022] Open
Abstract
Background Most studies on day-to-day patterns of physical behaviours (i.e. physical activities and sedentary behaviour) are based on adults with high socioeconomic status (SES) and without differentiating between work and leisure time. Thus, we aimed to characterise the day-to-day leisure time physical behaviours patterns among low SES adults and investigate the influence of work physical behaviours. Methods This cross-sectional study included 963 adults from low SES occupations (e.g. manufacturing, cleaning and transportation). The participants wore accelerometers for 1–7 days to measure physical behaviours during work and leisure time, expressed as time-use compositions consisting of time spent sedentary, standing or being active (walking, running, stair climbing, or cycling). Compositional multivariate multilevel models were used to regress daily leisure time-use composition against work time-use compositions. Interaction between weekday and (1) type of day, (i.e., work/non-work) and (2) the work time-use composition were tested. Compositional isotemporal substitution was used to interpret the estimates from the models. Results Each weekday, workers consistently spent most leisure time being sedentary and most work time standing. Leisure time physical behaviours were associated with type of day (p < 0.005, more sedentary on workdays vs. non-workdays), weekday (p < 0.005, more sedentary on Friday, Saturday and Sunday), standing work (p < 0.005, more sedentary and less standing and active leisure time on Sunday), and active work (p < 0.005, less sedentary and more standing and active leisure time on Sunday). Sedentary leisure time increased by 18 min, while standing and active leisure time decreased by 11 and 7 min, respectively, when 30 min were reallocated to standing at work on Sunday. Conversely, sedentary leisure time decreased by 25 min, and standing and active leisure time increased by 15 and 10 min, respectively, when 30 min were reallocated to active time at work on Sunday. Conclusions While low SES adults’ leisure time was mostly sedentary, their work time was predominantly standing. Work physical behaviours differently influenced day-to-day leisure time behaviours. Thus, public health initiatives aiming to change leisure time behaviours among low SES adults should consider the influence of work physical behaviours. Supplementary Information The online version contains supplementary material available at 10.1186/s12889-021-11409-0.
Collapse
Affiliation(s)
| | - Dorothea Dumuid
- Allied Health & Human Performance, Alliance for Research in Exercise, Nutrition, and Activity (ARENA), University of South Australia, Adelaide, South Australia, Australia
| | - Karel Hron
- Allied Health & Human Performance, Alliance for Research in Exercise, Nutrition, and Activity (ARENA), University of South Australia, Adelaide, South Australia, Australia.,Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czech Republic
| | - Nidhi Gupta
- National Research Centre for the Working Environment, Copenhagen, Denmark
| | | | - Kirsten Nabe-Nielsen
- Section of Social Medicine, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - Andreas Holtermann
- National Research Centre for the Working Environment, Copenhagen, Denmark.,Department of Sports Science and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
11
|
Pelclová J, Štefelová N, Olds T, Dumuid D, Hron K, Chastin S, Pedišić Ž. A study on prospective associations between adiposity and 7-year changes in movement behaviors among older women based on compositional data analysis. BMC Geriatr 2021; 21:203. [PMID: 33757454 PMCID: PMC7988941 DOI: 10.1186/s12877-021-02148-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 03/10/2021] [Indexed: 12/25/2022] Open
Abstract
INTRODUCTION It is unclear whether adiposity leads to changes in movement behaviors, and there is a lack of compositional analyses of longitudinal data which focus on these associations. Using a compositional approach, this study aimed to examine the associations between baseline adiposity and 7-year changes in physical activity (PA) and sedentary behavior (SB) among elderly women. We also explored the longitudinal associations between change in adiposity and change in movement-behavior composition. METHODS This longitudinal study included 176 older women (mean baseline age 62.8 (4.1) years) from Central Europe. Movement behavior was assessed by accelerometers and adiposity was measured by bioelectrical impedance analysis at baseline and follow-up. A set of multivariate least-squares regression analyses was used to examine the associations of baseline adiposity and longitudinal changes in adiposity as explanatory variables with longitudinal changes in a 3-part movement-behavior composition consisting of SB, light PA (LPA) and moderate-to-vigorous PA (MVPA) as outcome variables. RESULTS No significant associations were found between baseline adiposity and longitudinal changes in the movement-behavior composition (p > 0.05). We found significant associations of changes in body mass index (BMI) and fat mass percentage (FM%) with changes in the movement-behavior composition. An increase in BMI was associated with an increase of SB at the expense of LPA and MVPA (β = 0.042, p = 0.009) and with a decrease of MVPA in favor of SB and LPA (β = - 0.059, p = 0.037). An increase in FM% was significantly associated only with an increase of SB at the expense of LPA and MVPA (β = 0.019, p = 0.031). CONCLUSIONS This study did not support the assumption that baseline adiposity is associated with longitudinal changes in movement behaviors among elderly women, but we found evidence for change-to-change associations, suggesting that a 7-year increase in adiposity is associated with a concurrent increase of SB at the expense of LPA and MVPA and with a concurrent decrease of MVPA in favor of LPA and SB. Public health interventions are needed to simultaneously prevent weight gain and promote physically active lifestyle among elderly women.
Collapse
Affiliation(s)
- Jana Pelclová
- Faculty of Physical Culture, Palacký University Olomouc, Institute of Active Lifestyle, Tř. Míru 117, 771 11, Olomouc, Czech Republic.
| | - Nikola Štefelová
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Timothy Olds
- Alliance for Research in Exercise, Nutrition and Activity (ARENA), University of South Australia, Adelaide, South Australia, Australia
| | - Dorothea Dumuid
- Alliance for Research in Exercise, Nutrition and Activity (ARENA), University of South Australia, Adelaide, South Australia, Australia
| | - Karel Hron
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Sebastien Chastin
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK
| | - Željko Pedišić
- Institute for Health and Sport, Victoria University, Melbourne, Australia
| |
Collapse
|
12
|
Moreno-Indias I, Lahti L, Nedyalkova M, Elbere I, Roshchupkin G, Adilovic M, Aydemir O, Bakir-Gungor B, Santa Pau ECD, D’Elia D, Desai MS, Falquet L, Gundogdu A, Hron K, Klammsteiner T, Lopes MB, Marcos-Zambrano LJ, Marques C, Mason M, May P, Pašić L, Pio G, Pongor S, Promponas VJ, Przymus P, Saez-Rodriguez J, Sampri A, Shigdel R, Stres B, Suharoschi R, Truu J, Truică CO, Vilne B, Vlachakis D, Yilmaz E, Zeller G, Zomer AL, Gómez-Cabrero D, Claesson MJ. Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions. Front Microbiol 2021; 12:635781. [PMID: 33692771 PMCID: PMC7937616 DOI: 10.3389/fmicb.2021.635781] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 01/28/2021] [Indexed: 12/23/2022] Open
Abstract
The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies, the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies.
Collapse
Affiliation(s)
- Isabel Moreno-Indias
- Instituto de Investigación Biomédica de Málaga (IBIMA), Unidad de Gestión Clìnica de Endocrinologìa y Nutrición, Hospital Clìnico Universitario Virgen de la Victoria, Universidad de Málaga, Málaga, Spain
- Centro de Investigación Biomeìdica en Red de Fisiopatologtìa de la Obesidad y la Nutrición (CIBEROBN), Instituto de Salud Carlos III, Madrid, Spain
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Miroslava Nedyalkova
- Human Genetics and Disease Mechanisms, Latvian Biomedical Research and Study Centre, Riga, Latvia
| | - Ilze Elbere
- Latvian Biomedical Research and Study Centre, Riga, Latvia
| | | | - Muhamed Adilovic
- Department of Genetics and Bioengineering, International University of Sarajevo, Sarajevo, Bosnia and Herzegovina
| | - Onder Aydemir
- Department of Electrical and Electronics Engineering, Karadeniz Technical University, Trabzon, Turkey
| | - Burcu Bakir-Gungor
- Department of Computer Engineering, Abdullah Gul University, Kayseri, Turkey
| | | | - Domenica D’Elia
- Department for Biomedical Sciences, Institute for Biomedical Technologies, National Research Council, Bari, Italy
| | - Mahesh S. Desai
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
- Odense Research Center for Anaphylaxis, Department of Dermatology and Allergy Center, Odense University Hospital, University of Southern Denmark, Odense, Denmark
| | - Laurent Falquet
- Department of Biology, University of Fribourg, Fribourg, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Aycan Gundogdu
- Department of Microbiology and Clinical Microbiology, Faculty of Medicine, Erciyes University, Kayseri, Turkey
- Metagenomics Laboratory, Genome and Stem Cell Center (GenKök), Erciyes University, Kayseri, Turkey
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czechia
| | | | - Marta B. Lopes
- NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, Caparica, Portugal
- Centro de Matemática e Aplicações (CMA), FCT, UNL, Caparica, Portugal
| | - Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
| | - Cláudia Marques
- CINTESIS, NOVA Medical School, NMS, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Michael Mason
- Computational Oncology, Sage Bionetworks, Seattle, WA, United States
| | - Patrick May
- Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Lejla Pašić
- Sarajevo Medical School, University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
| | - Gianvito Pio
- Department of Computer Science, University of Bari Aldo Moro, Bari, Italy
| | - Sándor Pongor
- Faculty of Information Tehnology and Bionics, Pázmány University, Budapest, Hungary
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Piotr Przymus
- Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruñ, Poland
| | - Julio Saez-Rodriguez
- Institute of Computational Biomedicine, Heidelberg University, Faculty of Medicine and Heidelberg University Hospital, Heidelberg, Germany
| | - Alexia Sampri
- Division of Informatics, Imaging and Data Sciences, School of Health Sciences, University of Manchester, Manchester, United Kingdom
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Blaz Stres
- Jozef Stefan Institute, Ljubljana, Slovenia
- Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, University of Ljubljana, Ljubljana, Slovenia
| | - Ramona Suharoschi
- Molecular Nutrition and Proteomics Lab, Faculty of the Food Science and Technology, Institute of Life Sciences, University of Agricultural Sciences and Veterinary Medicine of Cluj-Napoca, Cluj-Napoca, Romania
| | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Ciprian-Octavian Truică
- Department of Computer Science and Engineering, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Baiba Vilne
- Bioinformatics Research Unit, Riga Stradins University, Riga, Latvia
| | - Dimitrios Vlachakis
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Ercument Yilmaz
- Department of Computer Technologies, Karadeniz Technical University, Trabzon, Turkey
| | - Georg Zeller
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
| | - Aldert L. Zomer
- Department of Infectious Diseases and Immunology, Faculty of Veterinary Medicine, Utrecht University, Utrecht, Netherlands
| | - David Gómez-Cabrero
- Navarrabiomed, Complejo Hospitalario de Navarra (CHN), IdiSNA, Universidad Pública de Navarra (UPNA), Pamplona, Spain
| | - Marcus J. Claesson
- School of Microbiology and APC Microbiome Ireland, University College Cork, Cork, Ireland
| |
Collapse
|
13
|
Marcos-Zambrano LJ, Karaduzovic-Hadziabdic K, Loncar Turukalo T, Przymus P, Trajkovik V, Aasmets O, Berland M, Gruca A, Hasic J, Hron K, Klammsteiner T, Kolev M, Lahti L, Lopes MB, Moreno V, Naskinova I, Org E, Paciência I, Papoutsoglou G, Shigdel R, Stres B, Vilne B, Yousef M, Zdravevski E, Tsamardinos I, Carrillo de Santa Pau E, Claesson MJ, Moreno-Indias I, Truu J. Applications of Machine Learning in Human Microbiome Studies: A Review on Feature Selection, Biomarker Identification, Disease Prediction and Treatment. Front Microbiol 2021; 12:634511. [PMID: 33737920 PMCID: PMC7962872 DOI: 10.3389/fmicb.2021.634511] [Citation(s) in RCA: 113] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 02/01/2021] [Indexed: 12/19/2022] Open
Abstract
The number of microbiome-related studies has notably increased the availability of data on human microbiome composition and function. These studies provide the essential material to deeply explore host-microbiome associations and their relation to the development and progression of various complex diseases. Improved data-analytical tools are needed to exploit all information from these biological datasets, taking into account the peculiarities of microbiome data, i.e., compositional, heterogeneous and sparse nature of these datasets. The possibility of predicting host-phenotypes based on taxonomy-informed feature selection to establish an association between microbiome and predict disease states is beneficial for personalized medicine. In this regard, machine learning (ML) provides new insights into the development of models that can be used to predict outputs, such as classification and prediction in microbiology, infer host phenotypes to predict diseases and use microbial communities to stratify patients by their characterization of state-specific microbial signatures. Here we review the state-of-the-art ML methods and respective software applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on the application of ML in microbiome studies related to association and clinical use for diagnostics, prognostics, and therapeutics. Although the data presented here is more related to the bacterial community, many algorithms could be applied in general, regardless of the feature type. This literature and software review covering this broad topic is aligned with the scoping review methodology. The manual identification of data sources has been complemented with: (1) automated publication search through digital libraries of the three major publishers using natural language processing (NLP) Toolkit, and (2) an automated identification of relevant software repositories on GitHub and ranking of the related research papers relying on learning to rank approach.
Collapse
Affiliation(s)
- Laura Judith Marcos-Zambrano
- Computational Biology Group, Precision Nutrition and Cancer Research Program, IMDEA Food Institute, Madrid, Spain
| | | | | | - Piotr Przymus
- Faculty of Mathematics and Computer Science, Nicolaus Copernicus University, Toruń, Poland
| | - Vladimir Trajkovik
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North Macedonia
| | - Oliver Aasmets
- Institute of Genomics, Estonian Genome Centre, University of Tartu, Tartu, Estonia
- Department of Biotechnology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Magali Berland
- Université Paris-Saclay, INRAE, MGP, Jouy-en-Josas, France
| | - Aleksandra Gruca
- Department of Computer Networks and Systems, Silesian University of Technology, Gliwice, Poland
| | - Jasminka Hasic
- University Sarajevo School of Science and Technology, Sarajevo, Bosnia and Herzegovina
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czechia
| | | | - Mikhail Kolev
- South West University “Neofit Rilski”, Blagoevgrad, Bulgaria
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Marta B. Lopes
- NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT, UNL, Caparica, Portugal
- Centro de Matemática e Aplicações (CMA), FCT, UNL, Caparica, Portugal
| | - Victor Moreno
- Oncology Data Analytics Program, Catalan Institute of Oncology (ICO)Barcelona, Spain
- Colorectal Cancer Group, Institut de Recerca Biomedica de Bellvitge (IDIBELL), Barcelona, Spain
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Barcelona, Spain
- Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain
| | - Irina Naskinova
- South West University “Neofit Rilski”, Blagoevgrad, Bulgaria
| | - Elin Org
- Institute of Genomics, Estonian Genome Centre, University of Tartu, Tartu, Estonia
| | - Inês Paciência
- EPIUnit – Instituto de Saúde Pública da Universidade do Porto, Porto, Portugal
| | | | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Blaz Stres
- Group for Microbiology and Microbial Biotechnology, Department of Animal Science, University of Ljubljana, Ljubljana, Slovenia
| | - Baiba Vilne
- Bioinformatics Research Unit, Riga Stradins University, Riga, Latvia
| | - Malik Yousef
- Department of Information Systems, Zefat Academic College, Zefat, Israel
- Galilee Digital Health Research Center (GDH), Zefat Academic College, Zefat, Israel
| | - Eftim Zdravevski
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North Macedonia
| | | | | | - Marcus J. Claesson
- School of Microbiology & APC Microbiome Ireland, University College Cork, Cork, Ireland
| | - Isabel Moreno-Indias
- Unidad de Gestión Clínica de Endocrinología y Nutrición, Instituto de Investigación Biomédica de Málaga (IBIMA), Hospital Clínico Universitario Virgen de la Victoria, Universidad de Málaga, Málaga, Spain
- Centro de Investigación Biomédica en Red de Fisiopatología de la Obesidad y la Nutrición (CIBEROBN), Instituto de Salud Carlos III, Madrid, Spain
| | - Jaak Truu
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| |
Collapse
|
14
|
|
15
|
Gába A, Dygrýn J, Štefelová N, Rubín L, Hron K, Jakubec L, Pedišić Ž. How do short sleepers use extra waking hours? A compositional analysis of 24-h time-use patterns among children and adolescents. Int J Behav Nutr Phys Act 2020; 17:104. [PMID: 32795287 PMCID: PMC7427741 DOI: 10.1186/s12966-020-01004-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Accepted: 07/29/2020] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND To examine compositional associations between short sleep duration and sedentary behavior (SB), light physical activity (LPA) and moderate-to-vigorous physical activity (MVPA) among children and adolescents. METHODS Multi-day 24-h data on sleep, SB, LPA and MVPA were collected using accelerometers among 343 children (8-13 years old) and 316 adolescents (14-18 years old). Children and adolescents with sleep duration of < 9 and < 8 h, respectively, were classified as short sleepers. Robust compositional regression analysis was used to examine the associations between short sleep duration and the waking-time composition. RESULTS Seventy-one percent of children and 75.3% of adolescents were classified as short sleepers. In children, being a short sleeper was associated with higher SB by 95 min/day (p < 0.001) and lower MVPA by 16 min/day (p = 0.002). Specifically, it was associated with a higher amount of time spent in long sedentary bouts (βilr1 = 0.46, 95% confidence interval [CI] = 0.29 to 0.62) and lower amounts of time spent in sporadic SB (βilr1 = - 0.17, 95% CI = -0.24 to - 0.10), sporadic LPA (βilr1 = - 0.09, 95% CI = -0.14 to - 0.04) and sporadic MVPA (βilr1 = - 0.17, 95% CI = -0.25 to - 0.10, p < 0.001 for all), relative to the remaining behaviours. In adolescents, being a short sleeper was associated with a higher amount of time spent in SB by 67 min/day (p = 0.001) and lower LPA by 2 min/day (p = 0.035). Specifically, it was associated with more time spent in sedentary bouts of 1-9 min (βilr1 = 0.08, 95% CI = 0.02 to 0.14, p = 0.007) and 10-29 min (βilr1 = 0.10, 95% CI = 0.02 to 0.18, p = 0.015), relative to the remaining behaviours. CONCLUSIONS Among children and adolescents, short sleep duration seems to be highly prevalent and associated with less healthy waking time. Public health interventions and strategies to tackle the high prevalence of short sleep duration among children and adolescents are warranted.
Collapse
Affiliation(s)
- Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic.
| | - Jan Dygrýn
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Nikola Štefelová
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Lukáš Rubín
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic.,Faculty of Science, Humanities and Education, Technical University of Liberec, Liberec, Czech Republic
| | - Karel Hron
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Lukáš Jakubec
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Željko Pedišić
- Institute for Health and Sport, Victoria University, Melbourne, Australia
| |
Collapse
|
16
|
Talská R, Machalová J, Smýkal P, Hron K. A comparison of seed germination coefficients using functional regression. Appl Plant Sci 2020; 8:e11366. [PMID: 32995101 PMCID: PMC7507017 DOI: 10.1002/aps3.11366] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 01/12/2020] [Indexed: 05/13/2023]
Abstract
PREMISE Seed germination over time is characterized by a sigmoid curve, called a germination curve, in which the percentage (or absolute number) of seeds that have completed germination is plotted against time. A number of individual coefficients have been developed to characterize this germination curve. However, as germination is considered to be a qualitative developmental response of an individual seed that occurs at one time point, but individual seeds within a given treatment respond at different time points, it has proven difficult to develop a single index that satisfactorily incorporates both percentage and rate. The aim of this paper is to develop a new coefficient, the continuous germination index (CGI), which quantifies seed germination as a continuous process, and to compare the CGI with other commonly used indexes. METHODS To create the new index, the germination curves were smoothed using nondecreasing splines and the CGI was derived as the area under the resulting spline. For the comparison of the CGI with other common indexes, a regression model with functional response was developed. RESULTS Using both an experimentally obtained wild pea (Pisum sativum subsp. elatius) seed data set and a hypothetical data set, we showed that the CGI is able to characterize the germination process better than most other indices. The CGI captures the local behavior of the germination curves particularly well. DISCUSSION The CGI can be used advantageously for the characterization of the germination process. Moreover, B-spline coefficients extracted by its construction can be employed for the further statistical processing of germination curves using functional data analysis methods.
Collapse
Affiliation(s)
- Renáta Talská
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Faculty of Science 17 Listopadu 12 Olomouc 771 46 Czech Republic
| | - Jitka Machalová
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Faculty of Science 17 Listopadu 12 Olomouc 771 46 Czech Republic
| | - Petr Smýkal
- Department of Botany Palacký University Faculty of Science Šlechtitelů 27 Olomouc 783 71 Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Faculty of Science 17 Listopadu 12 Olomouc 771 46 Czech Republic
| |
Collapse
|
17
|
Affiliation(s)
- Jiajia Chen
- School of Statistics, Shanxi University of Finance and Economics, Taiyuan, People's Republic of China
| | - Xiaoqin Zhang
- School of Statistics, Shanxi University of Finance and Economics, Taiyuan, People's Republic of China
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| |
Collapse
|
18
|
Talská R, Menafoglio A, Hron K, Egozcue JJ, Palarea‐Albaladejo J. Weighting the domain of probability densities in functional data analysis. Stat (Int Stat Inst) 2020. [DOI: 10.1002/sta4.283] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Renáta Talská
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science Palacký University Olomouc Olomouc Czech Republic
| | | | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science Palacký University Olomouc Olomouc Czech Republic
| | - Juan José Egozcue
- Department of Civil and Environmental Engineering Polytechnic University of Catalonia Barcelona Spain
| | | |
Collapse
|
19
|
McGregor DE, Palarea-Albaladejo J, Dall PM, Hron K, Chastin S. Cox regression survival analysis with compositional covariates: Application to modelling mortality risk from 24-h physical activity patterns. Stat Methods Med Res 2020; 29:1447-1465. [PMID: 31342855 DOI: 10.1177/0962280219864125] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Survival analysis is commonly conducted in medical and public health research to assess the association of an exposure or intervention with a hard end outcome such as mortality. The Cox (proportional hazards) regression model is probably the most popular statistical tool used in this context. However, when the exposure includes compositional covariables (that is, variables representing a relative makeup such as a nutritional or physical activity behaviour composition), some basic assumptions of the Cox regression model and associated significance tests are violated. Compositional variables involve an intrinsic interplay between one another which precludes results and conclusions based on considering them in isolation as is ordinarily done. In this work, we introduce a formulation of the Cox regression model in terms of log-ratio coordinates which suitably deals with the constraints of compositional covariates, facilitates the use of common statistical inference methods, and allows for scientifically meaningful interpretations. We illustrate its practical application to a public health problem: the estimation of the mortality hazard associated with the composition of daily activity behaviour (physical activity, sitting time and sleep) using data from the U.S. National Health and Nutrition Examination Survey (NHANES).
Collapse
Affiliation(s)
- D E McGregor
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK.,Biomathematics and Statistics Scotland, Edinburgh, UK
| | | | - P M Dall
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK
| | - K Hron
- Faculty of Science, Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Sfm Chastin
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK.,Department of Movement and Sport Science, Ghent University, Ghent, Belgium
| |
Collapse
|
20
|
Renzi JP, Duchoslav M, Brus J, Hradilová I, Pechanec V, Václavek T, Machalová J, Hron K, Verdier J, Smýkal P. Physical Dormancy Release in Medicago truncatula Seeds Is Related to Environmental Variations. Plants (Basel) 2020; 9:E503. [PMID: 32295289 PMCID: PMC7238229 DOI: 10.3390/plants9040503] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 04/13/2020] [Accepted: 04/13/2020] [Indexed: 12/26/2022]
Abstract
Seed dormancy and timing of its release is an important developmental transition determining the survival of individuals, populations, and species in variable environments. Medicago truncatula was used as a model to study physical seed dormancy at the ecological and genetics level. The effect of alternating temperatures, as one of the causes releasing physical seed dormancy, was tested in 178 M. truncatula accessions over three years. Several coefficients of dormancy release were related to environmental variables. Dormancy varied greatly (4-100%) across accessions as well as year of experiment. We observed overall higher physical dormancy release under more alternating temperatures (35/15 °C) in comparison with less alternating ones (25/15 °C). Accessions from more arid climates released dormancy under higher experimental temperature alternations more than accessions originating from less arid environments. The plasticity of physical dormancy can probably distribute the germination through the year and act as a bet-hedging strategy in arid environments. On the other hand, a slight increase in physical dormancy was observed in accessions from environments with higher among-season temperature variation. Genome-wide association analysis identified 136 candidate genes related to secondary metabolite synthesis, hormone regulation, and modification of the cell wall. The activity of these genes might mediate seed coat permeability and, ultimately, imbibition and germination.
Collapse
Affiliation(s)
- Juan Pablo Renzi
- Instituto Nacional de Tecnología Agropecuaria, Hilario Ascasubi 8142, Argentina;
| | - Martin Duchoslav
- Department of Botany, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic; (M.D.); (I.H.)
| | - Jan Brus
- Department of Geoinformatics, Palacký University, 17. listopadu 50, 771 46 Olomouc, Czech Republic; (J.B.); (V.P.)
| | - Iveta Hradilová
- Department of Botany, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic; (M.D.); (I.H.)
| | - Vilém Pechanec
- Department of Geoinformatics, Palacký University, 17. listopadu 50, 771 46 Olomouc, Czech Republic; (J.B.); (V.P.)
| | - Tadeáš Václavek
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. listopadu 12, 771 46 Olomouc, Czech Republic; (T.V.); (J.M.); (K.H.)
| | - Jitka Machalová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. listopadu 12, 771 46 Olomouc, Czech Republic; (T.V.); (J.M.); (K.H.)
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. listopadu 12, 771 46 Olomouc, Czech Republic; (T.V.); (J.M.); (K.H.)
| | - Jerome Verdier
- UMR 1345 Institut de Recherche en Horticulture et Semences, Agrocampus Ouest, INRA, Université d’Angers, SFR 4207 QUASAV, 49070 Beaucouzé, France;
| | - Petr Smýkal
- Department of Botany, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic; (M.D.); (I.H.)
| |
Collapse
|
21
|
Gába A, Pedišić Ž, Štefelová N, Dygrýn J, Hron K, Dumuid D, Tremblay M. Sedentary behavior patterns and adiposity in children: a study based on compositional data analysis. BMC Pediatr 2020; 20:147. [PMID: 32241269 PMCID: PMC7114780 DOI: 10.1186/s12887-020-02036-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 03/13/2020] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Between-person differences in sedentary patterns should be considered to understand the role of sedentary behavior (SB) in the development of childhood obesity. This study took a novel approach based on compositional data analysis to examine associations between SB patterns and adiposity and investigate differences in adiposity associated with time reallocation between time spent in sedentary bouts of different duration and physical activity. METHODS An analysis of cross-sectional data was performed in 425 children aged 7-12 years (58% girls). Waking behaviors were assessed using ActiGraph GT3X accelerometer for seven consecutive days. Multi-frequency bioimpedance measurement was used to determine adiposity. Compositional regression models with robust estimators were used to analyze associations between sedentary patterns and adiposity markers. To examine differences in adiposity associated with time reallocation, we used the compositional isotemporal substitution model. RESULTS Significantly higher fat mass percentage (FM%; βilr1 = 0.18; 95% CI: 0.01, 0.34; p = 0.040) and visceral adipose tissue (VAT; βilr1 = 0.37; 95% CI: 0.03, 0.71; p = 0.034) were associated with time spent in middle sedentary bouts in duration of 10-29 min (relative to remaining behaviors). No significant associations were found for short (< 10 min) and long sedentary bouts (≥30 min). Substituting the time spent in total SB with moderate-to-vigorous physical activity (MVPA) was associated with a decrease in VAT. Substituting 1 h/week of the time spent in middle sedentary bouts with MVPA was associated with 2.9% (95% CI: 1.2, 4.6), 3.4% (95% CI: 1.2, 5.5), and 6.1% (95% CI: 2.9, 9.2) lower FM%, fat mass index, and VAT, respectively. Moreover, substituting 2 h/week of time spent in middle sedentary bouts with short sedentary bouts was associated with 3.5% (95% CI: 0.02, 6.9) lower FM%. CONCLUSIONS Our findings suggest that adiposity status could be improved by increasing MVPA at the expense of time spent in middle sedentary bouts. Some benefits to adiposity may also be expected from replacing middle sedentary bouts with short sedentary bouts, that is, by taking standing or activity breaks more often. These findings may help design more effective interventions to prevent and control childhood obesity.
Collapse
Affiliation(s)
- Aleš Gába
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic.
| | - Željko Pedišić
- Institute for Health and Sport, Victoria University, Melbourne, Australia
| | - Nikola Štefelová
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Jan Dygrýn
- Faculty of Physical Culture, Palacký University Olomouc, Olomouc, Czech Republic
| | - Karel Hron
- Faculty of Science, Palacký University Olomouc, Olomouc, Czech Republic
| | - Dorothea Dumuid
- Alliance for Research in Exercise, Nutrition and Activity, School of Health Sciences, University of South Australia, Adelaide, Australia
| | - Mark Tremblay
- Healthy Active Living and Obesity Research Group, Children's Hospital of Eastern Ontario, Ottawa, Canada
| |
Collapse
|
22
|
de Sousa J, Vencálek O, Hron K, Václavík J, Friedecký D, Adam T. Bayesian multiple hypotheses testing in compositional analysis of untargeted metabolomic data. Anal Chim Acta 2020; 1097:49-61. [PMID: 31910969 DOI: 10.1016/j.aca.2019.11.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Revised: 10/23/2019] [Accepted: 11/04/2019] [Indexed: 11/19/2022]
Abstract
Clinical metabolomics aims at finding statistically significant differences in metabolic statuses of patient and control groups with the intention of understanding pathobiochemical processes and identification of clinically useful biomarkers of particular diseases. After the raw measurements are integrated and pre-processed as intensities of chromatographic peaks, the differences between controls and patients are evaluated by both univariate and multivariate statistical methods. The traditional univariate approach relies on t-tests (or their nonparametric alternatives) and the results from multiple testing are misleadingly compared merely by p-values using the so-called volcano plot. This paper proposes a Bayesian counterpart to the widespread univariate analysis, taking into account the compositional character of a metabolome. Since each metabolome is a collection of some small-molecule metabolites in a biological material, the relative structure of metabolomic data, which is inherently contained in ratios between metabolites, is of the main interest. Therefore, a proper choice of logratio coordinates is an essential step for any statistical analysis of such data. In addition, a concept of b-values is introduced together with a Bayesian version of the volcano plot incorporating distance levels of the posterior highest density intervals from zero. The theoretical background of the contribution is illustrated using two data sets containing samples of patients suffering from 3-hydroxy-3-methylglutaryl-CoA lyase deficiency and medium-chain acyl-CoA dehydrogenase deficiency. To evaluate the stability of the proposed method as well as the benefits of the compositional approach, two simulations designed to mimic a loss of samples and a systematical measurement error, respectively, are added.
Collapse
Affiliation(s)
- Julie de Sousa
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. Listopadu 1192/12, 771 46, Olomouc, Czech Republic; Department of Clinical Biochemistry, University Hospital Olomouc, I.P. Pavlova 185/6, 779 00, Olomouc, Czech Republic; Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacký University, Hněvotínská 5, 779 00, Olomouc, Czech Republic.
| | - Ondřej Vencálek
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. Listopadu 1192/12, 771 46, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. Listopadu 1192/12, 771 46, Olomouc, Czech Republic
| | - Jan Václavík
- Department of Clinical Biochemistry, University Hospital Olomouc, I.P. Pavlova 185/6, 779 00, Olomouc, Czech Republic; Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacký University, Hněvotínská 5, 779 00, Olomouc, Czech Republic
| | - David Friedecký
- Department of Clinical Biochemistry, University Hospital Olomouc, I.P. Pavlova 185/6, 779 00, Olomouc, Czech Republic; Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacký University, Hněvotínská 5, 779 00, Olomouc, Czech Republic
| | - Tomáš Adam
- Department of Clinical Biochemistry, University Hospital Olomouc, I.P. Pavlova 185/6, 779 00, Olomouc, Czech Republic; Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacký University, Hněvotínská 5, 779 00, Olomouc, Czech Republic
| |
Collapse
|
23
|
Affiliation(s)
- J. de Sousa
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - K. Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - K. Fačevicová
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - P. Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
| |
Collapse
|
24
|
Cuberek R, Pelclová J, Gába A, Pechová J, Svozilová Z, Přidalová M, Štefelová N, Hron K. Adiposity and changes in movement-related behaviors in older adult women in the context of the built environment: a protocol for a prospective cohort study. BMC Public Health 2019; 19:1522. [PMID: 31727040 PMCID: PMC6857272 DOI: 10.1186/s12889-019-7905-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Accepted: 11/05/2019] [Indexed: 12/21/2022] Open
Abstract
Background In older adults, sedentary behaviors increase while physical activity decreases over time following the compositional nature of 24-h behaviors. These changes in movement-related behaviors (MRBs) might be associated with unhealthy weight gain and several health comorbidities. However, information is lacking on how obesity influences longitudinal changes in the composition of MRBs in older adults. Furthermore, the moderating effect of the built environment on prospective associations between obesity and MRBs in older adults is not fully understood. Therefore, using an integrated time-use approach, this study aims to identify prospective associations between obesity and MRBs together with an assessment of the moderating effect of the built environment in elderly women. Methods The study was designed as a prospective 7-year follow-up study. It is based on two previous cross-sectional studies that enable the use of participant data (women aged 60+ years, n = 409) as a baseline dataset in the current study. All methods designed for 7-year follow-up are based on previous studies. The data collection comprises device-based measurement of MRBs (ActiGraph GT1M accelerometer), objective assessment of body adiposity (multi-frequency bioelectrical impedance analysis), subjective assessment of the built environment (NEWS-A questionnaire), and other possible confounding factors. Time spent in sedentary behavior, light physical activity, and moderate-to-vigorous physical activity will be used as three components in a composition reflecting individual MRBs. In linear multiple compositional regression analysis assessing the prospective association between obesity and MRBs, the 7-year follow-up composition of the three mentioned components represents the dependent variable. The 7-year changes in the percentage of body fat (body adiposity), baseline composition of MRBs, and parameters of the built environment represent regressors. Discussion This study will use an integrated time-use approach to explore causality from obesity to device-measured behaviors in older women. The design and respective analysis consider the compositional nature of MRBs data and the potential moderating effects of various factors. A comprehensive assessment of causality may help to develop multilevel interventional models that enhance physical activity in older adults.
Collapse
Affiliation(s)
- Roman Cuberek
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic.
| | - Jana Pelclová
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic
| | - Aleš Gába
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic
| | - Jana Pechová
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic
| | - Zuzana Svozilová
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic
| | - Miroslava Přidalová
- Institute of active lifestyle, Faculty of Physical Culture, Palacký University Olomouc, třída Míru 117, 771 11, Olomouc, Czech Republic
| | - Nikola Štefelová
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, 17. listopadu 1192/12, 771 46, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, 17. listopadu 1192/12, 771 46, Olomouc, Czech Republic
| |
Collapse
|
25
|
|
26
|
Abstract
Ordered categorical data occur in many applied fields, such as geochemistry, econometrics, sociology and demography or even transportation research, for example, in the form of results from various questionnaires. There are different possibilities for modelling proportions of individual categories. Generalised linear models (GLMs) are traditionally used for this purpose, but also methods of compositional data analysis (CoDa) can be considered. Here, both approaches are compared in depth. Particularly, different assumptions of the models on variability are highlighted. Advantages and disadvantages of individual models are pointed out. While the CoDa model may be inappropriate when the variability of the compositional coordinates depends on the regressors, for example, due to different total counts on which the coordinates are based, the GLM may underestimate the uncertainty of the predictions considerably in case of large-scale data.
Collapse
Affiliation(s)
- Ondřej Vencálek
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Peter Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
| |
Collapse
|
27
|
Hradilová I, Duchoslav M, Brus J, Pechanec V, Hýbl M, Kopecký P, Smržová L, Štefelová N, Vaclávek T, Bariotakis M, Machalová J, Hron K, Pirintsos S, Smýkal P. Variation in wild pea ( Pisum sativum subsp. elatius) seed dormancy and its relationship to the environment and seed coat traits. PeerJ 2019; 7:e6263. [PMID: 30656074 PMCID: PMC6336014 DOI: 10.7717/peerj.6263] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 12/11/2018] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Seed germination is one of the earliest key events in the plant life cycle. The timing of transition from seed to seedling is an important developmental stage determining the survival of individuals that influences the status of populations and species. Because of wide geographical distribution and occurrence in diverse habitats, wild pea (Pisum sativum subsp. elatius) offers an excellent model to study physical type of seed dormancy in an ecological context. This study addresses the gap in knowledge of association between the seed dormancy, seed properties and environmental factors, experimentally testing oscillating temperature as dormancy release clue. METHODS Seeds of 97 pea accessions were subjected to two germination treatments (oscillating temperatures of 25/15 °C and 35/15 °C) over 28 days. Germination pattern was described using B-spline coefficients that aggregate both final germination and germination speed. Relationships between germination pattern and environmental conditions at the site of origin (soil and bioclimatic variables extracted from WorldClim 2.0 and SoilGrids databases) were studied using principal component analysis, redundancy analysis and ecological niche modelling. Seeds were analyzed for the seed coat thickness, seed morphology, weight and content of proanthocyanidins (PA). RESULTS Seed total germination ranged from 0% to 100%. Cluster analysis of germination patterns of seeds under two temperature treatments differentiated the accessions into three groups: (1) non-dormant (28 accessions, mean germination of 92%), (2) dormant at both treatments (29 acc., 15%) and (3) responsive to increasing temperature range (41 acc., with germination change from 15 to 80%). Seed coat thickness differed between groups with dormant and responsive accessions having thicker testa (median 138 and 140 µm) than non-dormant ones (median 84 mm). The total PA content showed to be higher in the seed coat of dormant (mean 2.18 mg g-1) than those of non-dormant (mean 1.77 mg g-1) and responsive accessions (mean 1.87 mg g-1). Each soil and bioclimatic variable and also germination responsivity (representing synthetic variable characterizing germination pattern of seeds) was spatially clustered. However, only one environmental variable (BIO7, i.e., annual temperature range) was significantly related to germination responsivity. Non-dormant and responsive accessions covered almost whole range of BIO7 while dormant accessions are found in the environment with higher annual temperature, smaller temperature variation, seasonality and milder winter. Ecological niche modelling showed a more localized potential distribution of dormant group. Seed dormancy in the wild pea might be part of a bet-hedging mechanism for areas of the Mediterranean basin with more unpredictable water availability in an otherwise seasonal environment. This study provides the framework for analysis of environmental aspects of physical seed dormancy.
Collapse
Affiliation(s)
- Iveta Hradilová
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Martin Duchoslav
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Jan Brus
- Department of Geoinformatics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Vilém Pechanec
- Department of Geoinformatics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Miroslav Hýbl
- The Centre of the Region Haná for Biotechnological and Agricultural Research, Crop Research Institute, Prague, Olomouc, Czech Republic
| | - Pavel Kopecký
- The Centre of the Region Haná for Biotechnological and Agricultural Research, Crop Research Institute, Prague, Olomouc, Czech Republic
| | - Lucie Smržová
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Nikola Štefelová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Tadeáš Vaclávek
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Michael Bariotakis
- Department of Biology and Botanical Garden, University of Crete, Heraklion, Greece
| | - Jitka Machalová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Stergios Pirintsos
- Department of Biology and Botanical Garden, University of Crete, Heraklion, Greece
| | - Petr Smýkal
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| |
Collapse
|
28
|
Hradilová I, Duchoslav M, Brus J, Pechanec V, Hýbl M, Kopecký P, Smržová L, Štefelová N, Vaclávek T, Bariotakis M, Machalová J, Hron K, Pirintsos S, Smýkal P. Variation in wild pea ( Pisum sativum subsp. elatius) seed dormancy and its relationship to the environment and seed coat traits. PeerJ 2019; 7:e6263. [PMID: 30656074 DOI: 10.7717/peerj6263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 12/11/2018] [Indexed: 05/19/2023] Open
Abstract
BACKGROUND Seed germination is one of the earliest key events in the plant life cycle. The timing of transition from seed to seedling is an important developmental stage determining the survival of individuals that influences the status of populations and species. Because of wide geographical distribution and occurrence in diverse habitats, wild pea (Pisum sativum subsp. elatius) offers an excellent model to study physical type of seed dormancy in an ecological context. This study addresses the gap in knowledge of association between the seed dormancy, seed properties and environmental factors, experimentally testing oscillating temperature as dormancy release clue. METHODS Seeds of 97 pea accessions were subjected to two germination treatments (oscillating temperatures of 25/15 °C and 35/15 °C) over 28 days. Germination pattern was described using B-spline coefficients that aggregate both final germination and germination speed. Relationships between germination pattern and environmental conditions at the site of origin (soil and bioclimatic variables extracted from WorldClim 2.0 and SoilGrids databases) were studied using principal component analysis, redundancy analysis and ecological niche modelling. Seeds were analyzed for the seed coat thickness, seed morphology, weight and content of proanthocyanidins (PA). RESULTS Seed total germination ranged from 0% to 100%. Cluster analysis of germination patterns of seeds under two temperature treatments differentiated the accessions into three groups: (1) non-dormant (28 accessions, mean germination of 92%), (2) dormant at both treatments (29 acc., 15%) and (3) responsive to increasing temperature range (41 acc., with germination change from 15 to 80%). Seed coat thickness differed between groups with dormant and responsive accessions having thicker testa (median 138 and 140 µm) than non-dormant ones (median 84 mm). The total PA content showed to be higher in the seed coat of dormant (mean 2.18 mg g-1) than those of non-dormant (mean 1.77 mg g-1) and responsive accessions (mean 1.87 mg g-1). Each soil and bioclimatic variable and also germination responsivity (representing synthetic variable characterizing germination pattern of seeds) was spatially clustered. However, only one environmental variable (BIO7, i.e., annual temperature range) was significantly related to germination responsivity. Non-dormant and responsive accessions covered almost whole range of BIO7 while dormant accessions are found in the environment with higher annual temperature, smaller temperature variation, seasonality and milder winter. Ecological niche modelling showed a more localized potential distribution of dormant group. Seed dormancy in the wild pea might be part of a bet-hedging mechanism for areas of the Mediterranean basin with more unpredictable water availability in an otherwise seasonal environment. This study provides the framework for analysis of environmental aspects of physical seed dormancy.
Collapse
Affiliation(s)
- Iveta Hradilová
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Martin Duchoslav
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Jan Brus
- Department of Geoinformatics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Vilém Pechanec
- Department of Geoinformatics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Miroslav Hýbl
- The Centre of the Region Haná for Biotechnological and Agricultural Research, Crop Research Institute, Prague, Olomouc, Czech Republic
| | - Pavel Kopecký
- The Centre of the Region Haná for Biotechnological and Agricultural Research, Crop Research Institute, Prague, Olomouc, Czech Republic
| | - Lucie Smržová
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| | - Nikola Štefelová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Tadeáš Vaclávek
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Michael Bariotakis
- Department of Biology and Botanical Garden, University of Crete, Heraklion, Greece
| | - Jitka Machalová
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Olomouc, Czech Republic
| | - Stergios Pirintsos
- Department of Biology and Botanical Garden, University of Crete, Heraklion, Greece
| | - Petr Smýkal
- Department of Botany, Palacký University Olomouc, Olomouc, Czech Republic
| |
Collapse
|
29
|
Affiliation(s)
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Olomouc
| | | | - Matthias Templ
- Institute of Data Analysis and Process Design Zurich University of Applied Sciences
| |
Collapse
|
30
|
Muller I, Hron K, Fiserova E, Smahaj J, Cakirpaloglu P, Vancakova J. Interpretation of Compositional Regression with Application to Time Budget Analysis. AJS 2018. [DOI: 10.17713/ajs.v47i2.652] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Regression with compositional response or covariates, or even regression between parts of a composition, is frequently employed in social sciences. Among other possible applications, it may help to reveal interesting features in time allocation analysis. As individual activities represent relative contributions to the total amount of time, statistical processing of raw data (frequently represented directly as proportions or percentages) using standard methods may lead to biased results. Specific geometrical features of time budget variables are captured by the logratio methodology of compositional data, whose aim is to build (preferably orthonormal) coordinates to be applied with popular statistical methods. The aim of this paper is to present recent tools of regression analysis within the logratio methodology and apply them to reveal potential relationships among psychometric indicators in a real-world data set. In particular, orthogonal logratio coordinates have been introduced to enhance the interpretability of coefficients in regression models.
Collapse
|
31
|
Reimann C, Filzmoser P, Hron K, Kynčlová P, Garrett RG. A new method for correlation analysis of compositional (environmental) data - a worked example. Sci Total Environ 2017; 607-608:965-971. [PMID: 28724228 DOI: 10.1016/j.scitotenv.2017.06.063] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 06/08/2017] [Accepted: 06/08/2017] [Indexed: 06/07/2023]
Abstract
Most data in environmental sciences and geochemistry are compositional. Already the unit used to report the data (e.g., μg/l, mg/kg, wt%) implies that the analytical results for each element are not free to vary independently of the other measured variables. This is often neglected in statistical analysis, where a simple log-transformation of the single variables is insufficient to put the data into an acceptable geometry. This is also important for bivariate data analysis and for correlation analysis, for which the data need to be appropriately log-ratio transformed. A new approach based on the isometric log-ratio (ilr) transformation, leading to so-called symmetric coordinates, is presented here. Summarizing the correlations in a heat-map gives a powerful tool for bivariate data analysis. Here an application of the new method using a data set from a regional geochemical mapping project based on soil O and C horizon samples is demonstrated. Differences to 'classical' correlation analysis based on log-transformed data are highlighted. The fact that some expected strong positive correlations appear and remain unchanged even following a log-ratio transformation has probably led to the misconception that the special nature of compositional data can be ignored when working with trace elements. The example dataset is employed to demonstrate that using 'classical' correlation analysis and plotting XY diagrams, scatterplots, based on the original or simply log-transformed data can easily lead to severe misinterpretations of the relationships between elements.
Collapse
Affiliation(s)
- C Reimann
- Geological Survey of Norway, P.O.·Box 6315, 7491 Sluppen, Norway.
| | - P Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstr. 8-10, 1040 Vienna, Austria
| | - K Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, 17. listopadu 12, 77146 Olomouc, Czech Republic
| | - P Kynčlová
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstr. 8-10, 1040 Vienna, Austria
| | - R G Garrett
- Emeritus Scientist, Geological Survey of Canada, Natural Resources Canada, 601 Booth St., Ottawa, ON K1A 0E8, Canada.
| |
Collapse
|
32
|
Chen J, Zhang X, Hron K, Templ M, Li S. Regression imputation with Q-mode clustering for rounded zero replacement in high-dimensional compositional data. J Appl Stat 2017. [DOI: 10.1080/02664763.2017.1410524] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Jiajia Chen
- School of Mathematical Sciences, Shanxi University, Taiyuan, People's Republic of China
| | - Xiaoqin Zhang
- School of Mathematical Sciences, Shanxi University, Taiyuan, People's Republic of China
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Matthias Templ
- Department of Statistics and Probability Theory, Vienna University of Technology, Vienna, Austria
- Present address: Institute of Data Analysis and Process Design, Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Shengjia Li
- School of Mathematical Sciences, Shanxi University, Taiyuan, People's Republic of China
| |
Collapse
|
33
|
Dumuid D, Pedišić Ž, Stanford TE, Martín-Fernández JA, Hron K, Maher CA, Lewis LK, Olds T. The compositional isotemporal substitution model: A method for estimating changes in a health outcome for reallocation of time between sleep, physical activity and sedentary behaviour. Stat Methods Med Res 2017; 28:846-857. [DOI: 10.1177/0962280217737805] [Citation(s) in RCA: 113] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
How people use their time has been linked with their health. For example, spending more time being physically active is known to be beneficial for health, whereas long durations of sitting have been associated with unfavourable health outcomes. Accordingly, public health messages have advocated swapping strategies to promote the reallocation of time between parts of the time-use composition, such as “Move More, Sit Less”, with the aim of achieving optimal distribution of time for health. However, the majority of research underpinning these public health messages has not considered daily time use as a composition, and has ignored the relative nature of time-use data. We present a way of applying compositional data analysis to estimate change in a health outcome when fixed durations of time are reallocated from one part of a particular time-use composition to another, while the remaining parts are kept constant, based on a multiple linear regression model on isometric log ratio coordinates. In an example, we examine the expected differences in Body Mass Index z-scores for reallocations of time between sleep, physical activity and sedentary behaviour.
Collapse
Affiliation(s)
- Dorothea Dumuid
- School of Health Sciences, University of South Australia, Adelaide, Australia
| | - Željko Pedišić
- Institute of Sport, Exercise, and Active Living (ISEAL), Victoria University, Melbourne, Australia
| | - Tyman Everleigh Stanford
- LBT Innovations, Adelaide, Australia
- School of Mathematical Sciences, University of Adelaide, Adelaide, Australia
| | | | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czech Republic
| | - Carol A Maher
- School of Health Sciences, University of South Australia, Adelaide, Australia
| | - Lucy K Lewis
- College of Nursing and Health Sciences, Flinders University, Adelaide, Australia
| | - Timothy Olds
- School of Health Sciences, University of South Australia, Adelaide, Australia
| |
Collapse
|
34
|
Affiliation(s)
- M. A. Di Palma
- Department of Human and Social Sciences, University of Naples ‘L'Orientale’, Naples, Italy
| | - P. Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Vienna, Austria
| | - M. Gallo
- Department of Human and Social Sciences, University of Naples ‘L'Orientale’, Naples, Italy
| | - K. Hron
- Department of Mathematical Analysis and Applications of Mathematics, Palacký University, Olomouc, Czech Republic
| |
Collapse
|
35
|
Dumuid D, Stanford TE, Martin-Fernández JA, Pedišić Ž, Maher CA, Lewis LK, Hron K, Katzmarzyk PT, Chaput JP, Fogelholm M, Hu G, Lambert EV, Maia J, Sarmiento OL, Standage M, Barreira TV, Broyles ST, Tudor-Locke C, Tremblay MS, Olds T. Compositional data analysis for physical activity, sedentary time and sleep research. Stat Methods Med Res 2017; 27:3726-3738. [PMID: 28555522 DOI: 10.1177/0962280217710835] [Citation(s) in RCA: 222] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The health effects of daily activity behaviours (physical activity, sedentary time and sleep) are widely studied. While previous research has largely examined activity behaviours in isolation, recent studies have adjusted for multiple behaviours. However, the inclusion of all activity behaviours in traditional multivariate analyses has not been possible due to the perfect multicollinearity of 24-h time budget data. The ensuing lack of adjustment for known effects on the outcome undermines the validity of study findings. We describe a statistical approach that enables the inclusion of all daily activity behaviours, based on the principles of compositional data analysis. Using data from the International Study of Childhood Obesity, Lifestyle and the Environment, we demonstrate the application of compositional multiple linear regression to estimate adiposity from children's daily activity behaviours expressed as isometric log-ratio coordinates. We present a novel method for predicting change in a continuous outcome based on relative changes within a composition, and for calculating associated confidence intervals to allow for statistical inference. The compositional data analysis presented overcomes the lack of adjustment that has plagued traditional statistical methods in the field, and provides robust and reliable insights into the health effects of daily activity behaviours.
Collapse
Affiliation(s)
- Dorothea Dumuid
- 1 School of Health Sciences, University of South Australia, Adelaide, Australia
| | - Tyman E Stanford
- 2 School of Mathematical Sciences, University of Adelaide, Adelaide, Australia
| | | | - Željko Pedišić
- 4 Institute of Sport, Exercise and Active Living, Victoria University, Melbourne, Australia
| | - Carol A Maher
- 1 School of Health Sciences, University of South Australia, Adelaide, Australia
| | - Lucy K Lewis
- 6 Department of Mathematical Analysis and Applications of Mathematics, Univerzita Palackeho, Olomouc, Czech Republic
| | - Karel Hron
- 6 Department of Mathematical Analysis and Applications of Mathematics, Univerzita Palackeho, Olomouc, Czech Republic
| | | | - Jean-Philippe Chaput
- 8 Healthy Active Living and Obesity Research, Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada
| | - Mikael Fogelholm
- 9 Department of Food and Environmental Sciences, Helsingin Yliopisto, Helsinki, Finland
| | - Gang Hu
- 7 Pennington Biomedical Research Center, Baton Rouge, LA, USA
| | - Estelle V Lambert
- 10 Department of Human Biology, University of Cape Town, Cape Town, South Africa
| | - José Maia
- 11 Faculdade de Desporto, Universidade do Porto, Porto, Portugal
| | - Olga L Sarmiento
- 12 Faculty of Medicine, Universidad de los Andes, Bogota, Colombia
| | | | - Tiago V Barreira
- 14 Department of Exercise Science, Syracuse University, Syracuse, NY, USA
| | | | | | - Mark S Tremblay
- 8 Healthy Active Living and Obesity Research, Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada
| | - Timothy Olds
- 1 School of Health Sciences, University of South Australia, Adelaide, Australia
| |
Collapse
|
36
|
Abstract
Statistical analysis of trade flows structure can significantly help to reveal or to confirm important macroeconomic phenomena. Because of relative character of these multivariate observations, application of standard multivariate methods directly to raw data can lead to meaningless results, affected by trade sizes of different countries. As a way out, it is proposed to employ the logratio methodology that is able to capture interesting features through logratios between compositional parts. Particularly, the perturbation operation together with clr coefficients for coordinate representation of compositions seem to be easy to handle and to interpret for the purpose. Popular exploratory tools, principal component analysis and PARAFAC modeling of threeway data, resulting from a long-term study of the export/import structure, are applied in the compositional context for data from OECD and WIOD databases. The results show that the logratio methodology enables to reveal interesting features of world trade flows and thus provides a preferable alternative to existing exploratory tools.
Collapse
|
37
|
Donevska S, Fišerová E, Hron K. Calibration of compositional measurements. COMMUN STAT-THEOR M 2016. [DOI: 10.1080/03610926.2013.839039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
38
|
Mert MC, Filzmoser P, Hron K. Error Propagation in Isometric Log-ratio Coordinates for Compositional Data: Theoretical and Practical Considerations. Math Geosci 2016; 48:941-961. [PMID: 28316755 PMCID: PMC5181651 DOI: 10.1007/s11004-016-9646-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 06/15/2016] [Indexed: 06/06/2023]
Abstract
Compositional data, as they typically appear in geochemistry in terms of concentrations of chemical elements in soil samples, need to be expressed in log-ratio coordinates before applying the traditional statistical tools if the relative structure of the data is of primary interest. There are different possibilities for this purpose, like centered log-ratio coefficients, or isometric log-ratio coordinates. In both the approaches, geometric means of the compositional parts are involved, and it is unclear how measurement errors or detection limit problems affect their presentation in coordinates. This problem is investigated theoretically by making use of the theory of error propagation. Due to certain limitations of this approach, the effect of error propagation is also studied by means of simulations. This allows to provide recommendations for practitioners on the amount of error and on the expected distortion of the results, depending on the purpose of the analysis.
Collapse
Affiliation(s)
- Mehmet Can Mert
- Institute of Statistics and Mathematical Methods in
Economics, Vienna University of Technology, Wiedner Hauptstrasse 8-10, 1040 Vienna, Austria
| | - Peter Filzmoser
- Institute of Statistics and Mathematical Methods in
Economics, Vienna University of Technology, Wiedner Hauptstrasse 8-10, 1040 Vienna, Austria
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics,
Faculty of Science, Palacký University, 17. listopadu 12, 771 46 Olomouc, Czech Republic
| |
Collapse
|
39
|
Affiliation(s)
- M. Templ
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstraße 8-10, A-1040 Vienna, Austria
| | - K. Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, 17. listopadu 12, CZ-77146 Olomouc, Czech Republic
| | - P. Filzmoser
- Institute of Statistics and Mathematical Methods in Economics, Vienna University of Technology, Wiedner Hauptstraße 8-10, A-1040 Vienna, Austria
| |
Collapse
|
40
|
|
41
|
Affiliation(s)
- Kamila Fačevicová
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Olomouc
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics Palacký University Olomouc
| | | | - Matthias Templ
- Institute of Statistics and Probability Theory Vienna University of Technology
| |
Collapse
|
42
|
|
43
|
|
44
|
Abstract
Recent experiences with interpretation of orthonormal coordinates in compositionaldata show clearly a necessity of their better understanding in terms of logratios that formthe primary source of information within the logratio methodology. This is even morecrucial in the special case of compositional tables, where both balances and coordinateswith odds ratio interpretation are involved. The aim of the paper is to provide a decompo-sition of covariance structure of orthonormal coordinates in compositional tables in termsof logratio variances that could serve for this purpose. For their better interpretability,the formulas are also accompanied with appropriate comments and graphical illustrations,and implications for the prominent case of 2 2 compositional tables are discussed.
Collapse
|
45
|
|
46
|
Koláčková P, Růžičková G, Šafránková I, Hron K, Hrůzová K. Evaluation of the Growth Dynamics and Morphological Characteristics of Genetic Sources of Silybum marianum (L.) Gaertn. Acta Univ Agric Silvic Mendelianae Brun 2015. [DOI: 10.11118/actaun201563041141] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
47
|
Affiliation(s)
- Peter Filzmoser
- Vienna University of Technology, Department of Statistics and Probability Theory, Wiedner Hauptstr. 8-10, 1040 Vienna, Austria
| | - Karel Hron
- Faculty of Science, Palacký University, Department of Mathematical Analysis and Applications of Mathematics, 17. listopadu 12, 771 46 Olomouc, Czech Republic
| |
Collapse
|
48
|
Janeckova H, Kalivodova A, Najdekr L, Friedecky D, Hron K, Bruheim P, Adam T. Untargeted metabolomic analysis of urine samples in the diagnosis of some inherited metabolic disorders. Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub 2014; 159:582-5. [PMID: 25482736 DOI: 10.5507/bp.2014.048] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Accepted: 09/15/2014] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Metabolomics is becoming an important tool in clinical research and the diagnosis of human diseases. It has been used in the diagnosis of inherited metabolic disorders with pronounced biochemical abnormalities. The aim of this study was to determine if it could be applied in the diagnosis of inherited metabolic disorders (IMDs) with less clear biochemical profiles from urine samples using an untargeted metabolomic approach. METHODS A total of 14 control urine samples and 21 samples from infants with cystinuria, maple syrup urine disease, adenylosuccinate lyase deficiency and galactosemia were tested. Samples were analyzed by liquid chromatography on aminopropyl column in aqueous normal phase separation system using gradient elution of acetonitrile/ammonium acetate. Detection was performed by time-of-flight mass spectrometer fitted with electrospray ionisation in positive mode. The data were statistically processed using principal component analysis (PCA), principal component discriminant function analysis (PCA-DFA) and partial least squares (PLS) regression. RESULTS All patient samples were first distinguished from controls using unsupervised PCA. Discrimination of the patient samples was then unambiguously verified using supervised PCA-DFA. Known markers of the diseases in question were successfully confirmed and a potential new marker emerged from the PLS regression. CONCLUSION This study showed that untargeted metabolomics can be applied in the diagnosis of mild IMDs with less clear biochemical profiles.
Collapse
Affiliation(s)
- Hana Janeckova
- Laboratory for Inherited Metabolic Disorders, Department of Clinical Biochemistry, University Hospital Olomouc, Czech Republic.,Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University Olomouc
| | - Alzbeta Kalivodova
- Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University Olomouc.,Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacky University Olomouc
| | - Lukas Najdekr
- Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University Olomouc
| | - David Friedecky
- Laboratory for Inherited Metabolic Disorders, Department of Clinical Biochemistry, University Hospital Olomouc, Czech Republic.,Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University Olomouc
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacky University Olomouc
| | - Per Bruheim
- Department of Biotechnology, Norwegian University of Science and Technology, Sem Saelands vei 6/8, NO-7491 Trondheim, Norway
| | - Tomas Adam
- Laboratory for Inherited Metabolic Disorders, Department of Clinical Biochemistry, University Hospital Olomouc, Czech Republic.,Laboratory of Metabolomics, Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University Olomouc
| |
Collapse
|
49
|
Abstract
Compositional data analysis deals with situations where the relevant information is contained only in the ratios between the measured variables, and not in the reported values. This article focuses on high-dimensional compositional data (in the sense of hundreds or even thousands of variables), as they appear in chemometrics (e.g., mass spectral data), proteomics or genomics. The goal of this contribution is to perform a dimension reduction of such data, where the new directions should allow for interpretability. An approach named principal balances turned out to be successful for low dimensions. Here, the concept of sparse principal component analysis is proposed for constructing principal directions, the so-called sparse principal balances. They are sparse (contain many zeros), build an orthonormal basis in the sample space of the compositional data, are efficient for dimension reduction and are applicable to high-dimensional data.
Collapse
Affiliation(s)
- Mehmet Can Mert
- Department of Statistics and Probability Theory, Vienna University of Technology, Vienna, Austria
| | - Peter Filzmoser
- Department of Statistics and Probability Theory, Vienna University of Technology, Vienna, Austria
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Olomouc, Czech Republic
- Department of Geoinformatics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| |
Collapse
|
50
|
Abstract
Compositional count data are discrete vectors representing the numbers of outcomes falling into any of several mutually exclusive categories. Compositional techniques based on the log-ratio methodology are appropriate in those cases where the total sum of the vector elements is not of interest. Such compositional count data sets can contain zero values which are often the result of insufficiently large samples. That is, they refer to unobserved positive values that may have been observed with a larger number of trials or with a different sampling design. Because the log-ratio transformations require data with positive values, any statistical analysis of count compositions must be preceded by a proper replacement of the zeros. A Bayesian-multiplicative treatment has been proposed for addressing this count zero problem in several case studies. This treatment involves the Dirichlet prior distribution as the conjugate distribution of the multinomial distribution and a multiplicative modification of the non-zero values. Different parameterizations of the prior distribution provide different zero replacement results, whose coherence with the vector space structure of the simplex is stated. Their performance is evaluated from both the theoretical and the computational point of view.
Collapse
Affiliation(s)
| | - Karel Hron
- Department of Mathematical Analysis and Applications of Mathematics, Faculty of Science, Palacký University, Czech Republic
- Department of Geoinformatics, Faculty of Science, Palacký University, Czech Republic
| | - Matthias Templ
- Department of Geoinformatics, Faculty of Science, Palacký University, Czech Republic
- Department of Statistics and Probability Theory, Vienna University of Technology, Austria
- Department of Methodology, Statistics Austria, Austria
| | - Peter Filzmoser
- Department of Geoinformatics, Faculty of Science, Palacký University, Czech Republic
- Department of Statistics and Probability Theory, Vienna University of Technology, Austria
| | | |
Collapse
|