1
|
Beyond the kill: The allometry of predation behaviours among large carnivores. J Anim Ecol 2024; 93:554-566. [PMID: 38459609 DOI: 10.1111/1365-2656.14070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 02/06/2024] [Indexed: 03/10/2024]
Abstract
The costs of foraging can be high while also carrying significant risks, especially for consumers feeding at the top of the food chain. To mitigate these risks, many predators supplement active hunting with scavenging and kleptoparasitic behaviours, in some cases specializing in these alternative modes of predation. The factors that drive differential utilization of these tactics from species to species are not well understood. Here, we use an energetics approach to investigate the survival advantages of hunting, scavenging and kleptoparasitism as a function of predator, prey and potential competitor body sizes for terrestrial mammalian carnivores. The results of our framework reveal that predator tactics become more diverse closer to starvation, while the deployment of scavenging and kleptoparasitism is strongly constrained by the ratio of predator to prey body size. Our model accurately predicts a behavioural transition away from hunting towards alternative modes of predation with increasing prey size for predators spanning an order of magnitude in body size, closely matching observational data across a range of species. We then show that this behavioural boundary follows an allometric power-law scaling relationship where the predator size scales with an exponent nearing 3/4 with prey size, meaning that this behavioural switch occurs at relatively larger threshold prey body size for larger carnivores. We suggest that our approach may provide a holistic framework for guiding future observational efforts exploring the diverse array of predator foraging behaviours.
Collapse
|
2
|
Reinforcement Learning Algorithms and Applications in Healthcare and Robotics: A Comprehensive and Systematic Review. SENSORS (BASEL, SWITZERLAND) 2024; 24:2461. [PMID: 38676080 PMCID: PMC11053800 DOI: 10.3390/s24082461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 04/04/2024] [Accepted: 04/08/2024] [Indexed: 04/28/2024]
Abstract
Reinforcement learning (RL) has emerged as a dynamic and transformative paradigm in artificial intelligence, offering the promise of intelligent decision-making in complex and dynamic environments. This unique feature enables RL to address sequential decision-making problems with simultaneous sampling, evaluation, and feedback. As a result, RL techniques have become suitable candidates for developing powerful solutions in various domains. In this study, we present a comprehensive and systematic review of RL algorithms and applications. This review commences with an exploration of the foundations of RL and proceeds to examine each algorithm in detail, concluding with a comparative analysis of RL algorithms based on several criteria. This review then extends to two key applications of RL: robotics and healthcare. In robotics manipulation, RL enhances precision and adaptability in tasks such as object grasping and autonomous learning. In healthcare, this review turns its focus to the realm of cell growth problems, clarifying how RL has provided a data-driven approach for optimizing the growth of cell cultures and the development of therapeutic solutions. This review offers a comprehensive overview, shedding light on the evolving landscape of RL and its potential in two diverse yet interconnected fields.
Collapse
|
3
|
CausNet-partial : 'Partial Generational Orderings' based search for optimal sparse Bayesian networks via dynamic programming with parent set constraints. RESEARCH SQUARE 2024:rs.3.rs-4021074. [PMID: 38496505 PMCID: PMC10942557 DOI: 10.21203/rs.3.rs-4021074/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Background In our recent work, we developed a novel dynamic programming algorithm to find optimal Bayesian networks (BNs) with parent set constraints. This 'generational orderings' based dynamic programming search algorithm - CausNet - efficiently searches the space of possible BNs given the possible parent sets. The algorithm supports both continuous and categorical data, as well as continuous, binary and survival outcomes. In the present work, we develop a variant of CausNet - CausNet-partial - which searches the space of 'partial generational orderings', which further reduces the search space and is suited for finding smaller sparse optimal Bayesian networks; and can be applied to 1000s of variables. Results We test this method both on synthetic and real data. Our algorithm performs better than three state-of-art algorithms that are currently used extensively to find optimal BNs. We apply it to simulated continuous data and also to a benchmark discrete Bayesian network ALARM, a Bayesian network designed to provide an alarm message system for patient monitoring. We first apply the original CausNet and then CausNet-partial varying the partial order from 5 to 2. CausNet-partial discovers small sparse networks with drastically reduced runtime as expected from theory. Conclusions Our partial generational orderings based search for small optimal networks, is both an efficient and highly scalable approach for finding optimal sparse and small Bayesian Networks and can be applied to 1000s of variables. Using specifiable parameters - correlation, FDR cutoffs, in-degree, and partial order - one can increase or decrease the number of nodes and density of the networks. Availability of two scoring option - BIC and Bge - and implementation for survival outcomes and mixed data types makes our algorithm very suitable for many types of high dimensional data in a variety of fields.
Collapse
|
4
|
DERNA Enables Pareto Optimal RNA Design. J Comput Biol 2024; 31:179-196. [PMID: 38416637 DOI: 10.1089/cmb.2023.0283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024] Open
Abstract
The design of an RNA sequence v that encodes an input target protein sequence w is a crucial aspect of messenger RNA (mRNA) vaccine development. There are an exponential number of possible RNA sequences for a single target protein due to codon degeneracy. These potential RNA sequences can assume various secondary structure conformations, each with distinct minimum free energy (MFE), impacting thermodynamic stability and mRNA half-life. Furthermore, the presence of species-specific codon usage bias, quantified by the codon adaptation index (CAI), plays a vital role in translation efficiency. While earlier studies focused on optimizing either MFE or CAI, recent research has underscored the advantages of simultaneously optimizing both objectives. However, optimizing one objective comes at the expense of the other. In this work, we present the Pareto Optimal RNA Design problem, aiming to identify the set of Pareto optimal solutions for which no alternative solutions exist that exhibit better MFE and CAI values. Our algorithm DEsign RNA (DERNA) uses the weighted sum method to enumerate the Pareto front by optimizing convex combinations of both objectives. We use dynamic programming to solve each convex combination in O ( | w | 3 ) time and O ( | w | 2 ) space. Compared with a CDSfold, previous approach that only optimizes MFE, we show on a benchmark data set that DERNA obtains solutions with identical MFE but superior CAI. Moreover, we show that DERNA matches the performance in terms of solution quality of LinearDesign, a recent approach that similarly seeks to balance MFE and CAI. We conclude by demonstrating our method's potential for mRNA vaccine design for the SARS-CoV-2 spike protein.
Collapse
|
5
|
A general framework for modelling trade-offs in adaptive behaviour. Biol Rev Camb Philos Soc 2024; 99:56-69. [PMID: 37609707 DOI: 10.1111/brv.13011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 08/05/2023] [Accepted: 08/09/2023] [Indexed: 08/24/2023]
Abstract
An animal's behaviour can influence many variables, such as its energy reserves, its risk of injury or mortality, and its rate of reproduction. To identify the optimal action in a given situation, these various effects can be compared in the common currency of reproductive value. While this idea has been widely used to study trade-offs between pairs of variables, e.g. between energy gain versus survival, here we present a unified framework that makes explicit how these various trade-offs fit together. This unification covers a wide range of biological phenomena, highlighting similarities in their logical structure and helping to identify knowledge gaps. To fill one such gap, we present a new model of foraging under the risk of predation and damage accumulation. We conclude by discussing the use and limitations of state-dependent optimisation theory in predicting biological observations.
Collapse
|
6
|
Who innovates? Abundance of novel and familiar food changes which animals are most persistent. Proc Biol Sci 2024; 291:20231936. [PMID: 38228174 PMCID: PMC10791525 DOI: 10.1098/rspb.2023.1936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 12/11/2023] [Indexed: 01/18/2024] Open
Abstract
Novel behaviours are the raw material of cultural evolution, yet we do not have a clear picture of when they are likely to arise. I use a state-dependent model to examine how individual age and energy reserves interact with the abundance of known and novel prey to promote dietary innovation (incorporating a new food item into the diet). I measure innovativeness as persistence in attempting to capture novel prey. I find a trend towards greater persistence among younger individuals. Decreased abundance of known prey and increased abundance of novel prey also favour persistence. However, many exceptions to these trends occur. These exceptions are critical because they may explain inconsistencies among studies of animal innovation. Care must be taken in experiments to control for multiple factors relevant to an animal's energy budget and foraging opportunities. We may learn more about innovation in experimental contexts by (i) manipulating the abundance of novel and familiar food resources, (ii) directly measuring animal age and condition, and-where possible-(iii) fitting nonlinear models to innovative behaviour. Results indicate that selection for persistence may also favour neophilia.
Collapse
|
7
|
A Computational Evaluation of Minimum Feature Size in Projection Two-Photon Lithography for Rapid Sub-100 nm Additive Manufacturing. MICROMACHINES 2024; 15:158. [PMID: 38276857 PMCID: PMC10820352 DOI: 10.3390/mi15010158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 01/17/2024] [Accepted: 01/19/2024] [Indexed: 01/27/2024]
Abstract
Two-photon lithography (TPL) is a laser-based additive manufacturing technique that enables the printing of arbitrarily complex cm-scale polymeric 3D structures with sub-micron features. Although various approaches have been investigated to enable the printing of fine features in TPL, it is still challenging to achieve rapid sub-100 nm 3D printing. A key limitation is that the physical phenomena that govern the theoretical and practical limits of the minimum feature size are not well known. Here, we investigate these limits in the projection TPL (P-PTL) process, which is a high-throughput variant of TPL, wherein entire 2D layers are printed at once. We quantify the effects of the projected feature size, optical power, exposure time, and photoinitiator concentration on the printed feature size through finite element modeling of photopolymerization. Simulations are performed rapidly over a vast parameter set exceeding 10,000 combinations through a dynamic programming scheme, which is implemented on high-performance computing resources. We demonstrate that there is no physics-based limit to the minimum feature sizes achievable with a precise and well-calibrated P-TPL system, despite the discrete nature of illumination. However, the practically achievable minimum feature size is limited by the increased sensitivity of the degree of polymer conversion to the processing parameters in the sub-100 nm regime. The insights generated here can serve as a roadmap towards fast, precise, and predictable sub-100 nm 3D printing.
Collapse
|
8
|
Aerosolized Insecticide Spray Distributions and Relationships to Efficacy against Stored Product Pests. INSECTS 2023; 14:914. [PMID: 38132588 PMCID: PMC10744046 DOI: 10.3390/insects14120914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 11/07/2023] [Accepted: 11/08/2023] [Indexed: 12/23/2023]
Abstract
Aerosol insecticides are widely used in stored product insect management programs in food facilities. Previous research has shown spatial variation in aerosol efficacy within facilities, but information on how spatial patterns of aerosol droplet concentration, size distribution, dispersal, and deposition contribute to this variation in efficacy is limited. This study involved two aerosol application systems: a high-pressure cylinder containing TurboCide Py-75® with pyriproxyfen IGR (ChemTech Ltd., Des Moines, IA, USA) and a hand-held fogger containing Pyrocide 100® (MGK, Minneapolis, MN, USA) with Diacon II which contains methoprene IGR (Wellmark, Schaumburg, IL, USA). These systems were used at single or multiple application locations. The spray trials were conducted in a small-scale flour mill, Hall Ross Flour Mill (Kansas State University, Manhattan, KS, USA). The droplet size distributions were monitored at multiple positions within the room using nine aerodynamic particle sizing (APS, TSI Incorp, Shoreview, MN, USA) instruments. The APS data collected over the treatment period were summarized into a mass concentration index (MCI), which ranged from 155 to 2549 mg/m3 for Turbocide and 235-5658 mg/m3 for Pyrocide. A second parameter called the Deposition Index (Dep.Idx) was derived to estimate potential insecticide depositions on the floor and has units of g/m2. The Dep.Idx was below 5.3 g/m2 for most Turbocide applications, while the Dep.Idx was below 8.4 g/m2 for most Pyrocide applications. The MCI and Dep.Idx values varied with APS position and spray application location, with proximity to the aerosol application location and degree of obstruction between the release point and APS position contributing to this variation. We assessed the relationship between aerosol droplet parameters and insect efficacy using Tribolium confusum Jacqueline DuVal, the confused flour beetle. The adults were treated directly, while the larvae were treated two weeks later during the residual test (previously published). For Turbocide, efficacy against adults increased with MCI and Dep.Idx values, but for residual efficacy of the IGR, efficacy was high at all aerosol droplet values, so no relationship was apparent. In contrast, the relationship between Pyrocide deposition and adult insect efficacy was highly variable. But with larval insect efficacy, residual larvae control was directly related to increases in Pyrocide MCI and Dep.Idx. Contour plots of Dep.Idx values were developed, which could be used to predict areas of the mill that are not receiving an adequate application rate, and this could be used to develop more effective application strategies for aerosol insecticides in food facilities.
Collapse
|
9
|
A Comprehensive Eco-Driving Strategy for CAVs with Microscopic Traffic Simulation Testing Evaluation. SENSORS (BASEL, SWITZERLAND) 2023; 23:8416. [PMID: 37896510 PMCID: PMC10610644 DOI: 10.3390/s23208416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 10/06/2023] [Accepted: 10/10/2023] [Indexed: 10/29/2023]
Abstract
In this paper, a comprehensive deterministic Eco-Driving strategy for Connected and Autonomous Vehicles (CAVs) is presented. In this setup, multiple driving modes calculate speed profiles that are ideal for their own set of constraints simultaneously to save fuel as much as possible, while a High-Level (HL) controller ensures smooth and safe transitions between the driving modes for Eco-Driving. This Eco-Driving deterministic controller for an ego CAV was equipped with Vehicle-to-Infrastructure (V2I) and Vehicle-to-Vehicle (V2V) algorithms. This comprehensive Eco-Driving strategy and its individual components were tested by using simulations to quantify the fuel economy performance. Simulation results are used to show that the HL controller ensures significant fuel economy improvement as compared to baseline driving modes with no collisions between the ego CAV and traffic vehicles, while the driving mode of the ego CAV was set correctly under changing constraints. For the microscopic traffic simulations, a 6.41% fuel economy improvement was observed for the CAV that was controlled by this comprehensive Eco-Driving strategy.
Collapse
|
10
|
Multivariate piecewise linear regression model to predict radiosensitivity using the association with the genome-wide copy number variation. Front Oncol 2023; 13:1154222. [PMID: 37849808 PMCID: PMC10577171 DOI: 10.3389/fonc.2023.1154222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 09/11/2023] [Indexed: 10/19/2023] Open
Abstract
Introduction The search for biomarkers to predict radiosensitivity is important not only to individualize radiotherapy of cancer patients but also to forecast radiation exposure risks. The aim of this study was to devise a machine-learning method to stratify radiosensitivity and to investigate its association with genome-wide copy number variations (CNVs) as markers of sensitivity to ionizing radiation. Methods We used the Affymetrix CytoScan HD microarrays to survey common CNVs in 129 fibroblast cell strains. Radiosensitivity was measured by the surviving fraction at 2 Gy (SF2). We applied a dynamic programming (DP) algorithm to create a piecewise (segmented) multivariate linear regression model predicting SF2 and to identify SF2 segment-related distinctive CNVs. Results SF2 ranged between 0.1384 and 0.4860 (mean=0.3273 The DP algorithm provided optimal segmentation by defining batches of radio-sensitive (RS), normally-sensitive (NS), and radio-resistant (RR) responders. The weighted mean relative errors (MRE) decreased with increasing the segments' number. The borders of the utmost segments have stabilized after partitioning SF2 into 5 subranges. Discussion The 5-segment model associated C-3SFBP marker with the most-RS and C-7IUVU marker with the most-RR cell strains. Both markers were mapped to gene regions (MCC and SLC1A6, respectively). In addition, C-3SFBP marker is also located in enhancer and multiple binding motifs. Moreover, for most CNVs significantly correlated with SF2, the radiosensitivity increased with the copy-number decrease.In conclusion, the DP-based piecewise multivariate linear regression method helps narrow the set of CNV markers from the whole radiosensitivity range to the smaller intervals of interest. Notably, SF2 partitioning not only improves the SF2 estimation but also provides distinctive markers. Ultimately, segment-related markers can be used, potentially with tissues' specific factors or other clinical data, to identify radiotherapy patients who are most RS and require reduced doses to avoid complications and the most RR eligible for dose escalation to improve outcomes.
Collapse
|
11
|
The small-bat-in-summer paradigm: Energetics and adaptive behavioural routines of bats investigated through a stochastic dynamic model. J Anim Ecol 2023; 92:2078-2093. [PMID: 37661664 DOI: 10.1111/1365-2656.13999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 08/16/2023] [Indexed: 09/05/2023]
Abstract
Strong seasonality at high latitudes represents a major challenge for many endotherms as they must balance survival and reproduction in an environment that varies widely in food availability and temperature. To avoid energetic mismatches caused by limited foraging time and stochastic weather conditions, bats employ the energy-saving state of torpor during summer to save accumulated energy reserves. However, at high-latitude small-bats-in-summer face a particular challenge: as nocturnal foragers, they rely on the darkness at night to avoid predators and/or interspecific competition, but live in an environment with short, light summer nights, and even a lack of true night at the northernmost distributions of some bat species. To predict optimal behaviour in relation to latitudinal variation in diurnal cycles, we constructed a stochastic dynamic programming model of bats living at high latitudes. Using a stochastic dynamic programming framework with values that are representative for our study system, we show that individual energetic reserves are a strong driver of daytime use of torpor and night-time foraging behaviour alike, with these linked effects being both temperature- and photoperiod-dependent. We further used the model to predict survival probabilities at five locations across a latitudinal gradient (60.1° N to 70.9° N), finding that combinations of photoperiod and temperature conditions limited population distributions in the model. To verify our model results, we compared predictions for optimal decisions with our own empirical data collected on northern bats (Eptesicus nilssonii) from two latitudes in Norway. The similarities between our predictions and observations provide strong evidence that this model framework incorporates the most important drivers of diurnal decision-making in bat physiology and behaviour. Comparing empirical data and model predictions also revealed that bats facing lighter night conditions further north restrict their mass gain, which strengthens the hypothesis that predation threat is a main driver of bat nocturnality. Our model findings regarding state-dependent decisions in bats should contribute to the understanding of how bats cope with the summer challenges at high latitudes.
Collapse
|
12
|
The practicality of practice: A model of the function of play behaviour. Ecol Evol 2023; 13:e10521. [PMID: 37732285 PMCID: PMC10507573 DOI: 10.1002/ece3.10521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 06/23/2023] [Accepted: 08/30/2023] [Indexed: 09/22/2023] Open
Abstract
The function of play has been a long-debated topic in animal behaviour. One popular class of accounts is that play offers practice for serious adult behaviour, but little has been done to model the circumstances where this could be true. In this paper, we model an individual who, over the juvenile and subadult ontogenetic periods, has a choice between three behaviours: foraging, playing and rest, where playing improves an individual's ability in some component of a serious adult behaviour. Using stochastic dynamic programming, we show that even when play is more energetically costly and an inferior form of practice than foraging itself, it still may be optimal to play under a variety of circumstances. We offer several instantiations of the play as practice concept to show the possibility of play improving a variety of different adult abilities: antipredatory, foraging and reproductive behaviour. These models show the environmental conditions where play might be expected, as well as the predicted occurrences of play throughout ontogeny. This is a first step in showing the ecological feasibility of the practice hypothesis of play and raises further questions about why playful activity is more beneficial than more deliberate directed practice.
Collapse
|
13
|
Research on Global Optimization Algorithm of Integrated Energy and Thermal Management for Plug-In Hybrid Electric Vehicles. SENSORS (BASEL, SWITZERLAND) 2023; 23:7149. [PMID: 37631686 PMCID: PMC10459459 DOI: 10.3390/s23167149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 07/29/2023] [Accepted: 08/11/2023] [Indexed: 08/27/2023]
Abstract
Power distribution and battery thermal management are important technologies for improving the energy efficiency of plug-in hybrid electric vehicles (PHEVs). In response to the global optimization of integrated energy thermal management strategy (IETMS) for PHEVs, a dynamic programming algorithm based on adaptive grid optimization (AGO-DP) is proposed in this paper to improve optimization performance by reducing the optimization range of SOC and battery temperature, and adaptively adjusting the grid distribution of state variables according to the actual feasible region. The simulation results indicate that through AGO-DP optimization, the reduction ratio of the state feasible region is more than 30% under different driving conditions. Meanwhile, the algorithm can obtain better global optimal driving costs more rapidly and accurately than traditional dynamic programming algorithms (DP). The computation time is reduced by 33.29-84.67%, and the accuracy of the global optimal solution is improved by 0.94-16.85% compared to DP. The optimal control of the engine and air conditioning system is also more efficient and reasonable. Furthermore, AGO-DP is applied to explore IETMS energy-saving potential for PHEVs. It is found that the IETMS energy-saving potential range is 3.68-23.74% under various driving conditions, which increases the energy-saving potential by 0.55-3.26% compared to just doing the energy management.
Collapse
|
14
|
Optimization of scheduling scheme for self-driving vehicles by simulation algorithm. Sci Prog 2023; 106:368504231188617. [PMID: 37491947 PMCID: PMC10388342 DOI: 10.1177/00368504231188617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
As a new logistics technology, self-driving electric vehicles not only improve freight efficiency but also promote energy saving and emissions reductions. Aiming at logistics technologies based on self-driving electric vehicles, planning the vehicle scheduling scheme as a whole reduces energy consumption and improves economic and environmental benefits. Targeting an actual freight problem based on a two-way single-lane road connecting the pickup and delivery points and including electric charging stations, this paper proposes a method for optimizing the scheduling scenario and parameters of self-driving vehicles through computer simulations. An optimization model based on dynamic programming is established, and an optimization simulation algorithm is designed to solve the model, effectively solving the overall planning problem of vehicle scheduling. The experimental results show that the model and algorithm have good universality. After specifying an appropriate road length, total number of vehicles, number of spare vehicle batteries, duration of freight transportation, and other necessary information, the simulation algorithm is executed and the optimal scheduling scheme and the total amount of freight transported are output. The efficiency of the algorithm is extremely high, requiring only 1.5 s to complete the whole simulation process of scheduling 150 vehicles for 1000 h over a road with a length of 10 km.
Collapse
|
15
|
Search for Dispersed Repeats in Bacterial Genomes Using an Iterative Procedure. Int J Mol Sci 2023; 24:10964. [PMID: 37446142 DOI: 10.3390/ijms241310964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 06/27/2023] [Accepted: 06/28/2023] [Indexed: 07/15/2023] Open
Abstract
We have developed a de novo method for the identification of dispersed repeats based on the use of random position-weight matrices (PWMs) and an iterative procedure (IP). The created algorithm (IP method) allows detection of dispersed repeats for which the average number of substitutions between any two repeats per nucleotide (x) is less than or equal to 1.5. We have shown that all previously developed methods and algorithms (RED, RECON, and some others) can only find dispersed repeats for x ≤ 1.0. We applied the IP method to find dispersed repeats in the genomes of E. coli and nine other bacterial species. We identify three families of approximately 1.09 × 106, 0.64 × 106, and 0.58 × 106 DNA bases, respectively, constituting almost 50% of the complete E. coli genome. The length of the repeats is in the range of 400 to 600 bp. Other analyzed bacterial genomes contain one to three families of dispersed repeats with a total number of 103 to 6 × 103 copies. The existence of such highly divergent repeats could be associated with the presence of a single-type triplet periodicity in various genes or with the packing of bacterial DNA into a nucleoid.
Collapse
|
16
|
Inferring Optimal Species Trees in the Presence of Gene Duplication and Loss: Beyond Rooted Gene Trees. J Comput Biol 2023; 30:161-175. [PMID: 36251762 DOI: 10.1089/cmb.2021.0522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Estimating species trees from multiple genes is complicated and challenging due to gene tree-species tree discordance. One of the basic approaches to understanding differences between gene trees and species trees is gene duplication and loss events. Minimize Gene Duplication and Loss (MGDL) is a popular technique for inferring species trees from gene trees when the gene trees are discordant due to gene duplications and losses. Previously, exact algorithms for estimating species trees from rooted, binary trees under MGDL were proposed. However, gene trees are usually estimated using time-reversible mutation models, which result in unrooted trees. In this article, we propose a dynamic programming (DP) algorithm that can be used for an exact but exponential time solution for the case when gene trees are not rooted. We also show that a constrained version of this problem can be solved by this DP algorithm in time that is polynomial in the number of gene trees and taxa. We have proved important structural properties that allow us to extend the algorithms for rooted gene trees to unrooted gene trees. We propose a linear time algorithm for finding the optimal rooted version of an unrooted gene tree given a rooted species tree so that the duplication and loss cost is minimized. Moreover, we prove that the optimal rooting under MGDL is also optimal under the MDC (minimize deep coalescence) criterion. The proposed methods can be applied to both orthologous genes and gene families that by definition include both paralogs and orthologs. Therefore, we hope that these techniques will be useful for estimating species trees from genes sampled throughout the whole genome.
Collapse
|
17
|
Detecting Non-Overlapping Signals with Dynamic Programming. ENTROPY (BASEL, SWITZERLAND) 2023; 25:250. [PMID: 36832618 PMCID: PMC9955077 DOI: 10.3390/e25020250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Revised: 01/23/2023] [Accepted: 01/27/2023] [Indexed: 06/18/2023]
Abstract
This paper studies the classical problem of detecting the locations of signal occurrences in a one-dimensional noisy measurement. Assuming the signal occurrences do not overlap, we formulate the detection task as a constrained likelihood optimization problem and design a computationally efficient dynamic program that attains its optimal solution. Our proposed framework is scalable, simple to implement, and robust to model uncertainties. We show by extensive numerical experiments that our algorithm accurately estimates the locations in dense and noisy environments, and outperforms alternative methods.
Collapse
|
18
|
Heating Control Strategy Based on Dynamic Programming for Building Energy Saving and Emission Reduction. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:14137. [PMID: 36361012 PMCID: PMC9653744 DOI: 10.3390/ijerph192114137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 10/24/2022] [Accepted: 10/27/2022] [Indexed: 06/16/2023]
Abstract
Finding the optimal balance between end-user's comfort, lifestyle preferences and the cost of the heating, ventilation and air conditioning (HVAC) system, which requires intelligent decision making and control. This paper proposes a heating control method for HVAC based on dynamic programming. The method first selects the most suitable modeling approach for the controlled building among three machine learning modeling techniques by means of statistical performance metrics, after which the control of the HVAC system is described as a constrained optimization problem, and the action of the controller is given by solving the optimization problem through dynamic programming. In this paper, the variable 'thermal energy storage in building' is introduced to solve the problem that dynamic programming is difficult to obtain the historical state of the building due to the requirement of no aftereffect, while the room temperature and the remaining start hours of the Primary Air Unit are selected to describe the system state through theoretical analysis and trial and error. The results of the TRNSYS/Python co-simulation show that the proposed method can maintain better indoor thermal environment with less energy consumption compared to carefully reviewed expert rules. Compared with expert rule set 'baseline-20 °C', which keeps the room temperature at the minimum comfort level, the proposed control algorithm can save energy and reduce emissions by 35.1% with acceptable comfort violation.
Collapse
|
19
|
Normative decision rules in changing environments. eLife 2022; 11:e79824. [PMID: 36282065 PMCID: PMC9754630 DOI: 10.7554/elife.79824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 10/20/2022] [Indexed: 11/13/2022] Open
Abstract
Models based on normative principles have played a major role in our understanding of how the brain forms decisions. However, these models have typically been derived for simple, stable conditions, and their relevance to decisions formed under more naturalistic, dynamic conditions is unclear. We previously derived a normative decision model in which evidence accumulation is adapted to fluctuations in the evidence-generating process that occur during a single decision (Glaze et al., 2015), but the evolution of commitment rules (e.g. thresholds on the accumulated evidence) under dynamic conditions is not fully understood. Here, we derive a normative model for decisions based on changing contexts, which we define as changes in evidence quality or reward, over the course of a single decision. In these cases, performance (reward rate) is maximized using decision thresholds that respond to and even anticipate these changes, in contrast to the static thresholds used in many decision models. We show that these adaptive thresholds exhibit several distinct temporal motifs that depend on the specific predicted and experienced context changes and that adaptive models perform robustly even when implemented imperfectly (noisily). We further show that decision models with adaptive thresholds outperform those with constant or urgency-gated thresholds in accounting for human response times on a task with time-varying evidence quality and average reward. These results further link normative and neural decision-making while expanding our view of both as dynamic, adaptive processes that update and use expectations to govern both deliberation and commitment.
Collapse
|
20
|
Dynamic Programming BN Structure Learning Algorithm Integrating Double Constraints under Small Sample Condition. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1354. [PMID: 37420374 DOI: 10.3390/e24101354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/09/2022] [Accepted: 09/21/2022] [Indexed: 07/09/2023]
Abstract
The Bayesian Network (BN) structure learning algorithm based on dynamic programming can obtain global optimal solutions. However, when the sample cannot fully contain the information of the real structure, especially when the sample size is small, the obtained structure is inaccurate. Therefore, this paper studies the planning mode and connotation of dynamic programming, restricts its process with edge and path constraints, and proposes a dynamic programming BN structure learning algorithm with double constraints under small sample conditions. The algorithm uses double constraints to limit the planning process of dynamic programming and reduces the planning space. Then, it uses double constraints to limit the selection of the optimal parent node to ensure that the optimal structure conforms to prior knowledge. Finally, the integrating prior-knowledge method and the non-integrating prior-knowledge method are simulated and compared. The simulation results verify the effectiveness of the method proposed and prove that the integrating prior knowledge can significantly improve the efficiency and accuracy of BN structure learning.
Collapse
|
21
|
Nonmyopic and pseudo-nonmyopic approaches to optimal sequential design in the presence of covariates. J STAT COMPUT SIM 2022; 93:581-603. [PMID: 36968627 PMCID: PMC10035582 DOI: 10.1080/00949655.2022.2113788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 08/12/2022] [Indexed: 10/14/2022]
Abstract
In sequential experiments, subjects become available for the study over a period of time, and covariates are often measured at the time of arrival. We consider the setting where the sample size is fixed but covariate values are unknown until subjects enrol. Given a model for the outcome, a sequential optimal design approach can be used to allocate treatments to minimize the variance of the estimator of the treatment effect. We extend existing optimal design methodology so it can be used within a nonmyopic framework, where treatment allocation for the current subject depends not only on the treatments and covariates of the subjects already enrolled in the study, but also the impact of possible future treatment assignments within a specified horizon. The nonmyopic approach requires recursive formulae and suffers from the curse of dimensionality. We propose a pseudo-nonmyopic approach which has a similar aim to the nonmyopic approach, but does not involve recursion and instead relies on simulating trajectories of future possible decisions. Our simulation studies show that, for the simple case of a logistic regression with a single binary covariate and a binary treatment, and a more realistic case with four binary covariates, binary treatment and treatment-covariate interactions, the nonmyopic and pseudo-nonmyopic approaches provide no competitive advantage over the myopic approach, both in terms of the size of the estimated treatment effect and also the efficiency of the designs. Results are robust to the size of the horizon used in the nonmyopic approach, and the number of simulated trajectories used in the pseudo-nonmyopic approach.
Collapse
|
22
|
ARPIP: Ancestral sequence Reconstruction with insertions and deletions under the Poisson Indel Process. Syst Biol 2022:6648472. [PMID: 35866991 DOI: 10.1093/sysbio/syac050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 07/06/2022] [Indexed: 11/12/2022] Open
Abstract
Modern phylogenetic methods allow inference of ancestral molecular sequences given an alignment and phylogeny relating present day sequences. This provides insight into the evolutionary history of molecules, helping to understand gene function and to study biological processes such as adaptation and convergent evolution across a variety of applications. Here we propose a dynamic programming algorithm for fast joint likelihood-based reconstruction of ancestral sequences under the Poisson Indel Process (PIP). Unlike previous approaches, our method, named ARPIP, enables the reconstruction with insertions and deletions based on an explicit indel model. Consequently, inferred indel events have an explicit biological interpretation. Likelihood computation is achieved in linear time with respect to the number of sequences. Our method consists of two steps, namely finding the most probable indel points and reconstructing ancestral sequences. First, we find the most likely indel points and prune the phylogeny to reflect the insertion and deletion events per site. Second, we infer the ancestral states on the pruned subtree in a manner similar to FastML. We applied ARPIP on simulated datasets and on real data from the Betacoronavirus genus. ARPIP reconstructs both the indel events and substitutions with a high degree of accuracy. Our method fares well when compared to established state-of-the-art methods such as FastML and PAML. Moreover, the method can be extended to explore both optimal and suboptimal reconstructions, include rate heterogeneity through time and more. We believe it will expand the range of novel applications of ancestral sequence reconstruction.
Collapse
|
23
|
Fast, Ungapped Reads Mapping Using Squid. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19095442. [PMID: 35564837 PMCID: PMC9103773 DOI: 10.3390/ijerph19095442] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 03/30/2022] [Accepted: 04/22/2022] [Indexed: 01/02/2023]
Abstract
Advances in Next Generation Sequencing technologies allow us to inspect and unlock the genome to a level of detail that was unimaginable only a few decades ago. Omics-based studies are casting a light on the patterns and determinants of disease conditions in populations, as well as on the influence of microbial communities on human health, just to name a few. Through increasing volumes of sequencing information, for example, it is possible to compare genomic features and analyze the modulation of the transcriptome under different environmental stimuli. Although protocols for NGS preparation are intended to leave little to no space for contamination of any kind, a noticeable fraction of sequencing reads still may not uniquely represent what was intended to be sequenced in the first place. If a natural consequence of a sequencing sample is to assess the presence of features of interest by mapping the obtained reads to a genome of reference, sometimes it is useful to determine the fraction of those that do not map, or that map discordantly, and store this information to a new file for subsequent analyses. Here we propose a new mapper, which we called Squid, that among other accessory functionalities finds and returns sequencing reads that match or do not match to a reference sequence database in any orientation. We encourage the use of Squid prior to any quantification pipeline to assess, for instance, the presence of contaminants, especially in RNA-Seq experiments.
Collapse
|
24
|
Application of the MAHDS Method for Multiple Alignment of Highly Diverged Amino Acid Sequences. Int J Mol Sci 2022; 23:ijms23073764. [PMID: 35409125 PMCID: PMC8998981 DOI: 10.3390/ijms23073764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/23/2022] [Accepted: 03/23/2022] [Indexed: 12/10/2022] Open
Abstract
The aim of this work was to compare the multiple alignment methods MAHDS, T-Coffee, MUSCLE, Clustal Omega, Kalign, MAFFT, and PRANK in their ability to align highly divergent amino acid sequences. To accomplish this, we created test amino acid sequences with an average number of substitutions per amino acid (x) from 0.6 to 5.6, a total of 81 sets. Comparison of the performance of sequence alignments constructed by MAHDS and previously developed algorithms using the CS and Z score criteria and the benchmark alignment database (BAliBASE) indicated that, although the quality of the alignments built with MAHDS was somewhat lower than that of the other algorithms, it was compensated by greater statistical significance. MAHDS could construct statistically significant alignments of artificial sequences with x ≤ 4.8, whereas the other algorithms (T-Coffee, MUSCLE, Clustal Omega, Kalign, MAFFT, and PRANK) could not perform that at x > 2.4. The application of MAHDS to align 21 families of highly diverged proteins (identity < 20%) from Pfam and HOMSTRAD databases showed that it could calculate statistically significant alignments in cases when the other methods failed. Thus, MAHDS could be used to construct statistically significant multiple alignments of highly divergent protein sequences, which accumulated multiple mutations during evolution.
Collapse
|
25
|
Disentangling relationships in symptom networks using matrix permutation methods. PSYCHOMETRIKA 2022; 87:133-155. [PMID: 34282531 DOI: 10.1007/s11336-021-09760-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 02/12/2021] [Accepted: 03/18/2021] [Indexed: 06/13/2023]
Abstract
Common outputs of software programs for network estimation include association matrices containing the edge weights between pairs of symptoms and a plot of the symptom network. Although such outputs are useful, it is sometimes difficult to ascertain structural relationships among symptoms from these types of output alone. We propose that matrix permutation provides a simple, yet effective, approach for clarifying the order relationships among the symptoms based on the edge weights of the network. For directed symptom networks, we use a permutation criterion that has classic applications in electrical circuit theory and economics. This criterion can be used to place symptoms that strongly predict other symptoms at the beginning of the ordering, and symptoms that are strongly predicted by other symptoms at the end. For undirected symptom networks, we recommend a permutation criterion that is based on location theory in the field of operations research. When using this criterion, symptoms with many strong ties tend to be placed centrally in the ordering, whereas weakly-tied symptoms are placed at the ends. The permutation optimization problems are solved using dynamic programming. We also make use of branch-search algorithms for extracting maximum cardinality subsets of symptoms that have perfect structure with respect to a selected criterion. Software for implementing the dynamic programming algorithms is available in MATLAB and R. Two networks from the literature are used to demonstrate the matrix permutation algorithms.
Collapse
|
26
|
Optimal composition of chloride cells for osmoregulation in a randomly fluctuating environment. J Theor Biol 2022; 537:111016. [PMID: 35026211 DOI: 10.1016/j.jtbi.2022.111016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 12/01/2021] [Accepted: 01/05/2022] [Indexed: 11/19/2022]
Abstract
Fish live in water with a different osmotic pressure from that in the body. Their gills have chloride cells that transport ions to maintain an appropriate level of osmotic pressure in the body. The direction of ion transport is different between seawater and freshwater. There are two types of chloride cells that specialize in unidirectional transport and generalist cells that can switch their function quickly in response to environmental salinity. In species that experience salinity changes throughout life (euryhaline species), individuals may replace some chloride cells with cells of different types upon a sudden change in environmental salinity. In this paper, we develop a dynamic optimization model for the chloride cell composition of an individual living in an environment with randomly fluctuating salinity. The optimal solution is to minimize the sum of the workload of chloride cells in coping with the difference in osmotic pressure, the maintenance cost, and the temporal cost due to environmental change. The optimal fraction of generalist chloride cells increases with the frequency of salinity changes and the time needed for new cells to be fully functional but decreases with excess maintenance cost.
Collapse
|
27
|
Efficient gradient computation for optimization of hyperparameters. Phys Med Biol 2021; 67. [PMID: 34920440 DOI: 10.1088/1361-6560/ac4442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/17/2021] [Indexed: 11/11/2022]
Abstract
We are interested in learning the hyperparameters in a convex objective function in a supervised setting. The complex relationship between the input data to the convex problem and the desirable hyperparameters can be modeled by a neural network; the hyperparameters and the data then drive the convex minimization problem, whose solution is then compared to training labels. In our previous work [1], we evaluated a prototype of this learning strategy in an optimization-based sinogram smoothing plus FBP reconstruction framework. A question arising in this setting is how to efficiently compute (backpropagate) the gradient from the solution of the optimization problem, to the hyperparameters to enable end-to-end training. In this work, we first develop general formulas for gradient backpropagation for a subset of convex problems, namely the proximal mapping. To illustrate the value of the general formulas and to demonstrate how to use them, we consider the specific instance of 1-D quadratic smoothing (denoising) whose solution admits a dynamic programming (DP) algorithm. The general formulas lead to another DP algorithm for exact computation of the gradient of the hyperparameters. Our numerical studies demonstrate a 55%- 65% computation time savings by providing a custom gradient instead of relying on automatic differentiation in deep learning libraries. While our discussion focuses on 1-D quadratic smoothing, our initial results (not presented) support the statement that the general formulas and the computational strategy apply equally well to TV or Huber smoothing problems on simple graphs whose solutions can be computed exactly via DP.
Collapse
|
28
|
Graph Theoretic Approach for the Analysis of Comprehensive Mass-Spectrometry (MS/MS) Data of Dissolved Organic Matter. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2021; 2021:3742-3746. [PMID: 35425661 PMCID: PMC9007174 DOI: 10.1109/bibm52615.2021.9669289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Dissolved organic matter (DOM) is a highly complex mixture of organic substances found in aquatic ecosystems. This mixture results from the degradation of primary producers within the ecosystem, groundwater, and the surrounding terrestrial sources. Understanding the chemical structure of DOM is crucial to assessing its impact on aquatic ecosystems. Although multiple studies have addressed the complexity of DOM, the molecular structure of this set of compounds remains unclear. In this work, we present a novel computational framework "Graph-DOM," to assess the comprehensive fragmentation data obtained from the analysis of DOM using the Data Independent Fragmentation strategy with ESI-FT-ICR MS/MS enabling better understanding of the structural complexity of DOM. Graph-DOM uses graph algorithms to dissect a compiled output file obtained from processing hundreds of ultra-high-resolution fragment spectra. Over half a million ordered fragmentation pathways were computed for 764 isolated precursor ions assuming up to seven vector segments categorized as neutral losses (CH2, CH3, O, CH4, H2O, CO, and CO2). Families of structurally related molecules were identified using pathway overlaps, and output files compatible with network visualization software (e.g., Cytoscape) were also generated. Graph-DOM is able to efficiently process all the pathways to discover families within only a few minutes with adjustable parameters for overlap length of fragmentation pathways as well as configuring low abundance CHOS, CHON, and CHONS compounds. Graph-DOM is available at https://github.com/Usman095/Graph-DOM.
Collapse
|
29
|
Thymus Gland: A Double Edge Sword for Coronaviruses. Vaccines (Basel) 2021; 9:vaccines9101119. [PMID: 34696231 PMCID: PMC8539924 DOI: 10.3390/vaccines9101119] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 09/23/2021] [Accepted: 09/25/2021] [Indexed: 02/06/2023] Open
Abstract
The thymus is the main lymphoid organ that regulates the immune and endocrine systems by controlling thymic cell proliferation and differentiation. The gland is a primary lymphoid organ responsible for generating mature T cells into CD4+ or CD8+ single-positive (SP) T cells, contributing to cellular immunity. Regarding humoral immunity, the thymic plasma cells almost exclusively secrete IgG1 and IgG3, the two main complement-fixing effector IgG subclasses. Deformity in the thymus can lead to inflammatory diseases. Hassall’s corpuscles’ epithelial lining produces thymic stromal lymphopoietin, which induces differentiation of CDs thymocytes into regulatory T cells within the thymus medulla. Thymic B lymphocytes produce immunoglobulins and immunoregulating hormones, including thymosin. Modulation in T cell and naive T cells decrement due to thymus deformity induce alteration in the secretion of various inflammatory factors, resulting in multiple diseases. Influenza virus activates thymic CD4+ CD8+ thymocytes and a large amount of IFNγ. IFNs limit virus spread, enhance macrophages’ phagocytosis, and promote the natural killer cell restriction activity against infected cells. Th2 lymphocytes-produced cytokine IL-4 can bind to antiviral INFγ, decreasing the cell susceptibility and downregulating viral receptors. COVID-19 epitopes (S, M, and N proteins) with ≥90% identity to the SARS-CoV sequence have been predicted. These epitopes trigger immunity for antibodies production. Boosting the immune system by improving thymus function can be a therapeutic strategy for preventing virus-related diseases. This review aims to summarize the endocrine-immunoregulatory functions of the thymus and the underlying mechanisms in the prevention of COVID-19.
Collapse
|
30
|
Nesting Synchrony and Clutch Size in Migratory Birds: Capital versus Income Breeding Determines Responses to Variable Spring Onset. Am Nat 2021; 198:E122-E135. [PMID: 34559609 DOI: 10.1086/716064] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
AbstractSynchronous reproduction of birds has often been explained by benefits from nesting together, but this concept fails to explain observed intraspecific variation and climate-mediated changes of breeding synchrony. Here, we present a theoretical model of birds that store resources for reproduction (capital breeders) to show how breeding synchrony, clutch size, and offspring recruitment respond to changes in timing of first possible breeding date. Our approach is based on individual fitness maximization when both prebreeding foraging and offspring development are time constrained. The model predicts less synchronous breeding, smaller clutch size, and higher chances for offspring recruitment in capital breeding birds that advance their nesting. For contrast, we also show that birds that need to acquire resources during egg laying (income breeders) do not change nesting synchrony but increase clutch size along with earlier breeding. The prediction of stronger nesting synchronization of capital breeders in years with late nesting onset is confirmed by empirical data on breeding synchrony of a high-latitude capital breeding sea duck, the common eider (Somateria mollissima). We predict that in warming high-latitude ecosystems, bird species that depend on stored reserves for reproduction are expected to desynchronize their nesting.
Collapse
|
31
|
Patient-specific hyperparameter learning for optimization-based CT image reconstruction. Phys Med Biol 2021; 66:10.1088/1361-6560/ac0f9a. [PMID: 34186530 PMCID: PMC8584383 DOI: 10.1088/1361-6560/ac0f9a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 06/29/2021] [Indexed: 11/11/2022]
Abstract
We propose a hyperparameter learning framework that learnspatient-specifichyperparameters for optimization-based image reconstruction problems for x-ray CT applications. The framework consists of two functional modules: (1) a hyperparameter learning module parameterized by a convolutional neural network, (2) an image reconstruction module that takes as inputs both the noisy sinogram and the hyperparameters from (1) and generates the reconstructed images. As a proof-of-concept study, in this work we focus on a subclass of optimization-based image reconstruction problems with exactly computable solutions so that the whole network can be trained end-to-end in an efficient manner. Unlike existing hyperparameter learning methods, our proposed framework generates patient-specific hyperparameters from the sinogram of the same patient. Numerical studies demonstrate the effectiveness of our proposed approach compared to bi-level optimization.
Collapse
|
32
|
Role of Annual Influenza Vaccination against Lung Cancer in Type 2 Diabetic Patients from a Population-Based Cohort Study. J Clin Med 2021; 10:jcm10153434. [PMID: 34362218 PMCID: PMC8347140 DOI: 10.3390/jcm10153434] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 07/14/2021] [Accepted: 07/16/2021] [Indexed: 01/16/2023] Open
Abstract
Type 2 diabetes mellitus (DM) patients are at a higher risk for developing lung cancer due to immune dysfunction and chronic inflammation. They also have increased morbidity and mortality related to influenza, and it is recommended that they receive an annual influenza vaccination. In this study, we evaluate whether influenza vaccination could reduce the incidence of lung cancer in DM patients. This cohort study included DM patients (≥55 years old) between 1 January 2002 and 31 December 2012 by using the Taiwan Health Insurance Database. Cox proportional hazard regression method was used to compare the relation between the influenza vaccination and lung cancer incidence after adjusting for potential confounders. Sub-group analyses were done according to vaccination status (unvaccinated, total number of vaccinations: 1, 2–3, ≥4) and evaluated the dose-dependent effects on lung cancer events. Among 22,252 eligible DM patients, 7860 (35.32%) received an influenza vaccination and 67.68% (14392) did not receive an influenza vaccination. Lung cancer incidence was significantly lower in the vaccinated group versus the unvaccinated group (adjusted HR 0.77; 95% CI 0.62–0.95, p < 0.05). Significant protective effects were observed among male sex (adjusted HR 0.72; 95% CI 0.55–0.94, p < 0.05) and 55–64 year (adjusted HR 0.61; 95% CI 0.40–0.94, p < 0.05) and ≥75 year (adjusted HR 0.63; 95% CI 0.42–0.92, p < 0.05) age groups, respectively. A dose-dependent protective effect was noted with a significant protective effect in those that received ≥4 vaccinations (adjusted HR 0.42; 95% CI 0.29–0.61, p < 0.001). In sub-group analysis, elder patients with ≥65 years of age were significantly protected from ≥4 vaccinations (adjusted HR 0.37; 95% CI 0.23–0.62, p < 0.001 in 65–74 years and adjusted HR 0.31; 95% CI 0.15–0.66, p = 0.002 in ≥75 years group, respectively). Male sex with ≥4 vaccinations had a significantly lower risk of lung cancer (adjusted HR 0.35; 95% CI 0.21–0.57, p < 0.001). Patients with comorbid conditions that received ≥4 vaccinations were also protected, and was especially significant among those with CCI ≥ 3 (adjusted HR 0.38; 95% CI 0.18–0.80, p = 0.009) as compared to 1 and 2–3 vaccination groups, including those with hypertension (adjusted HR 0.35; 95% CI 0.22–0.57, p < 0.001). This population-based cohort study demonstrated that annual influenza vaccination significantly reduced the lung cancer risk in DM patients and specifically demonstrates that a higher number of vaccinations is related with a more protective effect. Whether this is due to vaccine booster effects on anti-tumor immune regulation among DM patients still needs to be explored.
Collapse
|
33
|
A Novel Approach to Calculate the Spatial-Temporal Correlation for Traffic Flow Based on the Structure of Urban Road Networks and Traffic Dynamic Theory. SENSORS 2021; 21:s21144725. [PMID: 34300464 PMCID: PMC8309648 DOI: 10.3390/s21144725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 06/30/2021] [Accepted: 07/06/2021] [Indexed: 11/16/2022]
Abstract
Determining the spatial-temporal correlation (STC) between roads can help clarify the operation characteristics of road traffic. Moreover, this correlation affects the utilization quality of traffic data in related research fields. Therefore, it is of significance to provide more reasonable correlation information for other research, such as in traffic speed prediction. Most of the traditional correlation calculation methods for traffic are based on only statistical theory. These methods are simple, but their ability to explain the actual phenomenon is limited due to the lack of consideration of the actual traffic operation characteristics. Therefore, to provide more reasonable correlation information between roads, this paper analysed the influence mechanism of urban traffic based on the traffic dynamic model, and two parameters, traffic complete influence time and traffic correlation strength, were proposed to bring physical meaning to the calculation of STC. Then, an improved calculation model of the STC between different roads considering the adjacency between roads was proposed in this paper. Finally, this paper verified this method against two common traditional methods through different experiments. The verification results show that the calculation method proposed in this paper has better interpretability for the STC between different roads and can better reveal the internal traffic operation characteristics of the road network.
Collapse
|
34
|
Dynamic Programming for Resource Allocation in Multi-Allelic Trait Introgression. FRONTIERS IN PLANT SCIENCE 2021; 12:544854. [PMID: 34220873 PMCID: PMC8253225 DOI: 10.3389/fpls.2021.544854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 05/21/2021] [Indexed: 06/13/2023]
Abstract
Trait introgression is a complex process that plant breeders use to introduce desirable alleles from one variety or species to another. Two of the major types of decisions that must be made during this sophisticated and uncertain workflow are: parental selection and resource allocation. We formulated the trait introgression problem as an engineering process and proposed a Markov Decision Processes (MDP) model to optimize the resource allocation procedure. The efficiency of the MDP model was compared with static resource allocation strategies and their trade-offs among budget, deadline, and probability of success are demonstrated. Simulation results suggest that dynamic resource allocation strategies from the MDP model significantly improve the efficiency of the trait introgression by allocating the right amount of resources according to the genetic outcome of previous generations.
Collapse
|
35
|
Abstract
Alignments of discrete objects can be constructed in a very general setting as super-objects from which the constituent objects are recovered by means of projections. Here, we focus on contact maps, i.e. undirected graphs with an ordered set of vertices. These serve as natural discretizations of RNA and protein structures. In the general case, the alignment problem for vertex-ordered graphs is NP-complete. In the special case of RNA secondary structures, i.e. crossing-free matchings, however, the alignments have a recursive structure. The alignment problem then can be solved by a variant of the Sankoff algorithm in polynomial time. Moreover, the tree or forest alignments of RNA secondary structure can be understood as the alignments of ordered edge sets.
Collapse
|
36
|
Levenshtein Distance, Sequence Comparison and Biological Database Search. IEEE TRANSACTIONS ON INFORMATION THEORY 2021; 67:3287-3294. [PMID: 34257466 PMCID: PMC8274556 DOI: 10.1109/tit.2020.2996543] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Levenshtein edit distance has played a central role-both past and present-in sequence alignment in particular and biological database similarity search in general. We start our review with a history of dynamic programming algorithms for computing Levenshtein distance and sequence alignments. Following, we describe how those algorithms led to heuristics employed in the most widely used software in bioinformatics, BLAST, a program to search DNA and protein databases for evolutionarily relevant similarities. More recently, the advent of modern genomic sequencing and the volume of data it generates has resulted in a return to the problem of local alignment. We conclude with how the mathematical formulation of Levenshtein distance as a metric made possible additional optimizations to similarity search in biological contexts. These modern optimizations are built around the low metric entropy and fractional dimensionality of biological databases, enabling orders of magnitude acceleration of biological similarity search.
Collapse
|
37
|
Detection of Highly Divergent Tandem Repeats in the Rice Genome. Genes (Basel) 2021; 12:genes12040473. [PMID: 33806152 PMCID: PMC8064497 DOI: 10.3390/genes12040473] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 03/11/2021] [Accepted: 03/23/2021] [Indexed: 11/25/2022] Open
Abstract
Currently, there is a lack of bioinformatics approaches to identify highly divergent tandem repeats (TRs) in eukaryotic genomes. Here, we developed a new mathematical method to search for TRs, which uses a novel algorithm for constructing multiple alignments based on the generation of random position weight matrices (RPWMs), and applied it to detect TRs of 2 to 50 nucleotides long in the rice genome. The RPWM method could find highly divergent TRs in the presence of insertions or deletions. Comparison of the RPWM algorithm with the other methods of TR identification showed that RPWM could detect TRs in which the average number of base substitutions per nucleotide (x) was between 1.5 and 3.2, whereas T-REKS and TRF methods could not detect divergent TRs with x > 1.5. Applied to the search of TRs in the rice genome, the RPWM method revealed that TRs occupied 5% of the genome and that most of them were 2 and 3 bases long. Using RPWM, we also revealed the correlation of TRs with dispersed repeats and transposons, suggesting that some transposons originated from TRs. Thus, the novel RPWM algorithm is an effective tool to search for highly divergent TRs in the genomes.
Collapse
|
38
|
Use of Mathematical Methods for the Biosafety Assessment of Agricultural Crops. APPL BIOCHEM MICRO+ 2021; 57:271-279. [PMID: 33727728 PMCID: PMC7952145 DOI: 10.1134/s000368382102006x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 10/30/2020] [Accepted: 11/02/2020] [Indexed: 12/01/2022]
Abstract
In Russia and around the world, there are important questions regarding the potential threats to national and biological safety created by genetic technologies and the need to improve or introduce new, justified, and adequate measures for their control, regulation, and prevention. The article shows that a significant volume of the global market is occupied by five major transgenic crops, and producers are ready to switch to crops with an edited genome that has been approved in the United States, Argentina, and other countries. We propose a qualitatively new approach to the risk assessment of edited plants, "Safe Design," and we have also developed an extremely important, fundamentally new approach to the development of methods that combine next-generation sequencing (NGS) and Bioinformatics for the assessment of the crop import biosafety. The proposed mathematical approach provides a detailed analysis of the possible insertions of DNA fragments into the genome of edited crops and a clarification of their biological significance. The developed method can be used in the rapid screening of plants for the presence of potentially dangerous genes, viral sequences, and nonspecific promoter sequences.
Collapse
|
39
|
Multiple Alignment of Promoter Sequences from the Arabidopsis thaliana L. Genome. Genes (Basel) 2021; 12:135. [PMID: 33494278 PMCID: PMC7909805 DOI: 10.3390/genes12020135] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/15/2021] [Accepted: 01/18/2021] [Indexed: 11/16/2022] Open
Abstract
In this study, we developed a new mathematical method for performing multiple alignment of highly divergent sequences (MAHDS), i.e., sequences that have on average more than 2.5 substitutions per position (x). We generated sets of artificial DNA sequences with x ranging from 0 to 4.4 and applied MAHDS as well as currently used multiple sequence alignment algorithms, including ClustalW, MAFFT, T-Coffee, Kalign, and Muscle to these sets. The results indicated that most of the existing methods could produce statistically significant alignments only for the sets with x < 2.5, whereas MAHDS could operate on sequences with x = 4.4. We also used MAHDS to analyze a set of promoter sequences from the Arabidopsis thaliana genome and discovered many conserved regions upstream of the transcription initiation site (from -499 to +1 bp); a part of the downstream region (from +1 to +70 bp) also significantly contributed to the obtained alignments. The possibilities of applying the newly developed method for the identification of promoter sequences in any genome are discussed. A server for multiple alignment of nucleotide sequences has been created.
Collapse
|
40
|
Demographic and Socioeconomic Determinants of Body Mass Index in People of Working Age. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17218168. [PMID: 33167352 PMCID: PMC7663841 DOI: 10.3390/ijerph17218168] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/27/2020] [Accepted: 11/03/2020] [Indexed: 12/13/2022]
Abstract
Obesity is currently the most common metabolic disease, causing numerous health problems and, if untreated, leading to premature mortality. Obesity is a significant issue among people of working age since their ability to work depends directly on their health condition and psychomotor fitness. Demographic and socioeconomic factors have a significant impact on the body weight of people of working age. The aim of this study is to identify relationships between the body mass index and selected demographic and socioeconomic variables in working-age residents of the city of Wrocław, Poland. The study involved 4315 respondents (2206 women and 2109 men) aged 18–64 years from Wrocław. The sample selection was random and purposive, using multilevel stratification. The applied research tool was the authors’ own cross-sectional diagnostic questionnaire of socioeconomic status. Based on the collected data, the respondents’ body weight was categorized according to WHO criteria. The majority of respondents (60%) had a normal body weight, while 40% were categorized as overweight or obese. The difference was statistically significant (p < 0.001). Sex, age, occupational status, marital status, number of people in the household, having a steady source of income, disposable (net) income, and savings were significantly correlated (p < 0.001) with respondents’ body mass index. Public health programs aimed at promoting healthy lifestyle behaviors should be addressed primarily to groups at the highest risk of overweight and obesity.
Collapse
|
41
|
Optimization of Selective Assembly for Shafts and Holes Based on Relative Entropy and Dynamic Programming. ENTROPY (BASEL, SWITZERLAND) 2020; 22:e22111211. [PMID: 33286979 PMCID: PMC7712107 DOI: 10.3390/e22111211] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 10/18/2020] [Accepted: 10/24/2020] [Indexed: 06/12/2023]
Abstract
Selective assembly is the method of obtaining high precision assemblies from relatively low precision components. For precision instruments, the geometric error on mating surface is an important factor affecting assembly accuracy. Different from the traditional selective assembly method, this paper proposes an optimization method of selective assembly for shafts and holes based on relative entropy and dynamic programming. In this method, relative entropy is applied to evaluate the clearance uniformity between shafts and holes, and dynamic programming is used to optimize selective assembly of batches of shafts and holes. In this paper, the case studied has 8 shafts and 20 holes, which need to be assembled into 8 products. The results show that optimal combinations are selected, which provide new insights into selective assembly optimization and lay the foundation for selective assembly of multi-batch precision parts.
Collapse
|
42
|
A Novel Attribute-Based Symmetric Multiple Instance Learning for Histopathological Image Analysis. IEEE TRANSACTIONS ON MEDICAL IMAGING 2020; 39:3125-3136. [PMID: 32305904 PMCID: PMC7561004 DOI: 10.1109/tmi.2020.2987796] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Histopathological image analysis is a challenging task due to a diverse histology feature set as well as due to the presence of large non-informative regions in whole slide images. In this paper, we propose a multiple-instance learning (MIL) method for image-level classification as well as for annotating relevant regions in the image. In MIL, a common assumption is that negative bags contain only negative instances while positive bags contain one or more positive instances. This asymmetric assumption may be inappropriate for some application scenarios where negative bags also contain representative negative instances. We introduce a novel symmetric MIL framework associating each instance in a bag with an attribute which can be either negative, positive, or irrelevant. We extend the notion of relevance by introducing control over the number of relevant instances. We develop a probabilistic graphical model that incorporates the aforementioned paradigm and a corresponding computationally efficient inference for learning the model parameters and obtaining an instance level attribute-learning classifier. The effectiveness of the proposed method is evaluated on available histopathology datasets with promising results.
Collapse
|
43
|
Dynamic High-Sensitivity Quantitation of Procollagen-I by Endogenous CRISPR-Cas9 NanoLuciferase Tagging. Cells 2020; 9:cells9092070. [PMID: 32927811 PMCID: PMC7564849 DOI: 10.3390/cells9092070] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2020] [Revised: 09/04/2020] [Accepted: 09/07/2020] [Indexed: 02/07/2023] Open
Abstract
The ability to quantitate a protein of interest temporally and spatially at subcellular resolution in living cells would generate new opportunities for research and drug discovery, but remains a major technical challenge. Here, we describe dynamic, high-sensitivity protein quantitation technique using NanoLuciferase (NLuc) tagging, which is effective across microscopy and multiwell platforms. Using collagen as a test protein, the CRISPR-Cas9-mediated introduction of nluc (encoding NLuc) into the Col1a2 locus enabled the simplification and miniaturisation of procollagen-I (PC-I) quantitation. Collagen was chosen because of the clinical interest in its dysregulation in cardiovascular and musculoskeletal disorders, and in fibrosis, which is a confounding factor in 45% of deaths, including those brought about by cancer. Collagen is also the cargo protein of choice for studying protein secretion because of its unusual shape and size. However, the use of overexpression promoters (which drowns out endogenous regulatory mechanisms) is often needed to achieve good signal/noise ratios in fluorescence microscopy of tagged collagen. We show that endogenous knock-in of NLuc, combined with its high brightness, negates the need to use exogenous promoters, preserves the circadian regulation of collagen synthesis and the responsiveness to TGF-β, and enables time-lapse microscopy of intracellular transport compartments containing procollagen cargo. In conclusion, we demonstrate the utility of CRISPR-Cas9-mediated endogenous NLuc tagging to robustly quantitate extracellular, intracellular, and subcellular protein levels and localisation.
Collapse
|
44
|
A Dynamic Programming Setting for Functionally Graded Thick-Walled Cylinders. MATERIALS 2020; 13:ma13183988. [PMID: 32916876 PMCID: PMC7560176 DOI: 10.3390/ma13183988] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2020] [Revised: 09/03/2020] [Accepted: 09/07/2020] [Indexed: 11/16/2022]
Abstract
Material property variation in non-homogeneous internally pressurized thick-walled cylinders is investigated within the context of dynamic programming theory. The material is assumed to be linear, elastic, isotropic, and functionally graded in the radial direction. Based on the plane stress hypothesis, a state space formulation is given and the optimal control problem is stated and solved by means of Pontryagin’s Principle for different objective functionals. Optimal Young’s modulus distribution is found to be piecewise linear along the radial domain. A brief digression on the possible existence of switching points is addressed. Finally, a numerical example is performed within a special class of derived optimal solutions, showing promising results in terms of equivalent stress reduction with respect to the most used variations in literature.
Collapse
|
45
|
An Efficient Algorithm to Count Tree-Like Graphs with a Given Number of Vertices and Self-Loops. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E923. [PMID: 33286692 PMCID: PMC7597174 DOI: 10.3390/e22090923] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 08/14/2020] [Accepted: 08/18/2020] [Indexed: 12/31/2022]
Abstract
Graph enumeration with given constraints is an interesting problem considered to be one of the fundamental problems in graph theory, with many applications in natural sciences and engineering such as bio-informatics and computational chemistry. For any two integers n≥1 and Δ≥0, we propose a method to count all non-isomorphic trees with n vertices, Δ self-loops, and no multi-edges based on dynamic programming. To achieve this goal, we count the number of non-isomorphic rooted trees with n vertices, Δ self-loops and no multi-edges, in O(n2(n+Δ(n+Δ·min{n,Δ}))) time and O(n2(Δ2+1)) space, since every tree can be uniquely viewed as a rooted tree by either regarding its unicentroid as the root, or in the case of bicentroid, by introducing a virtual vertex on the bicentroid and assuming the virtual vertex to be the root. By this result, we get a lower bound and an upper bound on the number of tree-like polymer topologies of chemical compounds with any "cycle rank".
Collapse
|
46
|
How 25(OH)D Levels during Pregnancy Affect Prevalence of Autism in Children: Systematic Review. Nutrients 2020; 12:nu12082311. [PMID: 32752078 PMCID: PMC7468823 DOI: 10.3390/nu12082311] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 07/25/2020] [Accepted: 07/27/2020] [Indexed: 12/27/2022] Open
Abstract
Autism spectrum disorder (ASD) is a group of dysfunctions in social interaction, communication, and behaviors. The etiology of ASD is not yet fully understood; however, it consists of the interaction between genetics and the environment. An increasing amount of evidence points to the possibility that gestational and early-childhood vitamin D deficiency may be involved in the etiology of some cases of ASD. Herein, we systematically review the literature for studies on vitamin D status during pregnancy and ASD outcomes. Forty-three studies in the PubMed and 124 studies in EMBASE databases were initially found. After screening, 26 were identified as candidate studies for inclusion. Finally, 14 articles met the inclusion criteria, which originated from nine countries. The studies included 10 original research studies and four review studies conducted between 2012 and 2020. The strength of evidence that vitamin D levels during pregnancy increase the risk of developing autism is very low. This is because the evidence relies exclusively on observational studies that did not equally consider all important confounders and that assessed the indirect relationship between vitamin D as a surrogate for sunlight exposure and autism risk. The findings of this systematic review are consistent with the hypothesis that low vitamin D levels might contribute to the development of autism. However, we must also recognize the possible confusion bias and therefore experimental studies with very large sample sizes, given incidence of autism, that allow us to detect blood levels in pregnant women would be helpful to clarify this point.
Collapse
|
47
|
Tailored optimal posttreatment surveillance for cancer recurrence. Biometrics 2020; 77:942-955. [PMID: 32712953 DOI: 10.1111/biom.13341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 06/19/2020] [Accepted: 07/13/2020] [Indexed: 11/27/2022]
Abstract
A substantial rise in the number of cancer survivors has led to urgent management questions regarding effective posttreatment imaging-based surveillance strategies for cancer recurrence. Current surveillance guidelines provided by a number of professional societies all warn against overly aggressive surveillance, especially for low-risk patients, but all fail to provide more specific directions to accommodate underlying heterogeneity of cancer recurrence. Therefore it is imperative to develop data-driven strategies that can tailor the surveillance schedules to recurrence risk in this era of stricter insurance regulations, provider shortages, and rising costs of health care. Due to a lack of statistical methods for optimizing surveillance scheduling in presence of competing risks, we propose a general approach that uses an intuitive loss function for optimization of early detection of recurrence before death. The proposed strategies can tailor to patient risks of recurrence, in terms of both intensity and amount of surveillance. Using general three-state Markov models, our method is flexible and includes earlier works as special cases. We illustrate our method in both simulation studies and an application to breast cancer surveillance.
Collapse
|
48
|
Fully Automated Segmentation of Bladder Sac and Measurement of Detrusor Wall Thickness from Transabdominal Ultrasound Images. SENSORS 2020; 20:s20154175. [PMID: 32727146 PMCID: PMC7436043 DOI: 10.3390/s20154175] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 07/18/2020] [Accepted: 07/23/2020] [Indexed: 11/24/2022]
Abstract
Ultrasound measurements of detrusor muscle thickness have been proposed as a diagnostic biomarker in patients with bladder overactivity and voiding dysfunction. In this study, we present an approach based on deep learning (DL) and dynamic programming (DP) to segment the bladder sac and measure the detrusor muscle thickness from transabdominal 2D B-mode ultrasound images. To assess the performance of our method, we compared the results of automated methods to the manually obtained reference bladder segmentations and wall thickness measurements of 80 images obtained from 11 volunteers. It takes less than a second to segment the bladder from a 2D B-mode image for the DL method. The average Dice index for the bladder segmentation is 0.93 ± 0.04 mm, and the average root-mean-square-error and standard deviation for wall thickness measurement are 0.7 ± 0.2 mm, which is comparable to the manual ground truth. The proposed fully automated and fast method could be a useful tool for segmentation and wall thickness measurement of the bladder from transabdominal B-mode images. The computation speed and accuracy of the proposed method will enable adaptive adjustment of the ultrasound focus point, and continuous assessment of the bladder wall during the filling and voiding process of the bladder.
Collapse
|
49
|
A novel pattern matching algorithm for genomic patterns related to protein motifs. J Bioinform Comput Biol 2020; 18:2050011. [PMID: 32336249 DOI: 10.1142/s0219720020500110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Background: Patterns on proteins and genomic sequences are vastly analyzed, extracted and collected in databases. Although protein patterns originate from genomic coding regions, very few works have directly or indirectly dealt with coding region patterns induced from protein patterns. Results: In this paper, we have defined a new genomic pattern structure suitable for representing induced patterns from proteins. The provided pattern structure, which is called "Consecutive Positions Scoring Matrix (CPSSM)", is a replacement for protein patterns and profiles in the genomic context. CPSSMs can be identified, discovered, and searched in genomes. Then, we have presented a novel pattern matching algorithm between the defined genomic pattern and genomic sequences based on dynamic programming. In addition, we have modified the provided algorithm to support intronic gaps and huge sequences. We have implemented and tested the provided algorithm on real data. The results on Saccharomyces cerevisiae's genome show 132% more true positives and no false negatives and the results on human genome show no false negatives and 10 times as many true positives as those in previous works. Conclusion: CPSSM and provided methods could be used for open reading frame detection and gene finding. The application is available with source codes to run and download at http://app.foroughmand.ir/cpssm/.
Collapse
|
50
|
A New Paradigm for Identifying Reconciliation-Scenario Altering Mutations Conferring Environmental Adaptation. J Comput Biol 2020; 27:1561-1580. [PMID: 32250165 DOI: 10.1089/cmb.2019.0472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss, and mobilization patterns of the gene within the genomes in which it disseminates. In this article, we formalize this microbiological goal as a new pattern-matching problem in the domain of gene tree and species tree reconciliation, denoted "Reconciliation-Scenario Altering Mutation (RSAM) Discovery." We propose an [Formula: see text] time algorithm to solve this new problem, where m and n are the number of vertices of the input gene tree and species tree, respectively, and k is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the k highest scoring reconciliation scenarios between the given gene tree and species tree, and then interrogates this hypergraph for subtrees matching a prespecified RSAM pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by [Formula: see text]. We implement the new algorithm as a tool, called RSAM-finder, and demonstrate its application to the identification of RSAMs in toxins and drug resistance elements across a data set spanning hundreds of species.
Collapse
|