1
|
Dias FHC, Cáceres M, Williams L, Mumey B, Tomescu AI. A safety framework for flow decomposition problems via integer linear programming. Bioinformatics 2023; 39:btad640. [PMID: 37862229 PMCID: PMC10628435 DOI: 10.1093/bioinformatics/btad640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 09/05/2023] [Accepted: 10/19/2023] [Indexed: 10/22/2023] Open
Abstract
MOTIVATION Many important problems in Bioinformatics (e.g. assembly or multiassembly) admit multiple solutions, while the final objective is to report only one. A common approach to deal with this uncertainty is finding "safe" partial solutions (e.g. contigs) which are common to all solutions. Previous research on safety has focused on polynomially time solvable problems, whereas many successful and natural models are NP-hard to solve, leaving a lack of "safety tools" for such problems. We propose the first method for computing all safe solutions for an NP-hard problem, "minimum flow decomposition" (MFD). We obtain our results by developing a "safety test" for paths based on a general integer linear programming (ILP) formulation. Moreover, we provide implementations with practical optimizations aimed to reduce the total ILP time, the most efficient of these being based on a recursive group-testing procedure. RESULTS Experimental results on transcriptome datasets show that all safe paths for MFDs correctly recover up to 90% of the full RNA transcripts, which is at least 25% more than previously known safe paths. Moreover, despite the NP-hardness of the problem, we can report all safe paths for 99.8% of the over 27 000 non-trivial graphs of this dataset in only 1.5 h. Our results suggest that, on perfect data, there is less ambiguity than thought in the notoriously hard RNA assembly problem. AVAILABILITY AND IMPLEMENTATION https://github.com/algbio/mfd-safety.
Collapse
Affiliation(s)
- Fernando H C Dias
- Department of Computer Science, University of Helsinki, Helsinki 00560, Finland
| | - Manuel Cáceres
- Department of Computer Science, University of Helsinki, Helsinki 00560, Finland
| | - Lucia Williams
- School of Computing, Montana State University, Bozeman, MT 59717, United States
| | - Brendan Mumey
- School of Computing, Montana State University, Bozeman, MT 59717, United States
| | - Alexandru I Tomescu
- Department of Computer Science, University of Helsinki, Helsinki 00560, Finland
| |
Collapse
|
2
|
Williams L, Tomescu AI, Mumey B. Flow Decomposition With Subpath Constraints. IEEE/ACM Trans Comput Biol Bioinform 2023; 20:360-370. [PMID: 35104222 DOI: 10.1109/tcbb.2022.3147697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Flow network decomposition is a natural model for problems where we are given a flow network arising from superimposing a set of weighted paths and would like to recover the underlying data, i.e., decompose the flow into the original paths and their weights. Thus, variations on flow decomposition are often used as subroutines in multiassembly problems such as RNA transcript assembly. In practice, we frequently have access to information beyond flow values in the form of subpaths, and many tools incorporate these heuristically. But despite acknowledging their utility in practice, previous work has not formally addressed the effect of subpath constraints on the accuracy of flow network decomposition approaches. We formalize the flow decomposition with subpath constraints problem, give the first algorithms for it, and study its usefulness for recovering ground truth decompositions. For finding a minimum decomposition, we propose both a heuristic and an FPT algorithm. Experiments on RNA transcript datasets show that for instances with larger solution path sets, the addition of subpath constraints finds 13% more ground truth solutions when minimal decompositions are found exactly, and 30% more ground truth solutions when minimal decompositions are found heuristically.
Collapse
|
3
|
Caceres M, Mumey B, Husic E, Rizzi R, Cairo M, Sahlin K, Tomescu AI. Safety in Multi-Assembly via Paths Appearing in All Path Covers of a DAG. IEEE/ACM Trans Comput Biol Bioinform 2022; 19:3673-3684. [PMID: 34847041 DOI: 10.1109/tcbb.2021.3131203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
A multi-assembly problem asks to reconstruct multiple genomic sequences from mixed reads sequenced from all of them. Standard formulations of such problems model a solution as a path cover in a directed acyclic graph, namely a set of paths that together cover all vertices of the graph. Since multi-assembly problems admit multiple solutions in practice, we consider an approach commonly used in standard genome assembly: output only partial solutions (contigs, or safe paths), that appear in all path cover solutions. We study constrained path covers, a restriction on the path cover solution that incorporate practical constraints arising in multi-assembly problems. We give efficient algorithms finding all maximal safe paths for constrained path covers. We compute the safe paths of splicing graphs constructed from transcript annotations of different species. Our algorithms run in less than 15 seconds per species and report RNA contigs that are over 99% precise and are up to 8 times longer than unitigs. Moreover, RNA contigs cover over 70% of the transcripts and their coding sequences in most cases. With their increased length to unitigs, high precision, and fast construction time, maximal safe paths can provide a better base set of sequences for transcript assembly programs.
Collapse
|
4
|
Abstract
Minimum flow decomposition (MFD) is an NP-hard problem asking to decompose a network flow into a minimum set of paths (together with associated weights). Variants of it are powerful models in multiassembly problems in Bioinformatics, such as RNA assembly. Owing to its hardness, practical multiassembly tools either use heuristics or solve simpler, polynomial time-solvable versions of the problem, which may yield solutions that are not minimal or do not perfectly decompose the flow. Here, we provide the first fast and exact solver for MFD on acyclic flow networks, based on Integer Linear Programming (ILP). Key to our approach is an encoding of all the exponentially many solution paths using only a quadratic number of variables. We also extend our ILP formulation to many practical variants, such as incorporating longer or paired-end reads, or minimizing flow errors. On both simulated and real-flow splicing graphs, our approach solves any instance in <13 seconds. We hope that our formulations can lie at the core of future practical RNA assembly tools. Our implementations are freely available on Github.
Collapse
Affiliation(s)
- Fernando H.C. Dias
- Department of Computer Science, University of Helsinki, Helsinki, Finland
| | - Lucia Williams
- School of Computing, Montana State University, Bozeman, Montana, USA
| | - Brendan Mumey
- School of Computing, Montana State University, Bozeman, Montana, USA
| | | |
Collapse
|
5
|
Li H, Hu X, Lovell JT, Grabowski PP, Mamidi S, Chen C, Amirebrahimi M, Kahanda I, Mumey B, Barry K, Kudrna D, Schmutz J, Lachowiec J, Lu C. Genetic dissection of natural variation in oilseed traits of camelina by whole-genome resequencing and QTL mapping. Plant Genome 2021; 14:e20110. [PMID: 34106529 DOI: 10.1002/tpg2.20110] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 04/29/2021] [Indexed: 06/12/2023]
Abstract
Camelina [Camelina sativa (L.) Crantz] is an oilseed crop in the Brassicaceae family that is currently being developed as a source of bioenergy and healthy fatty acids. To facilitate modern breeding efforts through marker-assisted selection and biotechnology, we evaluated genetic variation among a worldwide collection of 222 camelina accessions. We performed whole-genome resequencing to obtain single nucleotide polymorphism (SNP) markers and to analyze genomic diversity. We also conducted phenotypic field evaluations in two consecutive seasons for variations in key agronomic traits related to oilseed production such as seed size, oil content (OC), fatty acid composition, and flowering time. We determined the population structure of the camelina accessions using 161,301 SNPs. Further, we identified quantitative trait loci (QTL) and candidate genes controlling the above field-evaluated traits by genome-wide association studies (GWAS) complemented with linkage mapping using a recombinant inbred line (RIL) population. Characterization of the natural variation at the genome and phenotypic levels provides valuable resources to camelina genetic studies and crop improvement. The QTL and candidate genes should assist in breeding of advanced camelina varieties that can be integrated into the cropping systems for the production of high yield of oils of desired fatty acid composition.
Collapse
Affiliation(s)
- Huang Li
- Department of Plant Sciences and Plant Pathology, Montana State University, Bozeman, MT, 59717, USA
| | - Xiao Hu
- School of Computing, Montana State University, Bozeman, MT, 59717, USA
| | - John T Lovell
- Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 38508, USA
| | - Paul P Grabowski
- Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 38508, USA
| | - Sujan Mamidi
- Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 38508, USA
| | - Cindy Chen
- United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Mojgan Amirebrahimi
- United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Indika Kahanda
- School of Computing, Montana State University, Bozeman, MT, 59717, USA
| | - Brendan Mumey
- School of Computing, Montana State University, Bozeman, MT, 59717, USA
| | - Kerrie Barry
- United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - David Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Jeremy Schmutz
- Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL, 38508, USA
- United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Jennifer Lachowiec
- Department of Plant Sciences and Plant Pathology, Montana State University, Bozeman, MT, 59717, USA
| | - Chaofu Lu
- Department of Plant Sciences and Plant Pathology, Montana State University, Bozeman, MT, 59717, USA
| |
Collapse
|
6
|
Abstract
Recent work provides the first method to measure the relative fitness of genomic variants within a population that scales to large numbers of genomes. A key component of the computation involves finding maximal perfect haplotype blocks from a set of genomic samples for which SNPs (single-nucleotide polymorphisms) have been called. Often, owing to low read coverage and imperfect assemblies, some of the SNP calls can be missing from some of the samples. In this work, we consider the problem of finding maximal perfect haplotype blocks where some missing values may be present. Missing values are treated as wildcards, and the definition of maximal perfect haplotype blocks is extended in a natural way. We provide an output-linear time algorithm to identify all such blocks and demonstrate the algorithm on a large population SNP dataset. Our software is publicly available. Defined haplotype blocks to study SNP population data with missing values Developed a fast software tool to find these blocks Tested on a human chromosome 22 dataset of 5,008 samples and over one million SNPs
Collapse
Affiliation(s)
- Lucia Williams
- Gianforte School of Computing, Montana State University, Bozeman, MT 59717, USA.
| | - Brendan Mumey
- Gianforte School of Computing, Montana State University, Bozeman, MT 59717, USA
| |
Collapse
|
7
|
Williams L, Mumey B. Extending Maximal Perfect Haplotype Blocks to the Realm of Pangenomics. Algorithms for Computational Biology 2020. [PMCID: PMC7197059 DOI: 10.1007/978-3-030-42266-0_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Recent work provides the first method to measure the relative fitness of genomic variants within a population that scales to large numbers of genomes. A key component of the computation involves finding conserved haplotype blocks, which can be done in linear time. Here, we extend the notion of conserved haplotype blocks to pangenomes, which can store more complex variation than a single reference genome. We define a maximal perfect pangenome haplotype block and give a linear-time, suffix tree based approach to find all such blocks from a set of pangenome haplotypes. We demonstrate the method by applying it to a pangenome built from yeast strains.
Collapse
|
8
|
Cleary A, Ramaraj T, Kahanda I, Mudge J, Mumey B. Exploring Frequented Regions in Pan-Genomic Graphs. IEEE/ACM Trans Comput Biol Bioinform 2019; 16:1424-1435. [PMID: 30106690 DOI: 10.1109/tcbb.2018.2864564] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
We consider the problem of identifying regions within a pan-genome De Bruijn graph that are traversed by many sequence paths. We define such regions and the subpaths that traverse them as frequented regions (FRs). In this work, we formalize the FR problem and describe an efficient algorithm for finding FRs. Subsequently, we propose some applications of FRs based on machine-learning and pan-genome graph simplification. We demonstrate the effectiveness of these applications using data sets for the organisms Staphylococcus aureus (bacterium) and Saccharomyces cerevisiae (yeast). We corroborate the biological relevance of FRs such as identifying introgressions in yeast that aid in alcohol tolerance, and show that FRs are useful for classification of yeast strains by industrial use and visualizing pan-genomic space.
Collapse
|
9
|
Salinas D, Mumey B, June RK. Physiological dynamic compression regulates central energy metabolism in primary human chondrocytes. Biomech Model Mechanobiol 2019; 18:69-77. [PMID: 30097814 PMCID: PMC9851408 DOI: 10.1007/s10237-018-1068-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 08/01/2018] [Indexed: 01/21/2023]
Abstract
Chondrocytes use the pathways of central metabolism to synthesize molecular building blocks and energy for cartilage homeostasis. An interesting feature of the in vivo chondrocyte environment is the cyclical loading generated in various activities (e.g., walking). However, it is unknown whether central metabolism is altered by mechanical loading. We hypothesized that physiological dynamic compression alters central metabolism in chondrocytes to promote production of amino acid precursors for matrix synthesis. We measured the expression of central metabolites (e.g., glucose, its derivatives, and relevant co-factors) for primary human osteoarthritic chondrocytes in response to 0-30 minutes of compression. To analyze the data, we used principal components analysis and ANOVA-simultaneous components analysis, as well as metabolic flux analysis. Compression-induced metabolic responses consistent with our hypothesis. Additionally, these data show that chondrocyte samples from different patient donors exhibit different sensitivity to compression. Most importantly, we find that grade IV osteoarthritic chondrocytes are capable of synthesizing non-essential amino acids and precursors in response to mechanical loading. These results suggest that further advances in metabolic engineering of chondrocyte mechanotransduction may yield novel translational strategies for cartilage repair.
Collapse
Affiliation(s)
- Daniel Salinas
- Department of Computer Science, Montana State University, PO Box 173800, Bozeman, MT 59717-3800
| | - Brendan Mumey
- Department of Computer Science, Montana State University, PO Box 173800, Bozeman, MT 59717-3800
| | - Ronald K. June
- Department of Mechanical and Industrial Engineering, Montana State University, PO Box 173800, Bozeman, MT 59717-3800
| |
Collapse
|
10
|
Yaw S, Howard E, Mumey B, Wittie MP. Cooperative group provisioning with latency guarantees in multi-cloud deployments. SIGCOMM Comput Commun Rev 2015. [DOI: 10.1145/2805789.2805791] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Given a set of datacenters and groups of application clients, well-connected datacenters can be rented as traffic proxies to reduce client latency. Rental costs must be minimized while meeting the application specific latency needs. Here, we formally define the Cooperative Group Provisioning problem and show it is NP-hard to approximate within a constant factor. We introduce a novel greedy approach and demonstrate its promise through extensive simulation using real cloud network topology measurements and realistic client churn. We find that multi-cloud deployments dramatically increase the likelihood of meeting group latency thresholds with minimal cost increase compared to single-cloud deployments.
Collapse
Affiliation(s)
- Sean Yaw
- Montana State University, Bozeman, MT, USA
| | | | | | | |
Collapse
|
11
|
Ramaraj T, Angel T, Dratz EA, Jesaitis AJ, Mumey B. Antigen-antibody interface properties: composition, residue interactions, and features of 53 non-redundant structures. Biochim Biophys Acta 2012; 1824:520-32. [PMID: 22246133 DOI: 10.1016/j.bbapap.2011.12.007] [Citation(s) in RCA: 115] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2011] [Revised: 12/22/2011] [Accepted: 12/23/2011] [Indexed: 11/17/2022]
Abstract
The structures of protein antigen-antibody (Ag-Ab) interfaces contain information about how Ab recognize Ag as well as how Ag are folded to present surfaces for Ag recognition. As such, the Ab surface holds information about Ag folding that resides with the Ab-Ag interface residues and how they interact. In order to gain insight into the nature of such interactions, a data set comprised of 53 non-redundant 3D structures of Ag-Ab complexes was analyzed. We assessed the physical and biochemical features of the Ag-Ab interfaces and the degree to which favored interactions exist between amino acid residues on the corresponding interface surfaces. Amino acid compositional analysis of the interfaces confirmed the dominance of TYR in the Ab paratope-containing surface (PCS), with almost two fold greater abundance than any other residue. Additionally TYR had a much higher than expected presence in the PCS compared to the surface of the whole antibody (defined as the occurrence propensity), along with aromatics PHE, TRP, and to a lesser degree HIS and ILE. In the Ag epitope-containing surface (ECS), there were slightly increased occurrence propensities of TRP and TYR relative to the whole Ag surface, implying an increased significance over the compositionally most abundant LYS>ASN>GLU>ASP>ARG. This examination encompasses a large, diverse set of unique Ag-Ab crystal structures that help explain the biological range and specificity of Ag-Ab interactions. This analysis may also provide a measure of the significance of individual amino acid residues in phage display analysis of Ag binding.
Collapse
|
12
|
Greenbaum JA, Andersen PH, Blythe M, Bui HH, Cachau RE, Crowe J, Davies M, Kolaskar AS, Lund O, Morrison S, Mumey B, Ofran Y, Pellequer JL, Pinilla C, Ponomarenko JV, Raghava GPS, van Regenmortel MHV, Roggen EL, Sette A, Schlessinger A, Sollner J, Zand M, Peters B. Towards a consensus on datasets and evaluation metrics for developing B-cell epitope prediction tools. J Mol Recognit 2007; 20:75-82. [PMID: 17205610 DOI: 10.1002/jmr.815] [Citation(s) in RCA: 163] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A B-cell epitope is the three-dimensional structure within an antigen that can be bound to the variable region of an antibody. The prediction of B-cell epitopes is highly desirable for various immunological applications, but has presented a set of unique challenges to the bioinformatics and immunology communities. Improving the accuracy of B-cell epitope prediction methods depends on a community consensus on the data and metrics utilized to develop and evaluate such tools. A workshop, sponsored by the National Institute of Allergy and Infectious Disease (NIAID), was recently held in Washington, DC to discuss the current state of the B-cell epitope prediction field. Many of the currently available tools were surveyed and a set of recommendations was devised to facilitate improvements in the currently existing tools and to expedite future tool development. An underlying theme of the recommendations put forth by the panel is increased collaboration among research groups. By developing common datasets, standardized data formats, and the means with which to consolidate information, we hope to greatly enhance the development of B-cell epitope prediction tools.
Collapse
Affiliation(s)
- Jason A Greenbaum
- Immune Epitope Database and Analysis Resource (IEDB), La Jolla Institute for Allergy and Immunology, La Jolla, California, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Riesselman M, Miettinen HM, Gripentrog JM, Lord CI, Mumey B, Dratz EA, Stie J, Taylor RM, Jesaitis AJ. C-Terminal Tail Phosphorylation of N-Formyl Peptide Receptor: Differential Recognition of Two Neutrophil Chemoattractant Receptors by Monoclonal Antibodies NFPR1 and NFPR2. J Immunol 2007; 179:2520-31. [PMID: 17675514 DOI: 10.4049/jimmunol.179.4.2520] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The N-formyl peptide receptor (FPR), a G protein-coupled receptor that binds proinflammatory chemoattractant peptides, serves as a model receptor for leukocyte chemotaxis. Recombinant histidine-tagged FPR (rHis-FPR) was purified in lysophosphatidyl glycerol (LPG) by Ni(2+)-NTA agarose chromatography to >95% purity with high yield. MALDI-TOF mass analysis (>36% sequence coverage) and immunoblotting confirmed the identity as FPR. The rHis-FPR served as an immunogen for the production of 2 mAbs, NFPR1 and NFPR2, that epitope map to the FPR C-terminal tail sequences, 305-GQDFRERLI-313 and 337-NSTLPSAEVE-346, respectively. Both mAbs specifically immunoblotted rHis-FPR and recombinant FPR (rFPR) expressed in Chinese hamster ovary cells. NFPR1 also recognized recombinant FPRL1, specifically expressed in mouse L fibroblasts. In human neutrophil membranes, both Abs labeled a 45-75 kDa species (peak M(r) approximately 60 kDa) localized primarily in the plasma membrane with a minor component in the lactoferrin-enriched intracellular fractions, consistent with FPR size and localization. NFPR1 also recognized a band of M(r) approximately 40 kDa localized, in equal proportions to the plasma membrane and lactoferrin-enriched fractions, consistent with FPRL1 size and localization. Only NFPR2 was capable of immunoprecipitation of rFPR in detergent extracts. The recognition of rFPR by NFPR2 is lost after exposure of cellular rFPR to f-Met-Leu-Phe (fMLF) and regained after alkaline phosphatase treatment of rFPR-bearing membranes. In neutrophils, NFPR2 immunofluorescence was lost upon fMLF stimulation. Immunoblotting approximately 60 kDa species, after phosphatase treatment of fMLF-stimulated neutrophil membranes, was also enhanced. We conclude that the region 337-346 of FPR becomes phosphorylated after fMLF activation of rFPR-expressing Chinese hamster ovary cells and neutrophils.
Collapse
MESH Headings
- Animals
- Antibodies, Monoclonal/chemistry
- Antibodies, Monoclonal/immunology
- CHO Cells
- Cell Membrane/chemistry
- Cell Membrane/genetics
- Cell Membrane/immunology
- Cell Membrane/metabolism
- Chemotaxis/drug effects
- Chemotaxis/genetics
- Chemotaxis/immunology
- Chromatography, Affinity
- Cricetinae
- Cricetulus
- Epitope Mapping
- Epitopes/chemistry
- Epitopes/genetics
- Epitopes/immunology
- Fibroblasts/immunology
- Fibroblasts/metabolism
- Gene Expression
- Humans
- Lactoferrin/chemistry
- Lactoferrin/genetics
- Lactoferrin/immunology
- Lactoferrin/metabolism
- Lysophospholipids/chemistry
- Mice
- Models, Immunological
- N-Formylmethionine Leucyl-Phenylalanine/analogs & derivatives
- N-Formylmethionine Leucyl-Phenylalanine/chemistry
- N-Formylmethionine Leucyl-Phenylalanine/immunology
- N-Formylmethionine Leucyl-Phenylalanine/metabolism
- N-Formylmethionine Leucyl-Phenylalanine/pharmacology
- Neutrophils/chemistry
- Neutrophils/immunology
- Neutrophils/metabolism
- Phosphorylation/drug effects
- Protein Processing, Post-Translational/drug effects
- Protein Processing, Post-Translational/genetics
- Protein Processing, Post-Translational/immunology
- Protein Structure, Tertiary/genetics
- Receptors, Formyl Peptide/chemistry
- Receptors, Formyl Peptide/genetics
- Receptors, Formyl Peptide/immunology
- Receptors, Formyl Peptide/isolation & purification
- Receptors, Formyl Peptide/metabolism
- Recombinant Fusion Proteins/chemistry
- Recombinant Fusion Proteins/genetics
- Recombinant Fusion Proteins/immunology
- Recombinant Fusion Proteins/metabolism
- Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization
- Spodoptera
Collapse
Affiliation(s)
- Marcia Riesselman
- Department of Microbiology, Montana State University, Bozeman, Montana 59717, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Mumey B, Ohler N, Angel T, Jesaitis A, Dratz E. Filtering Epitope Alignments to Improve Protein Surface Prediction. Frontiers of High Performance Computing and Networking – ISPA 2006 Workshops 2006. [DOI: 10.1007/11942634_67] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
15
|
Bailey BW, Mumey B, Hargrave PA, Arendt A, Ernst OP, Hofmann KP, Callis PR, Burritt JB, Jesaitis AJ, Dratz EA. Constraints on the conformation of the cytoplasmic face of dark-adapted and light-excited rhodopsin inferred from antirhodopsin antibody imprints. Protein Sci 2004; 12:2453-75. [PMID: 14573859 PMCID: PMC2366960 DOI: 10.1110/ps.03233703] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Rhodopsin is the best-understood member of the large G protein-coupled receptor (GPCR) superfamily. The G-protein amplification cascade is triggered by poorly understood light-induced conformational changes in rhodopsin that are homologous to changes caused by agonists in other GPCRs. We have applied the "antibody imprint" method to light-activated rhodopsin in native membranes by using nine monoclonal antibodies (mAbs) against aqueous faces of rhodopsin. Epitopes recognized by these mAbs were found by selection from random peptide libraries displayed on phage. A new computer algorithm, FINDMAP, was used to map the epitopes to discontinuous segments of rhodopsin that are distant in the primary sequence but are in close spatial proximity in the structure. The proximity of a segment of the N-terminal and the loop between helices VI and VIII found by FINDMAP is consistent with the X-ray structure of the dark-adapted rhodopsin. Epitopes to the cytoplasmic face segregated into two classes with different predicted spatial proximities of protein segments that correlate with different preferences of the antibodies for stabilizing the metarhodopsin I or metarhodopsin II conformations of light-excited rhodopsin. Epitopes of antibodies that stabilize metarhodopsin II indicate conformational changes from dark-adapted rhodopsin, including rearrangements of the C-terminal tail and altered exposure of the cytoplasmic end of helix VI, a portion of the C-3 loop, and helix VIII. As additional antibodies are subjected to antibody imprinting, this approach should provide increasingly detailed information on the conformation of light-excited rhodopsin and be applicable to structural studies of other challenging protein targets.
Collapse
Affiliation(s)
- Brian W Bailey
- Department of Chemistry and Biochemistry, Montana State University, Bozeman, Montana 59717-3520, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
|
17
|
Mumey B. A fast heuristic algorithm for a probe mapping problem. Proc Int Conf Intell Syst Mol Biol 1997; 5:191-7. [PMID: 9322035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
A new heuristic algorithm is presented for mapping probes to locations along the genome, given noisy pairwise distance data as input. The model considered is quite general: The input consists of a collection of probe pairs and a confidence interval for the genomic distance separating each pair. Because the distance intervals are only known with some confidence level, some may be erroneous and must be removed in order to find a consistent map. A novel randomized technique for detecting and removing bad distance intervals is described. The technique could be useful in other contexts where partially erroneous data is inconsistent with the remaining data. These algorithms were motivated by the goal of making probe maps with inter-probe distance confidence intervals estimated from fluorescence in-situ hybridization (FISH) experiments. Experimentation was done on synthetic data sets (with and without errors) and FISH data from a region of human chromosome 4. Problems with up to 100 probes could be solved in several minutes on a fast workstation. In addition to FISH mapping, we describe some other possible applications that fall within the problem model. These include: mapping a backbone structure in folded DNA, finding consensus maps between independent maps covering the same genomic region, and ordering clones in a clone library.
Collapse
Affiliation(s)
- B Mumey
- Department of Computer Science, University of Washington, Seattle 98195-2350, USA.
| |
Collapse
|