Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ekmekci B, McAnany CE, Mura C. An Introduction to Programming for Bioscientists: A Python-Based Primer. PLoS Comput Biol 2016;12:e1004867. [PMID: 27271528 PMCID: PMC4896647 DOI: 10.1371/journal.pcbi.1004867] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

For:	Ekmekci B, McAnany CE, Mura C. An Introduction to Programming for Bioscientists: A Python-Based Primer. PLoS Comput Biol 2016;12:e1004867. [PMID: 27271528 PMCID: PMC4896647 DOI: 10.1371/journal.pcbi.1004867] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Number

Cited by Other Article(s)

Ryzhkov FV, Ryzhkova YE, Elinson MN. Machine learning: Python tools for studying biomolecules and drug design. Mol Divers 2025:10.1007/s11030-025-11199-2. [PMID: 40301135 DOI: 10.1007/s11030-025-11199-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2025] [Accepted: 04/13/2025] [Indexed: 05/01/2025]

Pickard J, Sturgess VE, McDonald KO, Rossiter N, Arnold KB, Shah YM, Rajapakse I, Beard DA. A Hands-On Introduction to Data Analytics for Biomedical Research. FUNCTION 2025;6:zqaf015. [PMID: 40199731 PMCID: PMC11999024 DOI: 10.1093/function/zqaf015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2024] [Revised: 03/07/2025] [Accepted: 03/12/2025] [Indexed: 04/10/2025] Open

Forero DA, Bonilla DA, González-Giraldo Y, Patrinos GP. An overview of key online resources for human genomics: a powerful and open toolbox for in silico research. Brief Funct Genomics 2024;23:754-764. [PMID: 38993146 DOI: 10.1093/bfgp/elae029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 06/19/2024] [Accepted: 06/25/2024] [Indexed: 07/13/2024] Open

Sengupta P, Dutta S, Liew F, Samrot A, Dasgupta S, Rajput MA, Slama P, Kolesarova A, Roychoudhury S. Reproductomics: Exploring the Applications and Advancements of Computational Tools. Physiol Res 2024;73:687-702. [PMID: 39530905 PMCID: PMC11629954 DOI: 10.33549/physiolres.935389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 06/25/2024] [Indexed: 12/13/2024] Open

Abstract

Over recent decades, advancements in omics technologies, such as proteomics, genomics, epigenomics, metabolomics, transcriptomics, and microbiomics, have significantly enhanced our understanding of the molecular mechanisms underlying various physiological and pathological processes. Nonetheless, the analysis and interpretation of vast omics data concerning reproductive diseases are complicated by the cyclic regulation of hormones and multiple other factors, which, in conjunction with a genetic makeup of an individual, lead to diverse biological responses. Reproductomics investigates the interplay between a hormonal regulation of an individual, environmental factors, genetic predisposition (DNA composition and epigenome), health effects, and resulting biological outcomes. It is a rapidly emerging field that utilizes computational tools to analyze and interpret reproductive data, with the aim of improving reproductive health outcomes. It is time to explore the applications of reproductomics in understanding the molecular mechanisms underlying infertility, identification of potential biomarkers for diagnosis and treatment, and in improving assisted reproductive technologies (ARTs). Reproductomics tools include machine learning algorithms for predicting fertility outcomes, gene editing technologies for correcting genetic abnormalities, and single cell sequencing techniques for analyzing gene expression patterns at the individual cell level. However, there are several challenges, limitations and ethical issues involved with the use of reproductomics, such as the applications of gene editing technologies and their potential impact on future generations are discussed. The review comprehensively covers the applications and advancements of reproductomics, highlighting its potential to improve reproductive health outcomes and deepen our understanding of reproductive molecular mechanisms.

Collapse

Spahiu E, Kastrati E, Amrute-Nayak M. PyChelator: a Python-based Colab and web application for metal chelator calculations. BMC Bioinformatics 2024;25:239. [PMID: 39014298 PMCID: PMC11253343 DOI: 10.1186/s12859-024-05858-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 07/09/2024] [Indexed: 07/18/2024] Open

Abstract

BACKGROUND

Metal ions play vital roles in regulating various biological systems, making it essential to control the concentration of free metal ions in solutions during experimental procedures. Several software applications exist for estimating the concentration of free metals in the presence of chelators, with MaxChelator being the easily accessible choice in this domain. This work aimed at developing a Python version of the software with arbitrary precision calculations, extensive new features, and a user-friendly interface to calculate the free metal ions.

RESULTS

We introduce the open-source PyChelator web application and the Python-based Google Colaboratory notebook, PyChelator Colab. Key features aim to improve the user experience of metal chelator calculations including input in smaller units, selection among stability constants, input of user-defined constants, and convenient download of all results in Excel format. These features were implemented in Python language by employing Google Colab, facilitating the incorporation of the calculator into other Python-based pipelines and inviting the contributions from the community of Python-using scientists for further enhancements. Arbitrary-precision arithmetic was employed by using the built-in Decimal module to obtain the most accurate results and to avoid rounding errors. No notable differences were observed compared to the results obtained from the PyChelator web application. However, comparison of different sources of stability constants showed substantial differences among them.

CONCLUSIONS

PyChelator is a user-friendly metal and chelator calculator that provides a platform for further development. It is provided as an interactive web application, freely available for use at https://amrutelab.github.io/PyChelator , and as a Python-based Google Colaboratory notebook at https://colab.

RESEARCH

google.com/github/AmruteLab/PyChelator/blob/main/PyChelator_Colab.ipynb .

Collapse

Koniaris D, Suciu C, Nica S. Flight to Recovery: Impact of a Rooftop Helipad Air Ambulance Service at the Emergency University Hospital of Bucharest-A Caseload Analysis of the First 3 Years After Its Implementation. Air Med J 2024;43:321-327. [PMID: 38897695 DOI: 10.1016/j.amj.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 03/03/2024] [Accepted: 03/07/2024] [Indexed: 06/21/2024]

Abstract

OBJECTIVE

This observational study provides an overview of the implementation and impact of the helipad at the Bucharest Emergency University Hospital, Romania. The helipad, established in April 2019, is the only rooftop medical helipad in Bucharest authorized for day and night flights. Its influence extends beyond the local region, enabling the hospital to receive patients from various cities across Romania. The helipad has particularly strengthened the hospital's capabilities in cardiology, neurovascular emergencies, and neonatal care. Patients with acute myocardial infarctions or strokes can now be swiftly transported to the hospital for immediate intervention, whereas critically ill newborns can receive specialized care at the earliest stages of their lives. The objective of this article was to present a comprehensive timeline of the helipad's implementation and to demonstrate its transformative role in improving patient transportation, enhancing medical interventions, and elevating the overall efficiency of the health care facility.

METHODS

The study is a retrospective regional caseload analysis based on data gathered from the Emergency Department of the University Emergency Hospital of Bucharest database. We included all 215 air transfer missions registered between December 2019 and December 2022, exactly 3 years apart from the beginning of the program.

RESULTS

The findings provide valuable insights into patient demographics, case distribution, and trends, highlighting the importance of specialized medical interventions at the University Emergency Hospital of Bucharest. In particular, the mean age of patients treated at the hospital was 55.9 years, with a higher representation of males (156) than females (59). The average duration of hospitalization was 10.68 days. The study also examined transportation statistics, showing a decrease in the average number of transports per month over the years. Cardiologic cases accounted for the highest frequency (62.8%) among the analyzed categories followed by neurosurgery (8.8%) and neurologic cases (8.4%).

CONCLUSION

The analysis provides important insights into patient demographics, case distribution, and trends. The findings highlight the significance of specialized medical interventions, particularly in cardiology and neurosurgery, which accounted for the majority of the cases. The implementation of the helipad has greatly improved patient transportation and facilitated timely medical assistance.

Collapse

Rahman CR, Wong L. How much can ChatGPT really help computational biologists in programming? J Bioinform Comput Biol 2024;22:2471001. [PMID: 38779779 DOI: 10.1142/s021972002471001x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]

Zhang S, Li H, Jing Q, Shen W, Luo W, Dai R. Anesthesia decision analysis using a cloud-based big data platform. Eur J Med Res 2024;29:201. [PMID: 38528564 DOI: 10.1186/s40001-024-01764-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Accepted: 03/01/2024] [Indexed: 03/27/2024] Open

Maurer JJ, Cheng Y, Pedroso A, Thompson KK, Akter S, Kwan T, Morota G, Kinstler S, Porwollik S, McClelland M, Escalante-Semerena JC, Lee MD. Peeling back the many layers of competitive exclusion. Front Microbiol 2024;15:1342887. [PMID: 38591029 PMCID: PMC11000858 DOI: 10.3389/fmicb.2024.1342887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/19/2024] [Indexed: 04/10/2024] Open

Abstract

Baby chicks administered a fecal transplant from adult chickens are resistant to Salmonella colonization by competitive exclusion. A two-pronged approach was used to investigate the mechanism of this process. First, Salmonella response to an exclusive (Salmonella competitive exclusion product, Aviguard®) or permissive microbial community (chicken cecal contents from colonized birds containing 7.85 Log10Salmonella genomes/gram) was assessed ex vivo using a S. typhimurium reporter strain with fluorescent YFP and CFP gene fusions to rrn and hilA operon, respectively. Second, cecal transcriptome analysis was used to assess the cecal communities' response to Salmonella in chickens with low (≤5.85 Log10 genomes/g) or high (≥6.00 Log10 genomes/g) Salmonella colonization. The ex vivo experiment revealed a reduction in Salmonella growth and hilA expression following co-culture with the exclusive community. The exclusive community also repressed Salmonella's SPI-1 virulence genes and LPS modification, while the anti-virulence/inflammatory gene avrA was upregulated. Salmonella transcriptome analysis revealed significant metabolic disparities in Salmonella grown with the two different communities. Propanediol utilization and vitamin B12 synthesis were central to Salmonella metabolism co-cultured with either community, and mutations in propanediol and vitamin B12 metabolism altered Salmonella growth in the exclusive community. There were significant differences in the cecal community's stress response to Salmonella colonization. Cecal community transcripts indicated that antimicrobials were central to the type of stress response detected in the low Salmonella abundance community, suggesting antagonism involved in Salmonella exclusion. This study indicates complex community interactions that modulate Salmonella metabolism and pathogenic behavior and reduce growth through antagonism may be key to exclusion.

Collapse

Mullie L, Afilalo J, Archambault P, Bouchakri R, Brown K, Buckeridge DL, Cavayas YA, Turgeon AF, Martineau D, Lamontagne F, Lebrasseur M, Lemieux R, Li J, Sauthier M, St-Onge P, Tang A, Witteman W, Chassé M. CODA: an open-source platform for federated analysis and machine learning on distributed healthcare data. J Am Med Inform Assoc 2024;31:651-665. [PMID: 38128123 PMCID: PMC10873779 DOI: 10.1093/jamia/ocad235] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 10/28/2023] [Accepted: 12/02/2023] [Indexed: 12/23/2023] Open

Abstract

OBJECTIVES

Distributed computations facilitate multi-institutional data analysis while avoiding the costs and complexity of data pooling. Existing approaches lack crucial features, such as built-in medical standards and terminologies, no-code data visualizations, explicit disclosure control mechanisms, and support for basic statistical computations, in addition to gradient-based optimization capabilities.

MATERIALS AND METHODS

We describe the development of the Collaborative Data Analysis (CODA) platform, and the design choices undertaken to address the key needs identified during our survey of stakeholders. We use a public dataset (MIMIC-IV) to demonstrate end-to-end multi-modal FL using CODA. We assessed the technical feasibility of deploying the CODA platform at 9 hospitals in Canada, describe implementation challenges, and evaluate its scalability on large patient populations.

RESULTS

The CODA platform was designed, developed, and deployed between January 2020 and January 2023. Software code, documentation, and technical documents were released under an open-source license. Multi-modal federated averaging is illustrated using the MIMIC-IV and MIMIC-CXR datasets. To date, 8 out of the 9 participating sites have successfully deployed the platform, with a total enrolment of >1M patients. Mapping data from legacy systems to FHIR was the biggest barrier to implementation.

DISCUSSION AND CONCLUSION

The CODA platform was developed and successfully deployed in a public healthcare setting in Canada, with heterogeneous information technology systems and capabilities. Ongoing efforts will use the platform to develop and prospectively validate models for risk assessment, proactive monitoring, and resource usage. Further work will also make tools available to facilitate migration from legacy formats to FHIR and DICOM.

Collapse

Affiliation(s)

Louis Mullie Department of Medicine, Centre Hospitalier de l'Université de Montréal, Montréal, H2X 3E4, Canada Faculty of Medicine, Université de Montréal, Montréal, H3C 3J7, Canada Mila Quebec Artificial Intelligence Institute, Montréal, H2S 3H1, Canada
Jonathan Afilalo Department of Medicine, Jewish General Hospital, Montréal, H3T 1E4, Canada
Patrick Archambault Department of Emergency Medicine and Family Medicine, Université Laval, Québec, G1V 0A6, Canada Department of Anesthesiology and Critical Care Medicine, Université Laval, Québec, G1V 0A6, Canada Centre de Recherche Intégré pour un Système Apprenant en santé et Services Sociaux, Centre intégré de santé et de Services Sociaux de Chaudière-Appalaches, Lévis, G6V 3Z1, Canada
Rima Bouchakri Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Université de Montréal, Montréal, H2X 0A9, Canada
Kip Brown Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Université de Montréal, Montréal, H2X 0A9, Canada
David L Buckeridge Mila Quebec Artificial Intelligence Institute, Montréal, H2S 3H1, Canada Department of Epidemiology and Biostatistics, School of Population and Global Health, McGill University Health Centre, Montréal, H3A 1G1, Canada
Yiorgos Alexandros Cavayas Department of Medicine, Hôpital du Sacré-Coeur de Montréal, Montréal, H4J 1C5, Canada
Alexis F Turgeon Department of Anesthesiology and Critical Care Medicine, Université Laval, Québec, G1V 0A6, Canada Centre de recherche du CHU de Québec-Université Laval, Université Laval, Québec, G1V 4G2, Canada
Denis Martineau Centre de recherche du CHU de Québec-Université Laval, Université Laval, Québec, G1V 4G2, Canada
François Lamontagne Centre de recherche du CHUS, Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, J1G 2E8, Canada
Martine Lebrasseur Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Université de Montréal, Montréal, H2X 0A9, Canada
Renald Lemieux Centre de recherche du CHUS, Centre Hospitalier Universitaire de Sherbrooke, Sherbrooke, J1G 2E8, Canada
Jeffrey Li Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Université de Montréal, Montréal, H2X 0A9, Canada
Michaël Sauthier Faculty of Medicine, Université de Montréal, Montréal, H3C 3J7, Canada Department of Pediatrics, Université de Montréal and CHU Sainte-Justine Research Centre, Montréal, H3C 3J7, Canada
Pascal St-Onge Centre de Recherche du Centre Hospitalier de l'Université de Montréal, Université de Montréal, Montréal, H2X 0A9, Canada
An Tang Faculty of Medicine, Université de Montréal, Montréal, H3C 3J7, Canada Department of Radiology, Centre Hospitalier de l’Université de Montréal, Montréal, H2X 3E4, Canada
William Witteman Centre de Recherche Intégré pour un Système Apprenant en santé et Services Sociaux, Centre intégré de santé et de Services Sociaux de Chaudière-Appalaches, Lévis, G6V 3Z1, Canada
Michaël Chassé Department of Medicine, Centre Hospitalier de l'Université de Montréal, Montréal, H2X 3E4, Canada Faculty of Medicine, Université de Montréal, Montréal, H3C 3J7, Canada

Collapse

Piccolo SR, Denny P, Luxton-Reilly A, Payne SH, Ridge PG. Evaluating a large language model's ability to solve programming exercises from an introductory bioinformatics course. PLoS Comput Biol 2023;19:e1011511. [PMID: 37769024 PMCID: PMC10564134 DOI: 10.1371/journal.pcbi.1011511] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 10/10/2023] [Accepted: 09/12/2023] [Indexed: 09/30/2023] Open

Adenaike O, Olabanjo OE, Adedeji AA. Integrating computational skills in undergraduate Microbiology curricula in developing countries. Biol Methods Protoc 2023;8:bpad008. [PMID: 37396465 PMCID: PMC10310463 DOI: 10.1093/biomethods/bpad008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 05/19/2023] [Accepted: 05/21/2023] [Indexed: 07/04/2023] Open

Zhang P, Wang M, Zhou T, Chen D. SeqWiz: a modularized toolkit for next-generation protein sequence database management and analysis. BMC Bioinformatics 2023;24:201. [PMID: 37194023 DOI: 10.1186/s12859-023-05334-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 05/11/2023] [Indexed: 05/18/2023] Open

Abstract

BACKGROUND

Current proteomic technologies are fast-evolving to uncover the complex features of sequence processes, variations and modifications. Thus, protein sequence database and the corresponding softwares should also be improved to solve this issue.

RESULTS

We developed a state-of-the-art toolkit (SeqWiz) for constructing next-generation sequence databases and performing proteomic-centric sequence analyses. First, we proposed two derived data formats: SQPD (a well-structured and high-performance local sequence database based on SQLite), and SET (an associated list of selected entries based on JSON). The SQPD format follows the basic standards of the emerging PEFF format, which also aims to facilitate the search of complex proteoform. The SET format is designed for generating subsets with with high-efficiency. These formats are shown to greatly outperform the conventional FASTA or PEFF formats in time and resource consumption. Then, we mainly focused on the UniProt knowledgebase and developed a collection of open-source tools and basic modules for retrieving species-specific databases, formats conversion, sequence generation, sequence filter, and sequence analysis. These tools are implemented by using the Python language and licensed under the GNU General Public Licence V3. The source codes and distributions are freely available at GitHub ( https://github.com/fountao/protwiz/tree/main/seqwiz ).

CONCLUSIONS

SeqWiz is designed to be a collection of modularized tools, which is friendly to both end-users for preparing easy-to-use sequence databases as well as bioinformaticians for performing downstream sequence analysis. Besides the novel formats, it also provides compatible functions for handling the traditional text based FASTA or PEFF formats. We believe that SeqWiz will promote the implementing of complementary proteomics for data renewal and proteoform analysis to achieve precision proteomics. Additionally, it can also drive the improvement of proteomic standardization and the development of next-generation proteomic softwares.

Collapse

Roesch E, Greener JG, MacLean AL, Nassar H, Rackauckas C, Holy TE, Stumpf MPH. Julia for biologists. Nat Methods 2023;20:655-664. [PMID: 37024649 PMCID: PMC10216852 DOI: 10.1038/s41592-023-01832-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 02/27/2023] [Indexed: 04/08/2023]

Rather MA, Agarwal D, Bhat TA, Khan IA, Zafar I, Kumar S, Amin A, Sundaray JK, Qadri T. Bioinformatics approaches and big data analytics opportunities in improving fisheries and aquaculture. Int J Biol Macromol 2023;233:123549. [PMID: 36740117 DOI: 10.1016/j.ijbiomac.2023.123549] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 01/30/2023] [Accepted: 01/31/2023] [Indexed: 02/05/2023]

Prediction and Modeling of Protein–Protein Interactions Using “Spotted” Peptides with a Template-Based Approach. Biomolecules 2022;12:biom12020201. [PMID: 35204702 PMCID: PMC8961654 DOI: 10.3390/biom12020201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 01/20/2022] [Accepted: 01/22/2022] [Indexed: 12/10/2022] Open

Prasai R, Schwertner TW, Mainali K, Mathewson H, Kafley H, Thapa S, Adhikari D, Medley P, Drake J. Application of Google earth engine python API and NAIP imagery for land use and land cover classification: A case study in Florida, USA. ECOL INFORM 2021. [DOI: 10.1016/j.ecoinf.2021.101474] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zuvanov L, Basso Garcia AL, Correr FH, Bizarria R, Filho APDC, da Costa AH, Thomaz AT, Pinheiro ALM, Riaño-Pachón DM, Winck FV, Esteves FG, Margarido GRA, Casagrande GMS, Frajacomo HC, Martins L, Cavalheiro MF, Grachet NG, da Silva RGC, Cerri R, Ramos RTJ, de Medeiros SDS, Tavares TV, Corrêa dos Santos RA. The experience of teaching introductory programming skills to bioscientists in Brazil. PLoS Comput Biol 2021;17:e1009534. [PMID: 34762646 PMCID: PMC8584955 DOI: 10.1371/journal.pcbi.1009534] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Abstract

Computational biology has gained traction as an independent scientific discipline over the last years in South America. However, there is still a growing need for bioscientists, from different backgrounds, with different levels, to acquire programming skills, which could reduce the time from data to insights and bridge communication between life scientists and computer scientists. Python is a programming language extensively used in bioinformatics and data science, which is particularly suitable for beginners. Here, we describe the conception, organization, and implementation of the Brazilian Python Workshop for Biological Data. This workshop has been organized by graduate and undergraduate students and supported, mostly in administrative matters, by experienced faculty members since 2017. The workshop was conceived for teaching bioscientists, mainly students in Brazil, on how to program in a biological context. The goal of this article was to share our experience with the 2020 edition of the workshop in its virtual format due to the Coronavirus Disease 2019 (COVID-19) pandemic and to compare and contrast this year's experience with the previous in-person editions. We described a hands-on and live coding workshop model for teaching introductory Python programming. We also highlighted the adaptations made from in-person to online format in 2020, the participants' assessment of learning progression, and general workshop management. Lastly, we provided a summary and reflections from our personal experiences from the workshops of the last 4 years. Our takeaways included the benefits of the learning from learners' feedback (LLF) that allowed us to improve the workshop in real time, in the short, and likely in the long term. We concluded that the Brazilian Python Workshop for Biological Data is a highly effective workshop model for teaching a programming language that allows bioscientists to go beyond an initial exploration of programming skills for data analysis in the medium to long term.

Collapse

Affiliation(s)

Luíza Zuvanov São Carlos Institute of Physics, University of São Paulo, São Carlos, Brazil
Ana Letycia Basso Garcia Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
Fernando Henrique Correr Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
Rodolfo Bizarria Department of General and Applied Biology, São Paulo State University, Rio Claro, Brazil Center of the Study of Social Insects, Department of General and Applied Biology, Institute of Biosciences of Rio Claro, São Paulo State University, Rio Claro, Brazil
Ailton Pereira da Costa Filho Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
Alisson Hayasi da Costa Department of Computer Science, Federal University of São Carlos, São Carlos, Brazil
Andréa T. Thomaz School of Natural Sciences, Universidad del Rosario, Bogotá, Colombia
Ana Lucia Mendes Pinheiro Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
Diego Mauricio Riaño-Pachón Computational, Evolutionary and Systems Biology Lab, Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
Flavia Vischi Winck Regulatory Systems Biology Lab, Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
Franciele Grego Esteves Center of the Study of Social Insects, Department of General and Applied Biology, Institute of Biosciences of Rio Claro, São Paulo State University, Rio Claro, Brazil
Gabriel Rodrigues Alves Margarido Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, Brazil
Giovanna Maria Stanfoca Casagrande Barretos Cancer Hospital, Barretos, Brazil
Henrique Cordeiro Frajacomo Department of Computer Science, Federal University of São Carlos, São Carlos, Brazil
Leonardo Martins Paulista School of Medicine, Federal University of São Paulo, São Paulo, Brazil
Mariana Feitosa Cavalheiro Department of Genetics, Evolution, Microbiology and Immunology, Institute of Biology, University of Campinas, Campinas, Brazil Genomics for Climate Change Research Center, University of Campinas, Campinas, Brazil
Nathalia Graf Grachet Roche Sequencing Solutions, Pleasanton, California, United States of America
Raniere Gaia Costa da Silva Department of Infectious Diseases and Public Health, Jockey Club College of Veterinary Medicine and Life Sciences, City University of Hong Kong, Hong Kong, Special Administrative Region, People’s Republic of China
Ricardo Cerri Department of Computer Science, Federal University of São Carlos, São Carlos, Brazil
Rommel Thiago Juca Ramos Institute of Biological Sciences, Federal University of Pará, Belém, Brazil
Simone Daniela Sartorio de Medeiros Department of Informatics and Statistics, Federal University of Santa Catarina, Florianópolis, Brazil
Thayana Vieira Tavares Department of Genetics and Evolution, Federal University of São Carlos, São Carlos, Brazil
Renato Augusto Corrêa dos Santos School of Pharmaceutical Sciences of Ribeirao Preto, University of São Paulo, Ribeirão Preto, Brazil Institute of Biology, State University of Campinas, Campinas, Brazil * E-mail:

Collapse

Allbee Q, Barber R. Writing python programs to map alleles related to genetic disease. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2021;49:677-678. [PMID: 33991167 DOI: 10.1002/bmb.21528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 05/06/2021] [Indexed: 06/12/2023]

Driscoll MK, Zaritsky A. Data science in cell imaging. J Cell Sci 2021;134:jcs254292. [PMID: 33795377 PMCID: PMC8034880 DOI: 10.1242/jcs.254292] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Elghafari A, Finkelstein J. Automated Identification of Common Disease-Specific Outcomes for Comparative Effectiveness Research Using ClinicalTrials.gov: Algorithm Development and Validation Study. JMIR Med Inform 2021;9:e18298. [PMID: 33460388 PMCID: PMC7899806 DOI: 10.2196/18298] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 08/30/2020] [Accepted: 01/17/2021] [Indexed: 01/02/2023] Open

Abstract

Background

Common disease-specific outcomes are vital for ensuring comparability of clinical trial data and enabling meta analyses and interstudy comparisons. Traditionally, the process of deciding which outcomes should be recommended as common for a particular disease relied on assembling and surveying panels of subject-matter experts. This is usually a time-consuming and laborious process.

Objective

The objectives of this work were to develop and evaluate a generalized pipeline that can automatically identify common outcomes specific to any given disease by finding, downloading, and analyzing data of previous clinical trials relevant to that disease.

Methods

An automated pipeline to interface with ClinicalTrials.gov’s application programming interface and download the relevant trials for the input condition was designed. The primary and secondary outcomes of those trials were parsed and grouped based on text similarity and ranked based on frequency. The quality and usefulness of the pipeline’s output were assessed by comparing the top outcomes identified by it for chronic obstructive pulmonary disease (COPD) to a list of 80 outcomes manually abstracted from the most frequently cited and comprehensive reviews delineating clinical outcomes for COPD.

Results

The common disease-specific outcome pipeline successfully downloaded and processed 3876 studies related to COPD. Manual verification indicated that the pipeline was downloading and processing the same number of trials as were obtained from the self-service ClinicalTrials.gov portal. Evaluating the automatically identified outcomes against the manually abstracted ones showed that the pipeline achieved a recall of 92% and precision of 79%. The precision number indicated that the pipeline was identifying many outcomes that were not covered in the literature reviews. Assessment of those outcomes indicated that they are relevant to COPD and could be considered in future research.

Conclusions

An automated evidence-based pipeline can identify common clinical trial outcomes of comparable breadth and quality as the outcomes identified in comprehensive literature reviews. Moreover, such an approach can highlight relevant outcomes for further consideration.

Collapse

Haiman ZB, Zielinski DC, Koike Y, Yurkovich JT, Palsson BO. MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics. PLoS Comput Biol 2021;17:e1008208. [PMID: 33507922 PMCID: PMC7872247 DOI: 10.1371/journal.pcbi.1008208] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 02/09/2021] [Accepted: 12/21/2020] [Indexed: 01/01/2023] Open

Du L, Liu Q, Fan Z, Tang J, Zhang X, Price M, Yue B, Zhao K. Pyfastx: a robust Python package for fast random access to sequences from plain and gzipped FASTA/Q files. Brief Bioinform 2020;22:6042388. [PMID: 33341884 DOI: 10.1093/bib/bbaa368] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Revised: 10/30/2020] [Accepted: 11/17/2020] [Indexed: 11/14/2022] Open

Mura C, Chalupa M, Newbury AM, Chalupa J, Bourne PE. Ten simple rules for starting research in your late teens. PLoS Comput Biol 2020;16:e1008403. [PMID: 33211694 PMCID: PMC7676678 DOI: 10.1371/journal.pcbi.1008403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Covalent Versus Non-covalent Enzyme Inhibition: Which Route Should We Take? A Justification of the Good and Bad from Molecular Modelling Perspective. Protein J 2020;39:97-105. [PMID: 32072438 DOI: 10.1007/s10930-020-09884-2] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Application of Systems Engineering Principles and Techniques in Biological Big Data Analytics: A Review. Processes (Basel) 2020. [DOI: 10.3390/pr8080951] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Workflow for Data Analysis in Experimental and Computational Systems Biology: Using Python as ‘Glue’. Processes (Basel) 2019. [DOI: 10.3390/pr7070460] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Rakov AV, Mastriani E, Liu SL, Schifferli DM. Association of Salmonella virulence factor alleles with intestinal and invasive serovars. BMC Genomics 2019;20:429. [PMID: 31138114 PMCID: PMC6540521 DOI: 10.1186/s12864-019-5809-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 05/20/2019] [Indexed: 12/12/2022] Open

Fletcher AC, Mura C. Ten quick tips for using a Raspberry Pi. PLoS Comput Biol 2019;15:e1006959. [PMID: 31048834 PMCID: PMC6497221 DOI: 10.1371/journal.pcbi.1006959] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Mariano D, Martins P, Helene Santos L, de Melo-Minardi RC. Introducing Programming Skills for Life Science Students. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2019;47:288-295. [PMID: 30860646 DOI: 10.1002/bmb.21230] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 01/18/2019] [Accepted: 02/18/2019] [Indexed: 05/04/2023]

Mathema VB, Dondorp AM, Imwong M. OSTRFPD: Multifunctional Tool for Genome-Wide Short Tandem Repeat Analysis for DNA, Transcripts, and Amino Acid Sequences with Integrated Primer Designer. Evol Bioinform Online 2019;15:1176934319843130. [PMID: 31040636 PMCID: PMC6482647 DOI: 10.1177/1176934319843130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 03/15/2019] [Indexed: 01/18/2023] Open

Wang G, Peng B. Script of Scripts: A pragmatic workflow system for daily computational research. PLoS Comput Biol 2019;15:e1006843. [PMID: 30811390 PMCID: PMC6411228 DOI: 10.1371/journal.pcbi.1006843] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Revised: 03/11/2019] [Accepted: 01/29/2019] [Indexed: 01/22/2023] Open

Diaz-del-Pino S, Rodriguez-Brazzarola P, Perez-Wohlfeil E, Trelles O. Combining Strengths for Multi-genome Visual Analytics Comparison. Bioinform Biol Insights 2019;13:1177932218825127. [PMID: 30783378 PMCID: PMC6365554 DOI: 10.1177/1177932218825127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 12/22/2018] [Indexed: 11/25/2022] Open

Abstract

The eclosion of data acquisition technologies has shifted the bottleneck in molecular biology research from data acquisition to data analysis. Such is the case in Comparative Genomics, where sequence analysis has transitioned from genes to genomes of several orders of magnitude larger. This fact has revealed the need to adapt software to work with huge experiments efficiently and to incorporate new data-analysis strategies to manage results from such studies. In previous works, we presented GECKO, a software to compare large sequences; now we address the representation, browsing, data exploration, and post-processing of the massive amount of information derived from such comparisons. GECKO-MGV is a web-based application organized as client-server architecture. It is aimed at visual analysis of the results from both pairwise and multiple sequences comparison studies combining a set of common commands for image exploration with improved state-of-the-art solutions. In addition, GECKO-MGV integrates different visualization analysis tools while exploiting the concept of layers to display multiple genome comparison datasets. Moreover, the software is endowed with capabilities for contacting external-proprietary and third-party services for further data post-processing and also presents a method to display a timeline of large-scale evolutionary events. As proof-of-concept, we present 2 exercises using bacterial and mammalian genomes which depict the capabilities of GECKO-MGV to perform in-depth, customizable analyses on the fly using web technologies. The first exercise is mainly descriptive and is carried out over bacterial genomes, whereas the second one aims to show the ability to deal with large sequence comparisons. In this case, we display results from the comparison of the first Homo sapiens chromosome against the first 5 chromosomes of Mus musculus.

Collapse

Erickson RA, Fienen MN, McCalla SG, Weiser EL, Bower ML, Knudson JM, Thain G. Wrangling distributed computing for high-throughput environmental science: An introduction to HTCondor. PLoS Comput Biol 2018;14:e1006468. [PMID: 30281592 PMCID: PMC6169842 DOI: 10.1371/journal.pcbi.1006468] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Garcia-Milian R, Hersey D, Vukmirovic M, Duprilot F. Data challenges of biomedical researchers in the age of omics. PeerJ 2018;6:e5553. [PMID: 30221093 PMCID: PMC6138043 DOI: 10.7717/peerj.5553] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Accepted: 08/10/2018] [Indexed: 12/17/2022] Open

Gauthier J, Vincent AT, Charette SJ, Derome N. A brief history of bioinformatics. Brief Bioinform 2018;20:1981-1996. [DOI: 10.1093/bib/bby063] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 06/22/2018] [Indexed: 02/06/2023] Open

Smith JK, Jiang S, Pfaendtner J. Redefining the Protein-Protein Interface: Coarse Graining and Combinatorics for an Improved Understanding of Amino Acid Contributions to the Protein-Protein Binding Affinity. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2017;33:11511-11517. [PMID: 28850233 DOI: 10.1021/acs.langmuir.7b02438] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Picard V, Mulner-Lorillon O, Bourdon J, Morales J, Cormier P, Siegel A, Bellé R. Model of the delayed translation of cyclin B maternal mRNA after sea urchin fertilization. Mol Reprod Dev 2016;83:1070-1082. [PMID: 27699901 DOI: 10.1002/mrd.22746] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 10/01/2016] [Indexed: 01/24/2023]

Affiliation(s)

Vincent Picard CNRS UMR 6241, Laboratoire LINA, Université de Nantes, Nantes, France.,CNRS, IRISA-UMR 6074, Campus de Beaulieu, Rennes, France.,INRIA, Centre Rennes-Bretagne Atlantique, Symbiose, Campus de Beaulieu, Rennes, France
Odile Mulner-Lorillon Sorbonne Universités, UPMC Univ Paris 06, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France.,CNRS, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France
Jérémie Bourdon CNRS UMR 6241, Laboratoire LINA, Université de Nantes, Nantes, France
Julia Morales Sorbonne Universités, UPMC Univ Paris 06, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France.,CNRS, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France
Patrick Cormier Sorbonne Universités, UPMC Univ Paris 06, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France.,CNRS, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France
Anne Siegel CNRS, IRISA-UMR 6074, Campus de Beaulieu, Rennes, France.,INRIA, Centre Rennes-Bretagne Atlantique, Symbiose, Campus de Beaulieu, Rennes, France
Robert Bellé Sorbonne Universités, UPMC Univ Paris 06, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France.,CNRS, UMR 8227, Integrative Biology of Marine Models, Translation Cell Cycle and Development, Station Biologique de Roscoff, Roscoff Cedex, France

Collapse