Reference Citation Analysis

For: Amsaleg L, Chelly O, Furon T, Girard S, Houle ME, Kawarabayashi KI, Nett M. Extreme-value-theoretic estimation of local intrinsic dimensionality. Data Min Knowl Discov 2018. [DOI: 10.1007/s10618-018-0578-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Indexed: 11/24/2022]

Cited by Other Article(s)

Valdés JJ, Tchagang AB. Novel machine learning insights into the QM7b and QM9 quantum mechanics datasets. J Comput Chem 2024;45:1193-1214. [PMID: 38329198 DOI: 10.1002/jcc.27295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/27/2023] [Revised: 12/06/2023] [Accepted: 12/12/2023] [Indexed: 02/09/2024]

Abstract

This paper (i) explores the internal structure of two quantum mechanics datasets (QM7b, QM9), composed of several thousand organic molecules described in terms of electronic properties, and (ii) further explores an inverse approach to molecular design that uses machine learning methods to approximate the atomic composition of molecules, using QM9 data. Understanding the structure and characteristics of this kind of data is important when predicting the atomic composition from physical-chemical properties in inverse molecular design. Intrinsic dimension analysis, clustering, and outlier detection methods were used in the study. They revealed that for both datasets the intrinsic dimensionality is several times smaller than the number of descriptive dimensions. The QM7b data is composed of well-defined clusters related to atomic composition. The QM9 data consists of an outer region predominantly composed of outliers, and an inner, core region that concentrates clustered inlier objects. A significant relationship exists between the number of atoms in a molecule and its outlier/inlier nature. The spatial structure exhibits a relationship with molecular weight. Despite the structural differences between the two datasets, the predictability of variables of interest for inverse molecular design is high. This is exemplified by models estimating the number of atoms in the molecule both from the original properties and from lower-dimensional embedding spaces. In the generative approach, the input is a set of desired properties of the molecule and the output is an approximation of the atomic composition in terms of its constituent chemical elements. This could serve as the starting region for further search in the huge space determined by the set of possible chemical compounds. The study uses the quantum mechanics dataset QM9, composed of 133,885 small organic molecules and 19 electronic properties.
Different multi-target regression approaches were considered for predicting the atomic composition from the properties, including feature engineering techniques in an auto-machine learning framework. High-quality models were found that predict the atomic composition of the molecules from their electronic properties, as well as from a subset only 52.6% of the full size. Feature selection worked better than feature generation. The results validate the generative approach to inverse molecular design.

The generalized ratios intrinsic dimension estimator. Sci Rep 2022;12:20005. [PMID: 36411305 PMCID: PMC9678878 DOI: 10.1038/s41598-022-20991-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2021] [Accepted: 09/21/2022] [Indexed: 11/23/2022]

Bailey J, Houle ME, Ma X. Local Intrinsic Dimensionality, Entropy and Statistical Divergences. Entropy (Basel) 2022;24:1220. [PMID: 36141105 PMCID: PMC9497584 DOI: 10.3390/e24091220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Revised: 08/22/2022] [Accepted: 08/26/2022] [Indexed: 06/16/2023]

Benkő Z, Stippinger M, Rehus R, Bencze A, Fabó D, Hajnal B, Eröss LG, Telcs A, Somogyvári Z. Manifold-adaptive dimension estimation revisited. PeerJ Comput Sci 2022;8:e790. [PMID: 35111907 PMCID: PMC8771813 DOI: 10.7717/peerj-cs.790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2021] [Accepted: 11/01/2021] [Indexed: 06/14/2023]

Qiu H, Yang Y, Rezakhah S. Intrinsic dimension estimation method based on correlation dimension and kNN method. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/20/2022]

Thordsen E, Schubert E. ABID: Angle Based Intrinsic Dimensionality — Theory and analysis. Inf Syst 2022. [DOI: 10.1016/j.is.2022.101989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/05/2022]

Aumüller M, Ceccarello M. The role of local dimensionality measures in benchmarking nearest neighbor search. Inf Syst 2021. [DOI: 10.1016/j.is.2021.101807] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/26/2022]

Bac J, Mirkes EM, Gorban AN, Tyukin I, Zinovyev A. Scikit-Dimension: A Python Package for Intrinsic Dimension Estimation. Entropy (Basel) 2021;23:1368. [PMID: 34682092 PMCID: PMC8534554 DOI: 10.3390/e23101368] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 09/17/2021] [Revised: 10/10/2021] [Accepted: 10/16/2021] [Indexed: 02/07/2023]

Qiu H, Yang Y, Li B. Intrinsic dimension estimation based on local adjacency information. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.01.017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/17/2022]