1
|
Flament F, Jiang R, Houghton J, Zhang Y, Kroely C, Jablonski NG, Jean A, Clarke J, Steeg J, Sehgal C, McParland J, Delaunay C, Passeron T. Accuracy and clinical relevance of an automated, algorithm-based analysis of facial signs from selfie images of women in the United States of various ages, ancestries and phototypes: A cross-sectional observational study. J Eur Acad Dermatol Venereol 2023; 37:176-183. [PMID: 35986708 PMCID: PMC10087370 DOI: 10.1111/jdv.18541] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 07/27/2022] [Indexed: 12/15/2022]
Abstract
BACKGROUND Real-life validation is necessary to ensure our artificial intelligence (AI) skin diagnostic tool is inclusive across a diverse and representative US population of various ages, ancestries and skin phototypes. OBJECTIVES To explore the relevance and accuracy of an automated, algorithm-based analysis of facial signs in representative women of different ancestries, ages and phototypes, living in the same country. METHODS In a cross-sectional study of selfie images of 1041 US women, algorithm-based analyses of seven facial signs were automatically graded by an AI-based algorithm and by 50 US dermatologists of various profiles (age, gender, ancestry, geographical location). For automated analysis and dermatologist assessment, the same referential skin atlas was used to standardize the grading scales. The average values and their variability were compared with respect to age, ancestry and phototype. RESULTS For five signs, the grading obtained by the automated system were strongly correlated with dermatologists' assessments (r ≥ 0.75); cheek skin pores were moderately correlated (r = 0.63) and pigmentation signs, especially for the darkest skin tones, were weakly correlated (r = 0.40) to the dermatologist assessments. Age and ancestry had no effect on the correlations. In many cases, the automated system performed better than the dermatologist-assessed clinical grading due to 0.3-0.5 grading unit differences among the dermatologist panel that were not related to any individual characteristic (e.g. gender, age, ancestry, location). The use of phototypes, as discontinuous categorical variables, is likely a limiting factor in the assessments of grading, whether obtained by automated analysis or clinical assessment of the images. CONCLUSIONS The AI-based automatic procedure is accurate and clinically relevant for analysing facial signs in a diverse and inclusive population of US women, as confirmed by a diverse panel of dermatologists, although skin tone requires further improvement.
Collapse
Affiliation(s)
| | - Ruowei Jiang
- ModiFace - A L'Oréal Group Company, Toronto, Ontario, Canada
| | - Jeff Houghton
- ModiFace - A L'Oréal Group Company, Toronto, Ontario, Canada
| | - Yuze Zhang
- ModiFace - A L'Oréal Group Company, Toronto, Ontario, Canada
| | | | - Nina G Jablonski
- Department of Anthropology, The Pennsylvania State University, University Park, State College, Pennsylvania, USA
| | | | - Jeffrey Clarke
- Evaluative Criteria Incorporated, Tarrytown, New York, USA
| | - Jason Steeg
- Evaluative Criteria Incorporated, Tarrytown, New York, USA
| | | | | | | | - Thierry Passeron
- Department of Dermatology, Université Côte d'Azur, CHU Nice, Nice, France.,Université Côte d'Azur, INSERM, U1065, C3M, Nice, France
| |
Collapse
|