(27cf) A Systems Engineering Computer-Assisted Biomarker Detection Framework for Autism Spectrum Disorder Using Proteomic Data
AIChE Annual Meeting
2023
2023 AIChE Annual Meeting
Food, Pharmaceutical & Bioengineering Division
Poster session: Bioengineering
Monday, November 6, 2023 - 3:30pm to 5:00pm
To address this issue, we propose systematically generating physically meaningful novel features more resilient to these confounding factors than the original measurements, such as protein levels. We then propose an automated computer-assisted biomarker detection framework that integrates these novel features with a hybrid feature selection technique and a linear machine learning (ML) model. The effectiveness of the framework was demonstrated using a dataset of serum samples from 76 typically developing (TD) boys and 78 boys with ASD, aged 18 months to 8 years, which were examined to identify potential biomarkers for ASD using SomaLogicâs SOMAScan⢠assay 1.3K platform [3].
Our proposed framework identifies a panel of 12 features, including a combination of protein levels and novel features defined in this work. Using the dataset mentioned above, the proposed method detects ASD with high accuracy - achieving an area under the curve (AUC) of 0.940, outperforming the previous study of 0.860. In addition to the proteins used as features in previous studies, a novel set of engineered features that includes the ratio of proteins is proposed, which reduces within-class variations due to their resilience to confounding factors. The proposed feature selection technique combines a sequential filter and a wrapper feature selection method to tackle their respective limitations. A linear ML model is then developed using training samples and independently tested using a set of hold-out samples. The linear ML model is chosen for its robustness to overfitting and superior interpretability.
Our methodology introduces systems engineering principles and techniques to ASD detection research. Specifically, biomarkers beyond the traditional physical trait are defined to include bio-information that can only be extracted by considering their interactions and correlations. The systems engineering perspective provides additional insights into the ASD mechanism, which can lead to additional discoveries in the future.
References:
- Matthew J Maenner et al. âPrevalence and characteristics of autism spectrum disorder among children aged 8 yearsâautism and developmental disabilities monitoring network, 11 sites, United States, 2018â. In: MMWR Surveillance Summaries 11 (2021), p. 1.
- Fatir Qureshi et al. âMultivariate Analysis of Metabolomic and Nutritional Pro- files among Children with Autism Spectrum Disorderâ. In: Journal of Personalized Medicine 6 (2022), p. 923.
- Laura Hewitson et al. âBlood biomarker discovery for autism spectrum disorder: A proteomic analysisâ. In: PLoS One 2 (2021), e0246581.
- Eleftherios P Diamandis. âAnalysis of serum proteomic patterns for early cancer diagnosis: drawing attention to potential problemsâ. In: Journal of the National Cancer Institute 5 (2004), pp. 353â356.
- Keith A Baggerly et al. âSignal in noise: evaluating reported reproducibility of serum proteomic tests for ovarian cancerâ. In: Journal of the National Cancer Institute 4 (2005), pp. 307â309.