(403e) FFIBER: Framework for Fluorescent Neuroimaging Based Experimental Routines
AIChE Annual Meeting
2020
2020 Virtual AIChE Annual Meeting
Topical Conference: Applications of Data Science to Molecules and Materials
Applications of Data Science to High Throughput Experimentation
Wednesday, November 18, 2020 - 9:00am to 9:15am
Methods: To develop data awareness, we performed a data assessment of relevant metadata, data storage, and data relationships. When designing a data management plan, we identified the optimal data storage locations, the file structure for raw data and results, and personnel responsibilities. To determine an optimal experimental pipeline, we performed software reviews, addressed labor-hour bottlenecks, and identified the necessary programs, packages, and scripts. We then built out the supporting data science infrastructure, using Python and Jupyter notebooks as the base scripts for pipelines, integrating electronic laboratory notebooks, and sharing version control across the team through GitHub and Google Drive. After the pipeline was built, we completed the primary set of experimental imaging using confocal microscopy of brain slices and fed the data through the pipeline for analysis. We then completed supplemental imaging to experimentally support a representative set of results obtained through FFIBER. Finally, we wrote Python scripts to output consistent, interpretable visualizations of our analyses and to align these with other experimental results.
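As an illustration of the file-structure step of a data management plan, the following is a minimal sketch of a per-slice folder layout for raw data and results. The function name build_layout, the experiment name, the slice identifiers, and the raw/results folder names are hypothetical and are not taken from FFIBER itself; a temporary directory stands in for the shared storage location.

```python
from pathlib import Path
import tempfile


def build_layout(root, experiment, slice_ids):
    """Create raw/ and results/ folders for each slice of one experiment.

    All names here are illustrative assumptions, not the FFIBER layout.
    """
    created = []
    for slice_id in slice_ids:
        for stage in ("raw", "results"):
            folder = Path(root) / experiment / stage / slice_id
            # parents=True builds intermediate folders; exist_ok=True makes reruns safe
            folder.mkdir(parents=True, exist_ok=True)
            created.append(folder)
    return created


# A temporary directory stands in for the shared drive (assumption).
with tempfile.TemporaryDirectory() as tmp:
    folders = build_layout(tmp, "ogd_experiment", ["slice_A", "slice_B"])
    # Record the relative paths before the temporary directory is removed.
    relative = sorted(f.relative_to(tmp).as_posix() for f in folders)

for path in relative:
    print(path)
```

Keeping raw data and results in parallel trees indexed by slice is one way to let scripts locate inputs and outputs without hard-coded paths.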
Results: We applied FFIBER to a data science experiment analyzing the phenotype of microglial cells in ex vivo brain slices to assess the extent of glial cell activation. We were able to create and modify a pipeline during the initial stages of (1) an oxygen-glucose deprivation experiment, and retroactively for (2) an inflammation-sensitized model using E. coli-derived lipopolysaccharide and (3) a neonatal hypoxia-ischemia (HI) model. A preliminary data assessment revealed a storage requirement of 5 GB per experiment, with five individual file types from five separate personnel. We completed the data assessment and built a data management plan with university-supported Google Drive as our storage location; .tiff, .csv, and .ipynb as our file types; and a slice-based, double-blind file structure. We wrote Python scripts that automate image upload, cleaning, and feeding into our analysis pipeline, which reduced labor-hours from five hours per image set to less than ten minutes. Our software review led us to integrate a published cell morphometrics package, VAMPIRE, with our diff_register package for skeletonized morphometrics of microglia. The image analysis pipeline we developed with FFIBER split uploaded confocal images into fourths; segmented, skeletonized, and identified the microglia in the images; and passed the images to our integrated morphometric analysis. FFIBER structured the development of a work pipeline that decreased the time needed to analyze an entire experiment from a full day to less than an hour. Finally, our FFIBER-structured pipeline enabled us to detect and quantify shape differences between non-treated and injured ex vivo slices, providing additional information for our initial oxygen-glucose deprivation experiment and retroactive insight into our inflammation-sensitized and neonatal HI models.
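The first step of the image analysis pipeline described above, splitting each confocal image into fourths, can be sketched with NumPy slicing. This is a minimal sketch, not the authors' implementation: the function name split_into_quadrants is hypothetical, a small synthetic array stands in for a real .tiff image, and the segmentation, skeletonization, and morphometric steps (handled in the abstract by diff_register and VAMPIRE) are not shown.

```python
import numpy as np


def split_into_quadrants(image):
    """Split a 2D (or channel-last) image array into four quadrants.

    Hypothetical helper; odd dimensions leave the extra row or column
    in the bottom or right quadrants.
    """
    rows, cols = image.shape[:2]
    mid_r, mid_c = rows // 2, cols // 2
    return [
        image[:mid_r, :mid_c],  # top-left
        image[:mid_r, mid_c:],  # top-right
        image[mid_r:, :mid_c],  # bottom-left
        image[mid_r:, mid_c:],  # bottom-right
    ]


# Synthetic 6 x 8 stand-in for a confocal image (assumption).
image = np.arange(48, dtype=np.uint8).reshape(6, 8)
quadrants = split_into_quadrants(image)

for name, quad in zip(["top-left", "top-right", "bottom-left", "bottom-right"], quadrants):
    print(name, quad.shape)
```

Slicing returns views rather than copies, so the split itself adds little memory overhead even for large image sets.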
Our first application of FFIBER decreased our human labor time from more than 24 hours per 5 GB image set to less than an hour, decreased our storage budget to $0, and produced information about cell morphometrics that was previously unavailable to our lab because of time constraints and a lack of analysis software.
Conclusions: By applying FFIBER, we were able to create workflow pipelines both during the initial stages of experimental design and retroactively for neuroimage sets already collected. We decreased labor hours per data set from 24 hours to less than an hour by replacing manual work with Python scripts and a well-designed pipeline. Our framework creates a high-throughput approach to experimental design in research dependent on neuroimaging by integrating traditional experimental methods with modern data science applications. We can further iterate on the methodology to include metadatabases for easier data access and data connections, budget-versus-storage matrices for determining the best data storage location, and templates for electronic lab notebooks, file structures, data relationships, and generalized processing pipelines. Additionally, while FFIBER was developed for fluorescent neuroimaging datasets, the current workflow is robust and flexible enough for application to other image analysis techniques, other organs, and other fluorescent imaging methods.