The FBMN method consists of two main steps: 1) LC-MS feature detection and alignment, then 2) a dedicated molecular networking workflow on GNPS. Our first prototype for FBMN was developed with the Optimus workflow7 (link),14 that uses OpenMS tools10 (link). Following step 1 (feature detection and alignment), two files are exported: a feature quantification table (.TXT format) and a MS2spectral summary (.MGF format). The feature quantification table contains information about LC-MS features across all considered samples including a unique identifier (Feature ID) for each feature, m/z value, retention time, and intensity. The MS2spectral summary contains a list of MS2 spectra, with one representative MS2 spectrum per feature. The mapping of information between the feature quantification table and the MS2spectral summary is stored in these files using the feature ID and scan number, respectively. This simple mapping enables to relate LC-MS feature information or statistically derived results to the molecular network nodes. This approach was also used for the integration of other tools with FBMN, and does not require third party software like it was proposed in the past22 ,23 (link). Finally, the FBMN workflow also supports the mzTab-M format6 (link), a standardized output format designed for the report of metabolomics MS-data processing results. In this case, the mzTab-M file is used instead of feature quantification table and requires the input of the mzML files instead of the MS2spectral summary file. Support for the mzTab-M format enables the possibility to perform FBMN with any existing and future processing tools that support this standardized format.
The FBMN workflow has been integrated into the GNPS ecosystem and thus benefits from the connection with other GNPS features, e.g. the possibility to perform automatic MS2 spectral library search, the direct addition and curation of library entries, the search of a spectrum against public datasets with MASST20 (link), and the visualization of molecular networks directly in the web browser24 (link) or with Cytoscape25 (link). The FBMN workflow is available on the GNPS platform (https://gnps.ucsd.edu/) via a web interface (See Supplementary Fig. 2). Jobs are computed and stored on the computational infrastructure of the Center for Computational Mass Spectrometry at the University of California San Diego. Each finished job is saved in the private user space for future examination and has a permanent static link that enables data sharing and collaborative analyses. We strongly recommend the sharing of this static link along with publications using GNPS workflows to facilitate results accessibility and data analysis reproducibility. Instructions to perform FBMN with the supported tools and input file format requirements are provided in the GNPS documentation (https://ccms-ucsd.github.io/GNPSDocumentation/featurebasedmolecularnetworking and Supplementary Fig. 3).