AUTHOR=Zhou Mowei , Fulcher James M. , Zemaitis Kevin J. , Degnan David J. , Liao Yen-Chen , Veličković Marija , Veličković Dušan , Bramer Lisa M. , Kew William R , Stacey Gary , Paša-Tolić Ljiljana TITLE=Discovery top-down proteomics in symbiotic soybean root nodules JOURNAL=Frontiers in Analytical Science VOLUME=2 YEAR=2022 URL=https://www.frontiersin.org/journals/analytical-science/articles/10.3389/frans.2022.1012707 DOI=10.3389/frans.2022.1012707 ISSN=2673-9283 ABSTRACT=

Proteomic methods have been widely used to study proteins in complex biological samples to understand biological molecular mechanisms. Most well-established methods (known as bottom-up proteomics, BUP) employ an enzymatic digestion step to cleave intact proteins into smaller peptides for liquid chromatography (LC) mass spectrometry (MS) detection. In contrast, top-down proteomics (TDP) directly characterizes intact proteins including all possible post-translational modifications (PTMs), thus offering unique insights into proteoform biology where combinations of individual PTMs may play important roles. We performed TDP on soybean root nodules infected by the symbiotic Bradyrhizobium japonicum in both the wildtype bacterium and a nifH- mutant, which lacks the ability to fix nitrogen in the soybean root nodule. TDP captured 1648 proteoforms derived from 313 bacterial genes and 178 soybean genes. Leghemoglobin, the most abundant protein in the sample, existed in many truncated proteoforms. Interestingly, these truncated proteoforms were considerably more abundant in the wildtype relative to the nifH- mutant, implicating protease activity as an important factor in nitrogen fixation. Proteoforms with various PTMs and combinations thereof were identified using an unrestricted open modification search. This included less common PTMs such as myristoylation, palmitoylation, cyanylation, and sulfation. In parallel, we collected high resolution MS imaging (MSI) data of intact proteins and biopolymers (<20 kDa due to current technical limitations) from sections of the soybean root nodules using matrix-assisted laser desorption/ionization (MALDI) coupled to high resolution Orbitrap. Several detected proteoforms exhibited unique spatial distributions inside the infection zone and cortex, suggesting functional compartmentalization in these regions. A subset of peaks from the MALDI-MSI were assigned to proteoforms detected in TDP LCMS data based on matching accurate masses. Many of the proteins detected in both LCMS and MALDI-MSI are currently uncharacterized in UniProt: the PTM and spatial information presented here will be valuable in understanding their biological functions. Taken together, our study demonstrates how untargeted TDP approach can provide unique insights into plant proteoform biology. On-going technology developments are expected to further improve TDP coverage for more comprehensive high-throughput analysis of proteoforms.