- 1Bioanalytics Group, Department of Clinical Chemistry, University Medical Center Götttingen, Göttingen, Germany
- 2Department of Medical Bioinformatics, University Medical Center Götttingen, Göttingen, Germany
- 3Cardiac Remodeling Section, Heart Research Center Göttingen, Department of Cardiology and Pneumology, University Medical Center Göttingen, Göttingen, Germany
- 4Cellular Biophysics and Translational Cardiology Section, Heart Research Center Göttingen, Department of Cardiology and Pneumology, University Medical Center Göttingen, Göttingen, Germany
- 5Cluster of Excellence “Multiscale Bioimaging: From Molecular Machines to Networks of Excitable Cells” (MBExC), Georg August University, Göttingen, Germany
- 6Bioanalytical Mass Spectrometry Group, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
Introduction: Co-fractionation mass spectrometry couples native-like separations of protein-protein complexes with mass spectrometric proteome analysis for global characterization of protein networks. The technique allows for both de novo detection of complexes and for the detection of subtle changes in their protein composition. The typical requirement for fine-grained fractionation of >80 fractions, however, translates into significant demands on sample quantity and mass spectrometric instrument time, and represents a significant barrier to experimental replication and the use of scarce sample material (ex. patient biopsies).
Methods: We developed mini-Complexome Profiling (mCP), a streamlined workflow with reduced requirements for fractionation and, thus, biological material and laboratory and instrument time. Soluble and membrane-associated protein complexes are extracted from biological material under mild conditions, and fractionated by Blue Native electrophoresis using commercial equipment. Each fraction is analysed by data-independent acquisition mass spectrometry, and known protein complexes are detected based on the coelution of known components using a novel R package with a controlled false discovery rate approach. The tool is available to the community on a GitHub repository.
Results: mCP was benchmarked using HEK293 cell lysate and exhibited performance similar to established workflows, but from a significantly reduced number of fractions. We then challenged mCP by performing comparative complexome analysis of cardiomyocytes isolated from different chambers from a single mouse heart, where we identified subtle chamber-specific changes in mitochondrial OxPhos complexes.
Discussion: The reduced sample and instrument time requirements open up new applications of co-fractionation mass spectrometry, specifically for the analysis of sparse samples such as human patient biopsies. The ability to identify subtle changes between similar tissue types (left/right ventricular and atrial cardiomyocytes) serves as a proof of principle for comparative analysis of mild/asymptomatic disease states.
1 Introduction
On the molecular level, the majority of cellular functions such as energy metabolism, cell division or replication rely on the actions of protein-protein complexes (PPCs). The systematic study of PPCs is therefore crucial to understanding and characterizing cellular biology. Over the last decade, complexome profiling (CP) as an implementation of co-fractionation mass spectrometry (CF-MS) has emerged as a key approach to detecting PPCs on a global scale. CP frequently includes mild non-ionic detergent-based solubilization and extraction of complexes; non-denaturing (“native-like”) enrichment and fractionation of the resulting PPCs; profiling of the protein contents across all fractions by mass spectrometry; and finally, statistical and bioinformatic analysis of the fraction-resolved protein profiles obtained. The co-elution of proteins suggests their biophysical interaction in a non-covalent PPC, and there are computational approaches to calculate co-elution scores and correlate them to protein-protein interactions (Havugimana et al., 2012; Giese et al., 2015; Stacey et al., 2017; Heusel et al., 2019; Bludau et al., 2020). CP has been used to complement affinity purification experiments (Uliana et al., 2023) and is currently the method of choice for proteome-wide screening of interactome rewiring across samples or conditions (Bludau, 2021). Recently, efforts have begun to consolidate this young technique to enable better sharing of biological results, which will be of great value to the scientific community (Van Strien et al., 2021).
All complexome profiling methods build on the premise of successful solubilization of intact PPCs from their biological environments with high fidelity. This is particularly challenging in the case of membrane-associated PPCs, where a 3-step extraction mechanism has been proposed (Le Maire et al., 2000): in stage I, detergent molecules interact with membrane lipids, and non-micellar detergent partitions into the phospholipid bilayer are observed. In stage II, a mixture of phospholipid-detergent micelles and phospholipid membranes saturated with detergent coexist in equilibrium. Finally, in stage III, membrane complexes are fully solubilized into micelles. Therefore, membrane-associated PPCs may or may not be characterized by a defined stoichiometry. When using “native-like” separations for their enrichment, this often results in poorly defined elution profiles compared to those observed for soluble complexes, making it challenging to establish statistical parameters for their automated detection.
The study of membrane-associated PPCs has gained particular attention in the field of cardiovascular research, where they play key roles in several key functions such as Ca2+ homeostasis, cell-cell junctions and energy metabolism. A frequently used approach involves the preparation of so-called enriched membrane fractions (Santos et al., 2002; Soni et al., 2016), followed by solubilization with non-ionic detergents such as cholate, deoxycholate, 3-[(3-cholamidopropyl) dimethylamino]-1-propane sulfonate (CHAPS) or mixtures thereof (Cornelius, 1991). Alternatively, enriched membrane fractions may be solubilized by glycosides such as digitonin, a strategy frequently used for the detection of mitochondrial (Schtigger et al., 1991) or cardiac membrane proteins (Alsina et al., 2019). E.g., mitochondrial supercomplexes were visualized using blue native electrophoresis (BNE) following mild digitonin or Nonoxinol 40 (NP-40) extraction (Acín-Pérez et al., 2008), or using size exclusion chromatography (SEC) followed by octaethylene glycol monododecyl ether (C12E8) extraction (Foo et al., 2024). In the context of cardiac research, the compositions of active mitochondrial human respiratory super-complexes composed of mitochondrial complex I, III, and IV (CI1III2IV1) and the architecture of mitochondrial complex I, II, and IV(CI2III2IV2) were elucidated (Guo et al., 2017). This is of particular importance since the metabolic state of e.g., mouse hearts has been directly linked to the abundance of mitochondrial supercomplexes in isolated mitochondria (Zheng et al., 2023). As an alternative to the preparation of enriched membrane fractions, cytoplasmic and membrane-associated protein complexes can be extracted directly from whole cells. In this case, the lysis buffer is applied directly to the cells without previous fractionation, an approach here called general lysis (Heusel et al., 2019). It is worth mentioning that both enriched membrane fractions and general lysis methods are able to extract both membrane and soluble protein complexes, albeit with varying efficiencies and fidelities.
Once complexes are solubilized, the fractionation of PPCs is carried out using non-denaturing or “native-like” separation methods such as BNE (Heide et al., 2012; Alsina et al., 2019), SEC (Heusel et al., 2019; Heusel et al., 2020), ion exchange chromatography (IEX) (Havugimana et al., 2012) or density gradient centrifugation (Páleníková et al., 2021). Among these, BNE and SEC are by far the most frequently used due to their demonstrated potential for high proteome and PPC coverage (Skinnider and Foster, 2021). Blue native polyacrylamide gel electrophoresis (BN-PAGE) was originally developed as a variant of native PAGE to study mitochondrial protein complexes. Depending on the gel matrix used, the method enables separations up to the 10 MDa molecular weight range (Schtigger et al., 1991; Wittig et al., 2006). Solubilizing detergents are not strictly required for BN-PAGE; the combination of the anionic Coomassie dye with non-ionic detergents, however, can generate an amphipathic effect that can be detrimental to protein complex stability (Wittig et al., 2006), necessitating careful optimization.
Historically, mass spectrometric analysis in CP-MS experiments has mostly been performed using data-dependent acquisition mass spectrometry (DDA-MS) (Skinnider and Foster, 2021). In the last decade, however, data-independent acquisition mass spectrometry (DIA-MS) has emerged as a superior acquisition mode in proteomics to detect and quantify proteins in complex samples (Gillet et al., 2012). Due to its sensitivity (Bruderer et al., 2015; Dowell et al., 2021) and quantitative fidelity (Barkovits et al., 2020), DIA-MS has been increasingly applied to complexome profiling (Heusel et al., 2019; Bludau et al., 2020; Heusel et al., 2020). More recently, trapped ion mobility-time of flight-mass spectrometry (timsToF-MS) (Meier et al., 2015) has expanded the utility of DIA-MS by introducing an additional dimension of separation to the analysis of peptide precursors. The approach, termed diaPASEF (Meier et al., 2020), allows for the identification and quantification of high numbers of proteins in very short liquid chromatography tandem mass spectrometry (LC/MS/MS) experiments gradients (Skowronek et al., 2022), and can be tailored to different gradient lengths and sample complexities, and novel DIA setup designs (Szyrwiel et al., 2022). Following the development and refinement of these acquisition paradigms, multiple data analysis platforms have been made available for the analysis of DIA data, including DIA-NN (Demichev et al., 2020), Spectronaut (Bruderer et al., 2015), Skyline (MacLean et al., 2010), Max Quant (Sinitcyn et al., 2021), and MS-Fragger (Yu et al., 2023).
The statistical and bioinformatic analysis of CF-MS datasets has been greatly improved, and today there are many different alternatives to detect PPCs in complex biological samples. The detection of protein complexes by unbiased clustering of protein abundance profiles, known as non-targeted analysis, is implemented in software tools like NOVA (Giese et al., 2015). In addition, there have been approaches to detect protein complexes in LC/MS/MS that look for protein complexes by machine learning, namely EPIC (Hu et al., 2019), PCprophet (Fossati et al., 2021b), ComplexFinder (Nolte and Langer, 2021), PrInCE (Stacey et al., 2017), and DIP-MS (Frommelt et al., 2024). Targeted analysis, on the other hand, relies on the introduction of a working hypothesis or “ground truth” to extract protein profiles for further correlation analysis (Bludau, 2021). This principle is implemented in software packages like CCprofiler (Bludau et al., 2020) and ComplexBrowser (Michalak et al., 2019). Ground truth is often introduced from existing PPC databases such as CORUM (Tsitsiridis et al., 2023), String (Szklarczyk et al., 2023) or Complex Portal (Meldal et al., 2015).
The main current disadvantage of most CP approaches is the classic requirement for high numbers of fractions, which in turn requires significant mass spectrometric instrument time for analysis. Most established approaches use 50–80 fractions, and high-resolution experiments with up to 1,000 fractions have been tested (Havugimana et al., 2012). The resulting requirement for MS instrument time has been greatly reduced due to the advent of high density, short gradient LC/MS/MS methods such as diaPASEF, and systematic studies have demonstrated the limited benefits of over-fractionation, as pointed out in a meta-study of 206 published datasets (Skinnider and Foster, 2021). Even so, the effort required currently precludes the comparison of significant numbers of samples, or even replication to assess the reproducibility of quantitation.
Here, we introduce a novel workflow called mini-Complexome Profiling (mCP), complete with a publicly available R package available for data analysis, which allows for targeted detection of known protein complexes in low fractionation, BNE-based experiments. At 35 fractions and correspondingly reduced amounts of required instrument time, mCP is particularly suitable for screening samples from multiple conditions, and allows for replication to stabilize results. We developed and benchmarked it on HEK293 cells, an established model for protein complex studies, where our approach allowed the detection of 535 protein complexes in total cell lysates, equivalent to the number of PPCs detected in 81 fractions previously (Heusel et al., 2019; Bludau et al., 2020). Subsequently, we show the versatility of mCP in the comparative analysis of cardiomyocytes from the left and right ventricles as well as the atria of a single mouse heart, demonstrating its utility in performing comparative analysis from sparse samples.
2 Materials and methods
2.1 HEK293 cultivation
HEK293A cells (ATCC) were cultivated on T75 flasks (Greiner) in low-glucose Dulbecco’s Modified Eagle’s Medium (DMEM, Sigma) supplemented with 10% (v/v) fetal bovine serum (FBS, Thermo Fisher), 2 mM L-glutamine GlutaMAX (Sigma-Aldrich) and 1x penicillin/streptomycin (Sigma-Aldrich). Cells were cultivated under humidified conditions with 5% CO2 at 37°C and passaged every 3-4 days using 0.05% trypsin/EDTA (Sigma-Aldrich). Cells were harvested in ice-cold PBS buffer using a cell scraper, and sedimented for 5 min at 500 × G at 4°C.
2.2 Murine cardiomyocyte isolation
Mouse handling was performed in accordance with directive 2010/63/EU of the European Parliament and in keeping with NIH guidelines. The use of mice as a source of cardiac tissues for assay development is covered by the institutional protocols 11/2 and 22/19, reviewed and approved by the institutional animal committee of the University Medical Center Göttingen.
A 12-week old, male, wild-type mouse in the C57BL6N background was sacrificed and cardiomyocytes were isolated following a protocol published previously (Wagner et al., 2014). Following digestion by collagenase, the left and right ventricles, as well as the atria were dissected under a binocular microscope (Stemi 305, Zeiss) in 2 mL digestion buffer, and digestion was stopped by adding 3 mL stopping buffer containing 10% FBS. Isolated cardiomyocytes from the left and right ventricles (LV/RV) and the pooled atria (PA) were washed twice with the stopping buffer, cells sedimented for 8 min by gravity at room temperature (RT), the supernatant discarded and the cell pellets stored at −80°C.
2.3 Non-denaturing lysis and BNE fractionation
Protein complexes were extracted from HEK293 cells and isolated cardiomyocyte samples following established protocols (Bludau et al., 2020), with the omission of the concentration and buffer exchange steps. Lysis was performed at 4°C until the electrophoresis step. HEK293 cells were resuspended in 500 µL of HNN lysis buffer [0.05% vol/vol NP-40, 50 mM HEPES, 150 mM NaCl, 50 mM NaF, 200 µM Na3VO4, 1 mM phenylmethylsulfonyl fluoride (PMSF), and 1x protease inhibitor cocktail, pH 7.5] (Bludau et al., 2020). Cardiomyocytes isolated from the mouse left and right ventricle and from the atria were lysed by addition of 0.62 µL of non-denaturing lysis buffer per mg of moist pellet. Lysates were cleared by ultracentrifugation (100,000 × G, 15 min) and protein concentrations were determined by BCA (Pierce) and determined to be at least 2.7 mg/mL. Lysates were analyzed by Blue Native-PAGE on pre-cast Native 3%–12% Bis-Tris minigels (Invitrogen) using standard protocols (Wittig et al., 2006). Nativemark standard (Invitrogen) was used for molecular weight calibration. 35 μg of protein per lane were used for HEK293 complexome profiling, and 60 µg for cardiomyocytes. Lanes were cut into 35 equidistant fractions irrespective of staining using an in-house stainless steel cutter. All fractions were reduced, alkylated, and digested with trypsin in 96 well microplates as previously described (Atanassov and Urlaub, 2013).
2.4 SDS-PAGE purification
Briefly, 17.6 µg cleared lysate from HEK293 cells, or 5 µg cleared lysate from cardiomyocytes were purified by a brief 1 cm SDS-PAGE run (Nu-PAGE minigel 4%–12%, Invitrogen). After Coomassie staining, each lane was excised in one piece, reduced, alkylated, and digested with trypsin as previously described (Atanassov and Urlaub, 2013).
2.5 LC/MS/MS data acquisition
In-gel digested samples were resuspended via sonication in loading buffer (2% aqueous acetonitrile, 0.1% formic acid) and spiked with iRT standard peptides (1/100 diluted, Biognosys). Peptides were separated on a nanoflow chromatography system using reversed-phase chromatography, and the eluent was analyzed on a hyphenated timed ion mobility-time of light mass spectrometer (timsTOF Pro 2, Bruker) in either PASEF or diaPASEF acquisition modes.
2.5.1 Data-independent acquisition for proteome profiling
Peptides were separated using a linear 12.5 min gradient of 4%–32% aqueous acetonitrile versus 0.1% formic acid on a reversed phase-C18 column (PepSep Fifteen, 150 mm × 0.150 mm, Bruker) at a flow rate of 850 nL min−1. DIA analysis was performed in diaPASEF mode using a customized 8 × 2 window acquisition method from m/z 100 to 1,700, and from 1/K0 = 0.7–1.5 to include the 2+/3+/4+ population in the m/z-ion mobility plane (Skowronek et al., 2022). The collision energy was ramped linearly as a function of the mobility from 59 eV at 1/K0 = 1.5 Vs cm−2 to 20 eV at 1/K0 = 0.7 Vs cm−2. Two technical replicates of each sample were acquired.
2.5.2 Data-independent acquisition for spectral library generation
Peptides coming from lysates were measured using two different gradients and columns. The first two technical replicates of each lysate were acquired with the same column and gradient as DIA for proteome profiling (see above). Another two technical replicates were measured using a longer linear 100 min gradient of 2%–37% aqueous acetonitrile versus 0.1% formic acid on a reversed phase-C18 column (Aurora Elite 250 mm × 0.075 mm, IonOpticks) at a flow rate of 250 nL min−1 and the same DIA setting used for proteome profiling.
2.6 Data processing
DIA complexome profiling datasets were analyzed by DIA-NN software v1.8.1 (Charité; Demichev et al., 2020), Spectronaut software v16.2 (Biognosys; Bruderer et al., 2015) and MaxQuant software v2.0.1.0 (MPI for Biochemistry; Sinitcyn et al., 2021). Protein identification was performed against the UniProtKB human and mouse reference proteomes (version 01/22), respectively, augmented by a set of 51 known laboratory contaminants, including the concatenated iRT peptide sequences. For all search engines, carbamidomethylation was set as a fixed modification, trypsin/P as the enzyme specificity, and a maximum of two missed cleavages were allowed. The minimum peptide length was set to seven amino acid residues, the maximum to 30. All FDR levels were set to a maximum of 0.01.
2.6.1 DIA-NN analysis
The search parameters of DIA-NN (v1.8.1) were set to robust LC (high precision mode), library-free search enabled and library generation with smart profiling, deep learning-based spectra, neural network classified single-pass mode, cross normalization RT dependent with match between runs, use of isotopologues, no shared spectra, and heuristic protein and gene protein inference active. RTs and IMs prediction were enabled. When generating a spectral library, in silico predicted spectra were retained if deemed more reliable than experimental ones. Fixed-width center of each elution peak was used for quantification.
2.6.2 MaxQuant TIMS-DIA analysis
For data analysis in MaxQuant, the TIMS DIA algorithm v2.0.1.0 was used at default parameters. The MaxQuant discovery library Homo sapiens peptides, evidence, and msms files (release 2021-06-23) were used for searching. Protein quantification was based on both unique and razor peptides, FastLFQ was used for DIA quantification.
2.6.3 Spectronaut analysis
Search parameters in Spectronaut (v16.2) were set as follows: analysis was performed in directDIA mode using default settings, except that the top 10 peptides per protein were used for quantification (major group top N).
2.6.4 Generation of annotated MS/MS spectral libraries
We built two annotated MS/MS spectral libraries based on experimental data and processing in DIA-NN software (v1.8.1) using the search settings described above. For HEK293 cells, the spectral library was generated from the analysis of two non-fractionated lysates using both the short and long gradients described above with two technical injection replicates, plus single replicate files from the analysis of 35 BNE fractions generated for lysates P1 and P2, resulting in a total of 78 DIA-MS files. For the analysis of mouse cardiomyocytes, we chose a simplified library approach which included only non-fractionated measurements of one sample each from LVCM; RVCM and ACM. Each sample was analysed using both the short and long gradients described above, and with two technical injection replicates.
2.6.5 Statistical analysis of complexome profiling data sets
mCP software was developed in R version 3.6 (R Core Team, 2022). All searches for the detection of PPCs were performed in dynamic mode by the mCP R package with default settings including a fixed-filter value equal to 0.81, 185 Monte Carlo simulations, and Pearson correlation methods (default). The only exception was the analysis presented in Supplementary Figure S12, which was performed in de novo mode (dynamic = FALSE). PPCs were detected using a subset of the CORUM database (version 03/2022) filtered either for human (HEK293) or mouse (cardiomyocyte) entries, respectively. For the analysis of mouse cardiomyocytes, known mouse cardiac PPCs such as the calcium handling complexes (SERCA2A/RyR2/PLN), the SLMAP complex, and the IDH3G complex were added manually to generate a custom database, which is incorporated into the example data of the mCP R package. The molecular weight marker values used are indicated in Supplementary Material S2 (Supplementary Table S1).
2.7 Additional software tools
Cartoon illustrations were created based on graphical components sourced from BioRender.com (publication license number PG26PW3SEG).
3 Results
3.1 Development and benchmarking of the mCP workflow
3.1.1 mCP workflow design and development
We explicitly designed the mCP workflow to be compatible with freely available and affordable consumables, starting with commercially available BNE mini-gels, which do not require significant optimization and afford excellent reproducibility (Figure 1A; Supplementary Figures S8–S10). Following non-denaturing electrophoresis, gel lanes are cut into 35 equidistant slices using an in-house manufactured stainless steel comb (Schmidt and Urlaub, 2009), alkylated and digested with trypsin. In-gel digestion of proteins is readily parallelized [e.g., in 96-well format (Schmidt et al., 2013)], and directly produces MS-compatible peptide mixtures.
Figure 1. mCP workflow overview. (A) Schematic BN-PAGE separation of HEK293 lysates. M: Molecular weight marker. P1 and P2, biological replicates. Following fractionation to 35 slices per lane and in-gel digestion, slices were analyzed using 72 samples per day DIA-MS acquisition, and their protein area values as a function of fraction number. (B–D) Performance comparison of different DIA-MS software packages on the detection of precursors, peptides, and protein groups from P1 and P2. (E) Detection of protein complexes in these data sets using the mCP R package in dynamic mode. (n = 2 mass spectrometry replicates; error bars, 3x standard deviation).
The choice of 35 fractions was informed by a previously published meta-study of 206 co-fractionation mass spectrometry experiments (Skinnider and Foster, 2021). The authors showed that the recovery of known PPCs evens out at around 40 fractions, with only small incremental gains obtained by further increasing the fraction number. An additional benefit of limiting the number of fractions is provided by the corresponding reduction in protein input requirements. While high fraction workflows such as the one by Schulte et al. (2023) offer very high levels of detail, they also require significant amounts of starting material to yield at least low microgram protein amounts per fraction in order to support reproducible sample preparation and recovery. We aimed for a protein input amount ranging from 35 to 70 μg, which is consistent with the protein quantities recovered from human patient biopsies (e.g., Brandenburg et al., 2022) or single mouse organs. Assuming a roughly equal distribution of protein across the BNE, each of the 35 fractions generated in our approach will contain 1-2 µg of protein. We did not aim for a further reduction in fraction numbers because changes in the composition of PPCs are not necessarily reflected in their detectability, but in often subtle changes of their apparent molecular weight distribution patterns that may indicate changes in PPC stability and assembly states. A further reduction in fraction numbers would have reduced the fidelity of displaying these distribution patterns.
The tryptically digested gel slices were then analyzed using DIA-MS with short gradients (Fossati et al., 2021a; Bludau et al., 2023). The integration of CF-MS with DIA-MS has been demonstrated to produce highly reproducible and comprehensive complexome profiling data (Hay et al., 2023). The use of short gradients for DIA-MS acquisition leads to only a minor loss of information compared to longer gradient acquisitions, while also increasing throughput. We opted for a 20 min, or 72 samples per day, DIA-MS method, which allows for the analysis of two full BNE lanes per day and thus enables not only biological and technical replication but, more importantly, the comparison of different biological or disease states.
We tested the experimental workflow using 35 µg protein equivalents of HEK293 cell lysates. Following the workflow outlined above, we evaluated the performance of three widely used software packages for the analysis of the DIA-MS data, namely DIA-NN (Demichev et al., 2020), MaxQuant (Sinitcyn et al., 2021), and Spectronaut (Bruderer et al., 2015). The choice of processing software has been demonstrated to have a significant impact on the quality of DIA-MS experimental results (Fröhlich et al., 2022; Fangfei Zhang et al., 2023; Lou et al., 2023). We analyzed two biological replicates P1 and P2, each with two technical injection replicates (Supplementary Figure S8). All three software packages were employed for analysis without an a priori annotated MS/MS spectral library (so-called “directDIA”). In addition, we used a dedicated MS/MS spectral library for analysis by DIA-NN (“DIANN library”). The results are summarized in Figures 1B–D.
In summary, we detected an average of 6,304 protein groups in the data set, evidenced by an average of 70,440 unique peptides and 87,999 precursors, respectively. On the precursor and peptide levels, Spectronaut significantly outperformed the other software packages, particularly MaxQuant (Figures 1B, C). Consolidation of the precursors and peptides into quantified protein groups, however, showed more similar results, with DIA-NN in library-free mode showing the best results (Figure 1D). This difference may hint at differences in the stringency of the protein grouping algorithms in the software.
3.1.2 FDR-controlled processing of mCP data sets
We next set out to develop a bioinformatic tool for processing mCP data sets that enables automated detection of protein complexes under control of false discovery rates (FDR). Earlier CP-MS approaches mostly employed unsupervised approaches to data analysis such as hierarchical clustering (Giese et al., 2015), which, while powerful for comparative analysis, are subjective with regard to the parameters used, and lack FDR control. More recently, FDR-controlled CP-MS approaches have been developed that allow for an FDR control of PPC detections in CP-MS data sets by introducing an external “ground truth,” e.g., lists of PCCs obtained in comprehensive, well-curated PPC databases such as CORUM (Tsitsiridis et al., 2023) or String (Szklarczyk et al., 2023). Heusel et al. (2019) first published this approach and a corresponding computational tool named CCProfiler for the analysis of SEC-DIA-MS data sets. Instead of the peptide-centric approach to data analysis inferred by building on large-scale peptide detection, this rather shifts the focus to a complex-centric approach where evidence for PPCs is extracted in a targeted manner, and FDR can be stringently assessed using target/decoy approaches.
Here we present a novel statistical approach to detect PPCs in CP-MS data sets at a controlled FDR, and its implementation in an R package called mCP which is optimized for low fractionation approaches (here, 35 fractions) as discussed above (Figure 2). mCP takes the experimental matrix of protein quantitation values vs. fraction number as input, as well as a set of ground truth PPC compositions from a database such as Corum or ComplexPortal (Figure 2A). Following pre-filtering of the data by an experimentally calibrated Pearsons’s correlation filter (Figure 2B), the FDR is assessed by a Monte Carlo (MC) simulation target decoy approach (Figure 2C). Finally, detected PPCs are reported in list format and as heatmaps (Figure 2D), and supported by co-fractionation profile plots for further validation (see Supplementary Material S1, S3).
Figure 2. Schematic workflow of the mCP R package for PPC detection. The detection of PPCs is achieved in four stages: (A) Input of an experimental data matrix and a selected PPC database. (B) Pre-filtering of the data using empirically established thresholds and Monte Carlo (MC) simulations. (C) False Discovery Rate (FDR) evaluation of the resulting PPC candidates. (D) Output of graphical and table-based results formats.
We developed two modalities of search: a de novo search mode and a dynamic search mode. The de novo search calculates FDR under the assumption of independence in the Pearson’s coefficients (see Supplementary Material S1; Supplementary Section S4; Supplementary Material S3; Supplementary Figure S12). The dynamic mode, on the other hand, prioritizes individual correlation within the PPC; it is more permissive and recommended for annotated protein complexes searches using CORUM or Complex Portal. Here, mCP calculates Pearson’s correlation matrixes for each complex candidate present in both the experimental matrix and the ground truth database. Then it applies a filter based on an empirically calibrated Pearson’s correlation value threshold. On the resulting candidates, it counts the number of hits and the average value of the Pearson correlation higher than the first filter, which is used as a dynamic Pearson filter specific for each protein complex to detect PPCs using a Monte Carlo simulation. In this step, random inputs from the experimental matrix are used as decoy matrices to establish false positives. Using our HEK293 test set described above, we found that the results of the Monte Carlo simulation stabilized after 185 iterations. After establishing candidate and decoy PPCs, the results are truncated to satisfy an FDR limit set by the user. In the initial implementation of mCP, we observed over-assignment of protein complexes in the low apparent molecular weight (MW) regions of the BN-PAGE separated samples corresponding to apparent binary complexes consisting of low MW protein monomers. To address this, we implemented an additional “monomeric risk filter” in the mCP function (see Supplementary Material S3).
Depending on the DIA processing algorithm used, our mCP approach detected an average of 426 (270–586) protein complexes in the HEK293 data set against the CORUM database, all at an FDR of 5% and with standard settings (Pearson’s filter 0.81, 185 Monte Carlo simulations) (see Supplementary Material S3). Among the different DIA data analysis softwares, DIA-NN with or without the use of a spectral library outperformed both Spectronaut and MaxQuant when it came to the detection of PPCs in this data set (Figure 1E), with slightly better performance observed for the DIA-NN library option. While we cannot provide a detailed reason for this, we speculate that this may be a consequence of different stringencies in protein grouping, thus causing DIA-NN to present more protein complex constituent candidates. In spite of slightly lower performance compared to the library-free approach on the precursor, peptide, and protein group levels, DIA-NN processing using an experimental DIA library achieved the highest number of detected PPCs against the CORUM database (average 537 protein complexes, RSD 8.8%). We conclude that this approach is currently best suited for our workflow, which highlights the ongoing importance of experimental spectral libraries for select purposes of DIA-MS data analysis.
3.2 Benchmark analysis of HEK293 lysates
Using the optimized mCP workflow and parameters described above (Figure 2; DIA-NN library input, Pearson’s filter 0.81, Monte Carlo 185 simulations), we next examined the results obtained on our HEK293 data set in more detail and benchmarked its performance against existing computational approaches like CCprofiler (Heusel et al., 2019; Bludau et al., 2020). HEK293 cells present an extremely well-characterized cell line model, with a significant number of PPCs contained in publicly available databases such as CORUM, STRING, or others.
mCP detected an average of 504 ± 10 and 570 ± 22 protein complexes in biological duplicates P1 and P2, respectively, demonstrating good technical reproducibility between replicate samples (Figure 3A). We next applied the mCP algorithm to the HEK293 dataset published by Heusel et al. (2019) to test our computational approach on a higher degree fractionation (n = 81), SEC-based data set of a comparable sample. Here, the mCP algorithm detected 490 protein complexes against the CORUM database, which stands up well to the 509 complexes detected by the authors (Figure 3A). Moreover, mCP recovered 382/509 (75.0%) of the complexes detected with CCProfiler (Figure 3B). Even among these two experimentally diverse datasets (our data, BN-PAGE, n = 35 fractions; Heusel et al., SEC, n = 81) we observed good overlap, with the two mCP replicates P1 and P2 describing 392/509 (77.0%) of the complexes detected by CCprofiler (Figure 3B) (Bludau et al., 2020), underlining the robustness of our approach.
Figure 3. mCP workflow benchmarking for analysis of HEK293 cell lysates. (A) Protein complexes detected in HEK293 cell lysates in our data set (P1/P2, 35 fractions, 2 technical replicates) and in a reference dataset (81 fractions) (Heusel et al., 2019). Thick borders, analyzed with mCP; thin borders, analyzed with CCProfiler. (B) Venn diagram of complexes detected by the mCP R package using different data sets. mCP vs. CCprofiler datasets analyzed by mCP (left panel) or CCprofiler dataset analyzed by CCprofiler and mCP (right panel). (C,D) Elution profiles of SNX and VPS complexes exclusively detected by mCP in the 81 fraction reference data set.
Complexes detected exclusively by mCP in the 81 fraction reference data set include both complexes exhibiting wide, “hill-shaped” elution profiles like a three-protein SNX complex, a membrane-associated complex involved in ribonucleoprotein assembly (Figure 3C), as well as well-defined elution profiles like a VPS35-VPS29-VPS26A vacuolar sorting protein complex (Figure 3D). While we cannot assess the completeness or purity of these complexes based on our data, our computational approach is clearly able to recover protein complexes not detected by other approaches from the same data.
To test the reproducibility of our approach, we investigated PPC detection between the BN-PAGE replicates P1 and P2, as well as between the mass spectrometry replicates R1 and R2 for each of these (Figure 4). Numerical analysis showed a high overlap on both levels of replication (Figure 4A), with 67% of PPCs detected in all four replicates (Figure 4B). A heatmap analysis of the 1,097 proteins clustered into PPCs supports a high reproducibility of the workflow, with very similar clustering results observed both between BN-PAGE and mass spectrometry replicates (Figure 4C).
Figure 4. Reproducibility of PPC detection in HEK293 Cells. (A) Venn diagram of the overlap of PPC detections in BN-PAGE (P1, P2) and mass spectrometry (R1, R2) replicates. (B) Frequency histogram of PPC detections across the 2 × 2 replicates. x-axis, number of times a PPC was detected across the four datasets. (C) Heatmap showing the abundance of 1,097 proteins in the shared protein complexes detected in the HEK293 data set.
3.3 Analysis of individual mouse heart cavities
Next, we set out to test our mCP workflow and computational approach on an altogether different sample type, i.e., the analysis of cardiomyocytes obtained from different cavities of an individual adult wild-type mouse heart in the C57BL6N genetic background. Previous work from both our lab (Foo et al., 2024) and other groups (Rosca et al., 2008; Gómez et al., 2009; Hou et al., 2019) has shown PPCs and higher-order protein supercomplexes to play key roles in the heart, ranging from energy production (i.e., mitochondrial CI/III/IV supercomplexes) to Ca+2-handling (Alsina et al., 2019). While BN-PAGE has long been proposed as a tool to detect molecular defects in patients with disorders of oxidative phosphorylation (Van Coster et al., 2001), complexome profiling has so far been rarely applied either to the direct analysis of cardiac tissues, or to cardiomyocyte preparations of these. We thus investigated whether mCP can be applied to cardiac samples: specifically, cardiomyocytes isolated from different heart chambers (LV, RV, and atria) from a single mouse. In many regards, these samples are more challenging than HEK293 cell lysates: i) the limited protein amounts contained in cavities from individual animals preclude the use of native-like separations such as wide-bore SEC or wide-band BN-PAGE separations [e.g., as employed by Schulte et al. (2023)] combined with high fraction numbers, and ii) publicly available PPC databases are largely derived from the analysis of human cancer research cell lines, while organ-specific proteomes from other species such as the mouse heart are strongly underrepresented, e.g., CORUM 4.0 contains <1% of entries curated from tissue or cardiomyocyte samples (Foo et al., 2024).
Cardiomyocytes were isolated by tissue lysis and sedimentation from the cavities of a single mouse heart (left/right ventricle, LV/RV; pooled atria, PA) extracted from a 12 week-old wildtype (WT) male mouse. Again, proteins and protein complexes were fractionated by BNE and fractionated into 35 equidistant fractions (Supplementary Figures S9, S10), and all BNE slices were tryptically digested in-gel and analyzed by DIA-MS. The raw data were processed using DIA-NN with a library approach and evaluated using the mCP R package against the CORUM database as outlined above.
The results of the mCP analysis of cardiomyocytes from the different cavities are shown in Figure 5. Our exploratory analysis shows a total of 79 protein complexes found among all chambers, Here, only 34/79 (43%) of PPCs were detected in all cavities (Figures 5A, B), which can be rationalized by the underlying biological and proteomic differences between the cavities (Linscheid et al., 2021). The latter group comprises several well-described, high-abundance complexes such as the 20s proteasome, the immunoproteasome, the COP9 signalosome complex, and the CCT complex. Heatmap analysis of the 206 proteins present in these 79 complexes, however, showed much greater similarities between the left and right ventricle, than between the pooled atria (PA) and both of the ventricles (Figure 5C). This is in keeping with the proteomic similarities observed between cavities across several species (Linscheid et al., 2021).
Figure 5. Reproducibility of PPC detection in mouse heart cavities. (A) Venn diagram of the overlap of PPC detections in the left ventricle (LV), right ventricle (RV) and pooled atria (PA). (B) Frequency histogram of PPC detections across the cavities. x-axis, number of times a PPC was detected across the three datasets. (C) Heatmap showing the abundance of the 206 proteins in the shared protein complexes detected in the mouse heart cavities data set.
Interestingly, the mitochondrial protein complexes CI, CIII, and CIV, which are constituents of the CIx-CIIIx-CIVx respiratory chain supercomplex, showed markedly different migration profiles in the left and right ventricles (LV, RV) compared to the combined atria (Figure 6). In particular, CI protein profiles presented a down-shift from an experimental apparent MW of 3,285 kDa in the LV and RV, which agrees with an expected MW of a CI2-CIII2-CIV2 supercomplex (Chavez et al., 2018) of ca. 1,165 in mouse atria. Similarly, CIV proteins showed distributions to lower MW in the atrial samples, indicating a difference between respiratory chain supercomplex assembly states between the atria and ventricles in mouse, which to our best knowledge has not been documented so far [for analysis of whole mouse hearts see, e.g., (Guo et al., 2017; Zheng et al., 2023)]. Thus, we conclude that the mCP workflow here is sufficiently sensitive to perform analysis of cardiomyocytes from individual mouse heart cavities; and that it can visualize subtle compositional differences in protein complex assemblies through shifts in MW distributions.
Figure 6. Detection of mitochondrial respiratory chain supercomplexes in cardiomyocytes. (A) Schematic overview of the mitochondrial respiratory chain (super-)complexes. (B,E) Complex I profiles in the right ventricle (RV, left) and the pooled atria (PA, right) samples. (C,F) Complex IV profiles in RV and PA samples. (D,G) Complex III profiles in the RV and PA samples. The dashed line indicates the expected MW of the respiratory chain supercomplex CI2-CIII2-CIV2 at 3,285 kDa.
4 Discussion
Here we present mCP: a complete experimental workflow for rapid, global complexome analysis coupled with an R package that allows for the detection and FDR control of PPCs. The mCP workflow consists of a BN-PAGE separation in a commercial minigel format; fractionation into 35 fractions and in-gel digestion; rapid protein abundance profiling across fractions by DIA-MS; protein identification by DIA-NN software and finally, data analysis using the mCP R package.
mCP was designed to work on 35 fractions, a reduced number of compared to most complexome profiling workflows. This idea was initially supported by the work of Skinnider and Foster. (2021), who performed a comprehensive reanalysis of 206 published complexome profiling datasets and demonstrated that the recovery of known protein complexes stabilizes at around 40 fractions. Following this reasoning, reducing the number of fractions allows mCP to work with minimal requirements for the input material, as demonstrated by our analysis of cardiomyocyte preparations from individual mouse heart cavities, as well as minimizing the amount of mass spectrometry instrument time, which allows researchers to improve their experimental designs by way of replication or including additional biological controls and comparisons. Here, using a DIA-MS method with 20 min turnaround time, or 72 samples per day, allowed us to analyze two full replicates of each BN-PAGE lane in a single day.
With regard to protein identification and quantification softwares to perform the initial protein identification and quantitation, we observed the best results when using DIA-NN software with a spectral library approach, which resulted in the highest number of protein complexes detected in an initial HEK293 cell lysate benchmarking experiment. However, our downstream data analysis pipeline also showed competitive results with, e.g., Spectronaut, another widely available DIA-MS processing software, increasing researchers’ flexibility in adopting mCP for their needs.
The bioinformatic detection of protein complexes under false discovery control in these datasets is still challenging, not least because most target-decoy approaches to false discovery rate estimation are based on large search spaces and well-defined coelution profiles, which limits their effectiveness to data sets with high degrees of fractionation. Here, we propose a different approach by rather looking at whole chromatogram correlation matrices, calculating Pearson’s correlation, and then using Monte Carlo simulations to estimate false discovery (Figure 2).
The mCP bioinformatics approach has two modalities. The dynamic search and the de novo search. The dynamic search is deeply studied in this publication; it performs similarly to CCprofiler, while the de novo search looks for integral detection of protein complexes with a deep search testing algorithm that is a formal FDR calculation. A comparison is provided in Supplementary Material S3; Supplementary Figure S12. We see the application of this function in the context of de novo searches for the detection or validation of unknown PPC candidates, while the dynamic algorithm is strongly recommended for a search based on annotated protein complexes databases (CORUM or Complexportal). A great aspect of the mCP bioinformatics approach is the possibility of evaluating parallel datasets or parallel databases, as many as processing cores are available in the computer. This opens the possibility of validating inferred networks (Zahiri et al., 2020) or “blind testing” with a de novo search or parallel systems evaluation of known complexes.
In HEK293 cell lysate benchmarking data, we achieved similar performance to a previously published complexome profiling workflow and software, SEC-SWATH and CCprofiler (Figure 3) (Heusel et al., 2019). The major improvement observed with mCP analysis of both our data set and the dataset published by Heusel et al. is in the ability to detect protein complexes which do not show a highly defined, narrow MW range elution profile. As a whole chromatogram approach, mCP shows particularly improved performance for detecting broad, “hill-shaped” elution profiles that are associated with protein complexes lacking a uniformly defined stoichiometry (Figure 3C). We have begun to leverage this advantage for the analysis of very high MW, membrane-associated functional protein complexes that may exist as dynamic, non-covalent “functional rafts” rather than as defined stoichiometric complexes (Foo et al., 2024). While it may be impossible to call out defined complex stoichiometries under these circumstances, differential analysis may nonetheless allow us to detect biochemical changes in large-scale complexome comparisons.
A major inherent limitation of mCP as a targeted data analysis approach lies in its dependence on the availability of comprehensive protein complex databases such as CORUM (Tsitsiridis et al., 2023) or Complex Portal (Meldal et al., 2015). The scope of the database effectively constrains the detection of PPCs, as highlighted by the presence of 1,753 human, but only 694 mouse protein complexes in CORUM as of March 2022. A large part of proteome research is directed towards cancer research in human disease models and backgrounds; the situation for other organ or disease backgrounds may be much less favorable. Going forward, this may be addressed by community efforts to increase protein complex coverage in these scenarios. As another technical limitation, the mCP R package currently contains March 2022 versions of CORUM and Complex Portal. Automated database updates in our R package, for example, the CORUM database such as BioPlex R (Geistlinger et al., 2023) could not be made to work by us; therefore, an updated version of the database should be introduced by the user in our R package manually to do the search on the last database versions.
Apart from profile plots for each protein complex, mCP plots network heatmaps for each detected protein complex generated by the corrr package (Kuhn M et al., 2022). Network and heatmap plots are a good starting point to visualize the strength of interactions. We are currently evaluating the use of a more number-based approach to evaluate complex stability as well as compositional changes.
5 Conclusion
We developed a novel experimental and data processing workflow to streamline BN-PAGE-based complexome profiling analysis called mini-Complexome Profiling, or mCP. mCP consists of a rapid experimental workflow based on readily available and affordable resources, as well as a novel bioinformatics tool that is able to detect known protein complexes at controlled false discovery rates. The reduced material requirements open the possibility of applying this workflow to sparse or rare samples, i.e., patient biopsies or organoids, extending the use of complexome profiling to perform differential analysis of, e.g., multiple cellular states from a system biology perspective.
mCP is suitable for rapid screening of complexes to detect variations between different conditions. Using mCP, we detected different mitochondrial supercomplexes in RV vs. atrium. RV supercomplex stoichiometry correlates to a previous mouse supercomplex model (Chavez et al., 2018). Nevertheless, evaluation and comparison tools need to be developed to further detect alterations between datasets; there are already some other tools available (Meldal et al., 2015; Rizzetto et al., 2018). On a high number of fractions, we believe mCP could get further development to be able to identify unknown protein complexes.
Data availability statement
The data sets presented in this study can be found online in the following PRIDE repositories: https://www.ebi.ac.uk/pride/archive/, PXD049340, https://www.ebi.ac.uk/pride/archive/, PXD049426.
Ethics statement
Ethical approval was not required for work with commercially available established cell lines. The use of mice as a source of cardiac tissues is covered by the institutional protocols of the University Medical Center Göttingen (UMG), Robert-Koch-Strasse 40, 37075 Göttingen, Germany.
Author contributions
HA: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing–original draft, Writing–review and editing. NP: Formal Analysis, Methodology, Software, Writing–review and editing. BF: Data curation, Validation, Writing–review and editing. LN: Investigation, Methodology, Writing–review and editing. SL: Conceptualization, Funding acquisition, Resources, Supervision, Writing–review and editing. HU: Funding acquisition, Resources, Supervision, Writing–review and editing. CL: Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Resources, Supervision, Validation, Writing–original draft, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Deutsche Forschungsgemeinschaft (DFG) via Collaborative Research Centre 1002 “Modulatory Units in Heart Failure” (project number 193793266, A09 to SL and CL, INF to NP), and by Germany’s Excellence Strategy—EXC 2067/1-390729940. BF and HA received additional funding via a PLN Foundation Crazy Ideas Award (NL Heart Institution).
Acknowledgments
HA would like to thank Phillip Schad for his contribution to the conceptualization and development of the mCP R package, as well as Resul Elgin (Georg August University Göttingen) and Jannis Anstatt (Max Planck Institute for Multidisciplinary Sciences) for testing software functionality. Tobias Kohl, Mufassra Mushtaq and Birgit Schumann (University Medical Center Göttingen, UMG) provided valuable technical and administrative support for obtaining cardiac tissues, and Brigitte Korff supported the culture of HEK cells. CL would like to thank Jeanes Strebe (University Medical Center Göttingen) and Monika Raabe (Max Planck Institute for Multidisciplinary Sciences) for excellent technical assistance. Mass spectrometric analysis was supported by the UMG Core Facility Proteomics.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frans.2024.1425190/full#supplementary-material
SUPPLEMENTARY MATERIAL S1: | mCP Script.
SUPPLEMENTARY MATERIAL S2: | Molecular Weight Calibration.
SUPPLEMENTARY MATERIAL S3: | mCP Bioinformatic Approach.
References
Acín-Pérez, R., Fernández-Silva, P., Peleato, M. L., Pérez-Martos, A., and Enriquez, J. A. (2008). Respiratory active mitochondrial supercomplexes. Mol. Cell. 32, 529–539. doi:10.1016/j.molcel.2008.10.021
Alsina, K. M., Hulsurkar, M., Brandenburg, S., Kownatzki-Danger, D., Lenz, C., Urlaub, H., et al. (2019). Loss of protein phosphatase 1 regulatory subunit PPP1R3A promotes atrial fibrillation. Circulation 140, 681–693. doi:10.1161/CIRCULATIONAHA.119.039642
Atanassov, I., and Urlaub, H. (2013). Increased proteome coverage by combining PAGE and peptide isoelectric focusing: comparative study of gel-based separation approaches. Proteomics 13, 2947–2955. doi:10.1002/pmic.201300035
Barkovits, K., Pacharra, S., Pfeiffer, K., Steinbach, S., Eisenacher, M., Marcus, K., et al. (2020). Reproducibility, specificity and accuracy of relative quantification using spectral library-based data-independent acquisition. Mol. Cell. Proteomics 19, 181–197. doi:10.1074/mcp.RA119.001714
Bludau, I. (2021). Discovery- versus hypothesis- driven detection of protein-protein interactions and complexes. Int. J. Mol. Sci. 22, 4450. doi:10.3390/ijms22094450
Bludau, I., Heusel, M., Frank, M., Rosenberger, G., Hafen, R., Banaei-Esfahani, A., et al. (2020). Complex-centric proteome profiling by SEC-SWATH-MS for the parallel detection of hundreds of protein complexes. Nat. Protoc. 15, 2341–2386. doi:10.1038/s41596-020-0332-6
Bludau, I., Nicod, C., Martelli, C., Xue, P., Heusel, M., Fossati, A., et al. (2023). Rapid profiling of protein complex reorganization in perturbed systems. J. Proteome Res. 22, 1520–1536. doi:10.1021/acs.jproteome.3c00125
Brandenburg, S., Drews, L., Schönberger, H. L., Jacob, C. F., Paulke, N. J., Beuthner, B. E., et al. (2022). Direct proteomic and high-resolution microscopy biopsy analysis identifies distinct ventricular fates in severe aortic stenosis. J. Mol. Cell. Cardiol. 173, 1–15. doi:10.1016/j.yjmcc.2022.08.363
Bruderer, R., Bernhardt, O. M., Gandhi, T., Miladinović, S. M., Cheng, L. Y., Messner, S., et al. (2015). Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol. Cell. Proteomics 14, 1400–1410. doi:10.1074/mcp.M114.044305
Chavez, J. D., Lee, C. F., Caudal, A., Keller, A., Tian, R., and Bruce, J. E. (2018). Chemical cross-linking mass spectrometry analysis of protein conformations and supercomplexes in heart tissue. Cell. Syst. 6, 136–141.e5. doi:10.1016/j.cels.2017.10.017
Cornelius, F. (1991). Functional reconstitution of the sodium pump. Kinetics of exchange reactions performed by reconstituted Na/K-ATPase. Biochim. Biophys. Acta 1071, 19–66. doi:10.1016/0304-4157(91)90011-k
Demichev, V., Messner, C. B., Vernardis, S. I., Lilley, K. S., and Ralser, M. (2020). DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods 17, 41–44. doi:10.1038/s41592-019-0638-x
Dowell, J. A., Wright, L. J., Armstrong, E. A., and Denu, J. M. (2021). Benchmarking quantitative performance in label-free proteomics. ACS Omega 6, 2494–2504. doi:10.1021/acsomega.0c04030
Fangfei Zhang, A., Ge, W., Huang, L., Li, D., Liu, L., Dong, Z., et al. (2023). A comparative analysis of data analysis tools for data-independent acquisition mass spectrometry. Mol. Cell. Proteomics 22, 100623. doi:10.1016/j.mcpro.2023.100623
Foo, B., Amedei, H., Kaur, S., Jaawan, S., Boshnakovska, A., Gall, T., et al. (2024). Unbiased complexome profiling and global proteomics analysis reveals mitochondrial impairment and potential changes at the intercalated disk in presymptomatic R14Δ/+ mice hearts. bioRxiv 2024 (21), 586093. doi:10.1101/2024.03.21.586093
Fossati, A., Frommelt, F., Uliana, F., Martelli, C., Vizovisek, M., Gillet, L., et al. (2021a). System-wide profiling of protein complexes via size exclusion chromatography-mass spectrometry (SEC-MS). Methods Mol. Biol. 2259, 269–294. doi:10.1007/978-1-0716-1178-4_18
Fossati, A., Li, C., Uliana, F., Wendt, F., Frommelt, F., Sykacek, P., et al. (2021b). PCprophet: a framework for protein complex prediction and differential analysis using proteomic data. Nat. Methods 18, 520–527. doi:10.1038/s41592-021-01107-5
Fröhlich, K., Brombacher, E., Fahrner, M., Vogele, D., Kook, L., Pinter, N., et al. (2022). Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity. Nat. Commun. 13, 2622. doi:10.1038/s41467-022-30094-0
Frommelt, F., Fossati, A., Uliana, F., Wendt, F., Peng, X., Heusel, M., et al. (2024). DIP-MS: a novel ultra-deep interaction proteomics 1 for the deconvolution of protein complexes. Nat. Methods 26. doi:10.1038/s41592-024-02211-y
Geistlinger, L., Vargas, R., Lee, T., Pan, J., Huttlin, E. L., and Gentleman, R. (2023). BioPlexR and BioPlexPy: integrated data products for the analysis of human protein interactions. Bioinformatics 39, btad091. doi:10.1093/bioinformatics/btad091
Giese, H., Ackermann, J., Heide, H., Bleier, L., Dröse, S., Wittig, I., et al. (2015). NOVA: a software to analyze complexome profiling data. Bioinformatics 31, 440–441. doi:10.1093/bioinformatics/btu623
Gillet, L. C., Navarro, P., Tate, S., Röst, H., Selevsek, N., Reiter, L., et al. (2012). Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics 11, 016717. doi:10.1074/mcp.O111.016717
Gómez, L. A., Monette, J. S., Chavez, J. D., Maier, C. S., and Hagen, T. M. (2009). Supercomplexes of the mitochondrial electron transport chain decline in the aging rat heart. Arch. Biochem. Biophys. 490, 30–35. doi:10.1016/j.abb.2009.08.002
Guo, R., Zong, S., Wu, M., Gu, J., and Yang, M. (2017). Architecture of human mitochondrial respiratory megacomplex I2III2IV2. Cell. 170, 1247–1257. doi:10.1016/j.cell.2017.07.050
Havugimana, P. C., Hart, G. T., Nepusz, T., Yang, H., Turinsky, A. L., Li, Z., et al. (2012). A census of human soluble protein complexes. Cell. 150, 1068–1081. doi:10.1016/j.cell.2012.08.011
Hay, B. N., Akinlaja, M. O., Baker, T. C., Houfani, A. A., Stacey, R. G., and Foster, L. J. (2023). Integration of data-independent acquisition (DIA) with co-fractionation mass spectrometry (CF-MS) to enhance interactome mapping capabilities. Proteomics 23, e2200278. doi:10.1002/pmic.202200278
Heide, H., Bleier, L., Steger, M., Ackermann, J., Dröse, S., Schwamb, B., et al. (2012). Complexome profiling identifies TMEM126B as a component of the mitochondrial complex I assembly complex. Cell. Metab. 16, 538–549. doi:10.1016/j.cmet.2012.08.009
Heusel, M., Bludau, I., Rosenberger, G., Hafen, R., Frank, M., Banaei-Esfahani, A., et al. (2019). Complex-centric proteome profiling by SEC-SWATH-MS. Mol. Syst. Biol. 15, e8438. doi:10.15252/msb.20188438
Heusel, M., Frank, M., Köhler, M., Amon, S., Frommelt, F., Rosenberger, G., et al. (2020). A global screen for assembly state changes of the mitotic proteome by SEC-SWATH-MS. Cell. Syst. 10, 133–155. doi:10.1016/j.cels.2020.01.001
Hou, T., Zhang, R., Jian, C., Ding, W., Wang, Y., Ling, S., et al. (2019). NDUFAB1 confers cardioprotection by enhancing mitochondrial bioenergetics through coordination of respiratory complex and supercomplex assembly. Cell. Res. 29, 754–766. doi:10.1038/s41422-019-0208-x
Hu, L. Z. M., Goebels, F., Tan, J. H., Wolf, E., Kuzmanov, U., Wan, C., et al. (2019). EPIC: software toolkit for elution profile-based inference of protein complexes. Nat. Methods 16, 737–742. doi:10.1038/s41592-019-0461-4
Kuhn, M., Jackson, S., and Cimentada, J. (2022). Correlations in R. Available at: https://github.com/tidymodels/corrr (Accessed July 2, 24).
Le Maire, M., Champeil, P., and Möller, J. V. (2000). Interaction of membrane proteins and lipids with solubilizing detergents. Biochim. Biophys. Acta. 1508, 86–111. doi:10.1016/s0304-4157(00)00010-1
Linscheid, N., Santos, A., Poulsen, P. C., Mills, R. W., Calloe, K., Leurs, U., et al. (2021). Quantitative proteome comparison of human hearts with those of model organisms. PloS Biol. 19, e3001144. doi:10.1371/journal.pbio.3001144
Lou, R., Cao, Y., Li, S., Lang, X., Li, Y., Zhang, Y., et al. (2023). Benchmarking commonly used software suites and analysis workflows for DIA proteomics and phosphoproteomics. Nat. Commun. 14, 94. doi:10.1038/s41467-022-35740-1
MacLean, B., Tomazela, D. M., Shulman, N., Chambers, M., Finney, G. L., Frewen, B., et al. (2010). Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968. doi:10.1093/bioinformatics/btq054
Meier, F., Beck, S., Grassl, N., Lubeck, M., Park, M. A., Raether, O., et al. (2015). Parallel Accumulation−Serial fragmentation (PASEF): multiplying sequencing speed and sensitivity by synchronized scans in a trapped ion mobility device. J. Proteome Res. 10, 45. doi:10.1021/acs.jproteome.5b00932
Meier, F., Brunner, A. D., Frank, M., Ha, A., Bludau, I., Voytik, E., et al. (2020). diaPASEF: parallel accumulation-serial fragmentation combined with data-independent acquisition. Nat. Methods 17, 1229–1236. doi:10.1038/s41592-020-00998-0
Meldal, B. H. M., Forner-Martinez, O., Costanzo, M. C., Dana, J., Demeter, J., Dumousseau, M., et al. (2015). The complex portal - an encyclopaedia of macromolecular complexes. Nucleic Acids Res. 43, D479–D484. doi:10.1093/nar/gku975
Michalak, W., Tsiamis, V., Schwämmle, V., and Rogowska-Wrzesińska, A. (2019). ComplexBrowser: a tool for identification and quantification of protein complexes in large-scale proteomics datasets. Mol. Cell. Proteomics 18, 2324–2334. doi:10.1074/mcp.TIR119.001434
Nolte, H., and Langer, T. (2021). ComplexFinder: a software package for the analysis of native protein complex fractionation experiments. Biochim. Biophys. Acta Bioenerg. 1862, 148444. doi:10.1016/j.bbabio.2021.148444
Páleníková, P., Harbour, M. E., Ding, S., Fearnley, I. M., Van Haute, L., Rorbach, J., et al. (2021). Quantitative density gradient analysis by mass spectrometry (qDGMS) and complexome profiling analysis (ComPrAn) R package for the study of macromolecular complexes. Biochim. Biophys. Acta Bioenerg. 1862, 148399. doi:10.1016/j.bbabio.2021.148399
Perez-Riverol, Y., Bai, J., Bandla, C., García-Seisdedos, D., Hewapathirana, S., Kamatchinathan, S., et al. (2021). The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 50, D543–D552. doi:10.1093/nar/gkab1038
R Core Team (2022). A language and environment for statistical computing. R Found. 0 Stat. Comput. Available at: https://www.R-project.org/.
Rizzetto, S., Moyseos, P., Baldacci, B., Priami, C., and Csikász-Nagy, A. (2018). Context-dependent prediction of protein complexes by SiComPre. NPJ Syst. Biol. Appl. 4, 37. doi:10.1038/s41540-018-0073-0
Rosca, M. G., Vazquez, E. K., Kerner, J., Parland, W., Chandler, M. P., Stanles, W., et al. (2008). Cardiac mitochondria in heart failure: decrease in respirasomes and oxidative phosphorylation. Cardiovasc. Res. 80, 30–39. doi:10.1093/cvr/cvn184
Santos, H. L., Lamas, R. P., and Ciancaglini, P. (2002). Solubilization of Na,K-ATPase from rabbit kidney outer medulla using only C12E8. J. Med. Biol. Res. 35, 277–288. doi:10.1590/s0100-879x2002000300002
Schmidt, C., Hesse, D., Raabe, M., Urlaub, H., and Jahn, O. (2013). An automated in-gel digestion/iTRAQ-labeling workflow for robust quantification of gel-separated proteins. Proteomics 13, 1417–1422. doi:10.1002/pmic.201200366
Schmidt, C., and Urlaub, H. (2009). iTRAQ-labeling of in-gel digested proteins for relative quantification. Methods Mol. Biol. 564, 207–226. doi:10.1007/978-1-60761-157-8_12
Schtigger, H., Von, G., and Gustav-Embden, J. (1991). Blue native electrophoresis for isolation of membrane protein complexes in enzymatically active form. Anal. Biochem. 199, 223–231. doi:10.1016/0003-2697(91)90094-a
Schulte, U., den Brave, F., Haupt, A., Gupta, A., Song, J., Müller, C. S., et al. (2023). Mitochondrial complexome reveals quality-control pathways of protein import. Nature 614, 153–159. doi:10.1038/s41586-022-05641-w
Sinitcyn, P., Hamzeiy, H., Salinas Soto, F., Itzhak, D., McCarthy, F., Wichmann, C., et al. (2021). MaxDIA enables library-based and library-free data-independent acquisition proteomics. Nat. Biotechnol. 39, 1563–1573. doi:10.1038/s41587-021-00968-7
Skinnider, M. A., and Foster, L. J. (2021). Meta-analysis defines principles for the design and analysis of co-fractionation mass spectrometry experiments. Nat. Methods 18, 806–815. doi:10.1038/s41592-021-01194-4
Skowronek, P., Thielert, M., Voytik, E., Tanzer, M. C., Hansen, F. M., Willems, S., et al. (2022). Rapid and in-depth coverage of the (phospho-) proteome with deep libraries and optimal window design for dia-PASEF. Mol. Cell. Proteomics 21, 100279. doi:10.1016/j.mcpro.2022.100279
Soni, S., A Raaijmakers, A. J., Raaijmakers, L. M., Mirjam Damen, J. A., van Stuijvenberg, L., Vos, M. A., et al. (2016). A proteomics approach to identify new putative cardiac intercalated disk proteins. PLoS One 11, e0152231. doi:10.1371/journal.pone.0152231
Stacey, R. G., Skinnider, M. A., Scott, N. E., and Foster, L. J. (2017). A rapid and accurate approach for prediction of interactomes from co-elution data (PrInCE). BMC Bioinforma. 18, 457–514. doi:10.1186/s12859-017-1865-8
Szklarczyk, D., Kirsch, R., Koutrouli, M., Nastou, K., Mehryary, F., Hachilif, R., et al. (2023). The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 51, D638–D646. doi:10.1093/nar/gkac1000
Szyrwiel, L., Sinn, L., Ralser, M., and Demichev, V. (2022). Slice-PASEF: fragmenting all ions for maximum sensitivity in proteomics. bioRxiv 10 (31), 514544. doi:10.1101/2022.10.31.514544
Tsitsiridis, G., Steinkamp, R., Giurgiu, M., Brauner, B., Fobo, G., Frishman, G., et al. (2023). CORUM: the comprehensive resource of mammalian protein complexes-2022. Nucleic Acids Res. 51, D539–D545. doi:10.1093/nar/gkac1015
Uliana, F., Ciuffa, R., Mishra, R., Fossati, A., Frommelt, F., Keller, S., et al. (2023). Phosphorylation-linked complex profiling identifies assemblies required for Hippo signal integration. Mol. Syst. Biol. 19, e11024. doi:10.15252/msb.202211024
Van Coster, R., Smet, J., George, E., De Meirleir, L., Seneca, S., Van Hove, J., et al. (2001). Blue native polyacrylamide gel electrophoresis: a powerful tool in diagnosis of oxidative phosphorylation defects. Pediatr. Res. 50, 658–665. doi:10.1203/00006450-200111000-00020
Van Strien, J., Haupt, A., Schulte, U., Braun, H.-P., Cabrera-Orefice, A., Choudhary, J. S., et al. (2021). CEDAR, an online resource for the reporting and exploration of complexome profiling data. BBA-Bioenergetics 1862, 148411. doi:10.1016/j.bbabio.2021.148411
Wagner, E., Brandenburg, S., Kohl, T., and Lehnart, S. E. (2014). Analysis of tubular membrane networks in cardiac myocytes from atria and ventricles. J. Vis. Exp. 92, e51823. doi:10.3791/51823
Wittig, I., Braun, H.-P., and Schägger, H. (2006). Blue native PAGE. Nat. Protoc. 1, 418–428. doi:10.1038/nprot.2006.62
Yu, F., Teo, G. C., Kong, A. T., Li, G. X., Demichev, V., Nesvizhskii, A. I., et al. (2023). Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform. Nat. Commun. 14, 4154. doi:10.1038/s41467-023-39869-5
Zahiri, J., Emamjomeh, A., Bagheri, S., Ivazeh, A., Mahdevar, G., Sepasi Tehrani, H., et al. (2020). Protein complex prediction: a survey. Genomics 112, 174–183. doi:10.1016/j.ygeno.2019.01.011
Keywords: proteome, protein complex, protein interaction, complexome, targeted proteomics, blue native
Citation: Amedei H, Paul NB, Foo B, Neuenroth L, Lehnart SE, Urlaub H and Lenz C (2024) mini-Complexome Profiling (mCP), an FDR-controlled workflow for global targeted detection of protein complexes. Front. Anal. Sci. 4:1425190. doi: 10.3389/frans.2024.1425190
Received: 29 April 2024; Accepted: 16 July 2024;
Published: 01 August 2024.
Edited by:
Makusu Tsutsui, Osaka University, JapanReviewed by:
Yuan Liu, Columbia University, United StatesGuanghui Han, PTM Biolabs Inc., United States
Copyright © 2024 Amedei, Paul, Foo, Neuenroth, Lehnart, Urlaub and Lenz. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Christof Lenz, Y2hyaXN0b2YubGVuekBtZWQudW5pLWdvZXR0aW5nZW4uZGU=