AUTHOR=Ling Jin , Zheng Luxia , Xu Mingming , Chen Gang , Wang Xiao , Mao Danzhuo , Shao Hong TITLE=Extreme Point Sort Transformation Combined With a Long Short-Term Memory Network Algorithm for the Raman-Based Identification of Therapeutic Monoclonal Antibodies JOURNAL=Frontiers in Chemistry VOLUME=10 YEAR=2022 URL=https://www.frontiersin.org/journals/chemistry/articles/10.3389/fchem.2022.887960 DOI=10.3389/fchem.2022.887960 ISSN=2296-2646 ABSTRACT=

Therapeutic monoclonal antibodies (mAbs) are a new generation of protein-based medicines that are usually expensive and thus represent a target for counterfeiters. In the present study, a method based on Raman spectroscopy that combined extreme point sort transformation with a long short-term memory (LSTM) network algorithm was presented for the identification of therapeutic mAbs. A total of 15 therapeutic mAbs were used in this study. An in-house Raman spectrum dataset for model training was created with 1,350 spectra. The characteristic region of the Raman spectrum was reduced in dimension and then transformed through an extreme point sort transformation into a sequence array, which was fitted for the LSTM network. The characteristic array was extracted from the sequence array using a well-trained LSTM network and then compared with standard spectra for identification. To demonstrate whether the present algorithm was better, ThermoFisher OMNIC 8.3 software (Thermo Fisher Scientific Inc., U.S.) with two matching modes was selected for comparison. Finally, the present method was successfully applied to identify 30 samples, including 15 therapeutic mAbs and 15 other injections. The characteristic region was selected from 100 to 1800 cm−1 of the full spectrum. The optimized dimensional values were set from 35 to 53, and the threshold value range was from 0.97 to 0.99 for 15 therapeutic mAbs. The results of the robustness test indicated that the present method had good robustness against spectral peak drift, random noise and fluorescence interference from the measurement. The areas under the curve (AUC) values of the present method that were analysed on the full spectrum and analysed on the characteristic region by the OMNIC 8.3 software’s built-in method were 1.000, 0.678, and 0.613, respectively. The similarity scores for 15 therapeutic mAbs using OMNIC 8.3 software in all groups compared with that of the relative present algorithm group had extremely remarkable differences (p < 0.001). The results suggested that the extreme point sort transformation combined with the LSTM network algorithm enabled the characteristic extraction of the therapeutic mAb Raman spectrum. The present method is a proposed solution to rapidly identify therapeutic mAbs.