Validity of self-assessment pubertal Tanner stages by realistic color images and Pubertal Development Scale in a longitudinal cohort study

Luo, Jie; Wu, Di; Tian, Yu; Wang, Yujie; Zhang, Qin; He, Zongwei; Wang, Hong; Liu, Qin

doi:10.3389/fped.2024.1380934

ORIGINAL RESEARCH article

Front. Pediatr., 16 July 2024

Sec. Children and Health

Volume 12 - 2024 | https://doi.org/10.3389/fped.2024.1380934

Validity of self-assessment pubertal Tanner stages by realistic color images and Pubertal Development Scale in a longitudinal cohort study

Jie Luo^1,†

Di Wu^1,2,†

Yu Tian¹

Yujie Wang¹

Qin Zhang¹

Zongwei He¹

Hong Wang¹

Qin Liu^1*

¹School of Public Health, Research Center for Medicine and Social Development, Chongqing Medical University, Chongqing, China
²College of Medical Informatics, Chongqing Medical University, Chongqing, China

Introduction: To date, the reliability of pubertal development self-assessment tools is questioned, and very few studies have explored the comparison between these tools in longitudinal studies. Hence, this study aimed to examine the reliability of pubertal development self-assessment using realistic color images (RCIs) and the Pubertal Development Scale (PDS) in a longitudinal cohort study.

Methods: Our longitudinal study recruited 1,429 participants (695 boys and 734 girls), aged 5.8–12.2 years old, in Chongqing, China. We conducted two surveys, 6 months apart. Tanner stages were examined by trained medical students at each visit. RCIs and PDS scores were used to self-assess puberty at each visit. Agreement between physical examination and self-assessment was determined using weighted kappa (wk), accuracy, and Kendall rank correlation.

Results: The concordance of puberty self-assessment using RCIs at baseline and the first follow-up was almost perfect in girls and boys, wk >0.800 (p < 0.001). At baseline, the concordance of genital development self-assessment using RCIs was fair in boys, wk = 0.285 (p < 0.001), and that of boys’ pubic hair development self-assessment using RCIs was poor, wk = 0.311 [95% confidence interval (CI) −0.157 to 0.818]. The wk of the PDS was less than 0.300, except for breast development. The reliability and validity of the PDS in this study population were low, and the consistency of the PDS was not good.

Conclusions: The concordance of RCIs is better than that of the PDS. Pubertal development self-assessment using RCIs is reliable, while the reliability and validity of the PDS are unacceptable. Therefore, RCIs are recommended as a reliable pubertal development self-assessment tool to measure pubertal development for large-scale epidemiological investigations and long-term longitudinal studies in China.

1 Introduction

Puberty is a significant period of physiological changes and sexual maturity, marking the transition from childhood to adolescence. Extensive evidence indicates that abnormal development during this period is a contributing factor to obesity, hypertension, insulin resistance, type 2 diabetes, cardiovascular disease, cancers including breast cancer, endometrial cancer, and ovarian cancer, and emotional and behavioral problems (1–7). Therefore, monitoring pubertal development is significant for health during both childhood and adulthood.

Pubertal development requires accurate puberty measurements to determine temporal changes in pubertal milestone events. However, the measurement of pubertal development varies widely across studies, and there are many challenges in obtaining data for the assessment of pubertal development. The current gold standard for measuring puberty is the Tanner Sexual Maturation Scale (SMS) (8, 9). The SMS classifies development into five levels based on breast and pubic hair development for girls and genital and pubic hair development for boys: these levels include Tanner stage 1 (prepubertal), stage 2 (early pubertal), stage 3 (midpubertal), stage 4 (late pubertal), and stage 5 (postpubertal). The SMS requires the child to undress and be examined by a professional. Despite the accuracy provided by professionals, this method might be impractical in non-clinical settings, especially in large-scale epidemiological studies, due to its labor-intensive, material, financial, time-consuming nature, making it challenging to implement. In addition, because of the sensitivity of pubertal development assessment, children and parents may be less likely to agree to the SMS compared to the self-assessment of pubertal development. As a result, many studies rely on pictures or line drawings of Tanner stages that ask children or their parents to self-assess the children's pubertal development. The Pubertal Development Scale (PDS) (10) is another widely used tool for self-assessment of pubertal development, designed to avoid the sensitivity issues of visual sexual maturation charts.

Although pubertal development self-assessment using pictures and the PDS are widely used, their reliability has been questioned. On the one hand, the results of a systematic review and meta-analysis (11) of 22 studies showed moderate or substantial agreement between clinician assessment and self-assessment using pictures. However, a US study (12) showed that self-assessment of Tanner stages using pictures is not a reliable method. On the other hand, the consistency between self-assessment by the PDS and physical examination (PE) remains controversial in different studies. Although several studies (13–17) indicated that the PDS could be used in large-scale epidemiological investigations of pubertal development, they reported varying levels of agreement—moderate agreement, fair agreement, and even slight agreement—between clinician assessment and self-assessment using the PDS. Hence, the reliability of pubertal development self-assessment tools is controversial, and the scientific accuracy and authenticity of the study results will be affected, although a large number of studies used pictures, especially the PDS, to self-assess pubertal development.

Moreover, few studies have compared the agreement between pictures and the PDS, which are both popular tools for self-assessment of pubertal development. Compared to pictures, the PDS is widely used, and it has been cited more than 2,500 times. While the reliability of the PDS is more controversial (16–18), we believe that the PDS and pubertal development pictures have not been compared in a single study. If the concordance of pictures is better than that of the PDS, pictures could potentially replace the PDS as the preferred self-assessment tool of pubertal development, ensuring scientific and authentic results. Therefore, studies comparing pictures with the PDS should be carried out, as this can assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool.

At present, global trends indicate that adolescent puberty is generally occurring earlier, which is a problem that needs significant attention (19, 20). Given that puberty is a process requiring longitudinal tracking and involves psychological sensitivity of children during this period (21, 22), pubertal development examination by SMS may increase the psychological burden on adolescents. Furthermore, cross-sectional studies cannot cover the processes of pubertal development, such as pubertal trajectory and tempo, which are important for health services for children and adolescents. Assessing pubertal development every 6 months provides a more precise method for tracking pubertal changes and is widely conducted in studies such as the Danish National Birth Cohort (23). With little known about the longitudinal reliability of pubertal development self-assessment tools, more evidence is needed to longitudinally track the reliability of the self-assessment tools.

To date, the reliability of pubertal development self-assessment tools is questioned, and few previous studies have explored the comparison between pictures and the PDS in longitudinal studies. Hence, this study included the most widely used self-assessment tools to verify whether pictures with Tanner stages and the PDS can be used in cohorts through a longitudinal study in Chongqing, China.

2 Methods

2.1 Participants

The study participants were from an ongoing longitudinal cohort aimed at exploring adolescents’ pubertal development and its influencing factors and health implications. During the enrollment period since May 2014, 1,237 participants (542 girls and 695 boys) were enrolled from grades 1–4 in primary schools in an urban district of Chongqing, China. After 6 months, we added a further 192 girls at the first follow-up and thus officially recruited 1,429 participants (734 girls and 695 boys) for this study, aged 5.8–12.2 years old; informed consent was obtained from their parents and themselves. JLP District in Chongqing is a pilot district for urban and rural planning, and its demographic characteristics and economic status are similar to the characteristics of the main urban area of Chongqing. Therefore, this study adopted purpose-based sampling in the district and selected four primary schools in the district of Chongqing according to the differences in their geographical locations. A detailed description of this cohort study has been reported previously (24). The study was approved by the Medical Ethics Review Committee of Chongqing Medical University.

At baseline and the first follow-up, girls were examined for puberty outcomes, including breast and pubic hair development, while boys were examined for genital and pubic hair development by trained medical students. In addition, the Tanner stage was self-assessed by realistic color images (RCIs) and the PDS. The age, body weight, and height of the children were measured at baseline and the first follow-up. Father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income were surveyed at the first follow-up.

2.2 Physical examination

Pubertal development was assessed for boys and girls by trained medical school graduates during physical examinations according to Tanner stages (25). Before each investigation, every investigator received standard training from medical professors. In addition, each child's pubertal development was measured independently by two investigators for quality assurance. Tanner stages were determined as stage 1 (prepubertal), stage 2 (early pubertal), stage 3 (midpubertal), stage 4 (late pubertal), or stage 5 (postpubertal). PEs assessed pubertal development stages, including genital stages (G1–G5) for boys, breast stages (B1–B5) for girls, and pubic hair stages (P1–P5) for both genders (Supplementary Table S1).

2.3 Realistic color images

After completing the PDS and before undergoing the physical examination, puberty ratings were self-assessed using RCIs according to Tanner stages, as proposed by Carel and Leger in the New England Journal of Medicine in 2008 (26). Each child was presented with a gender-specific pubertal self-assessment questionnaire using realistic color images. After a short explanation of each stage of puberty by trained medical school graduates, the participants would read the questions along with the pictures and select the self-perceived stage that best represented their pubertal development by comparing their situation with the illustrations. Girls were asked to choose the most appropriate breast and pubic hair Tanner stage. Boys were asked to choose the most appropriate genital and pubic hair Tanner stage.

2.4 Pubertal Development Scale

Pubertal development was self-reported using the PDS before PE and RCIs. The PDS was developed to provide a continuous score of pubertal development and classified into a five-level category representing stages from prepubertal to postpubertal without physical examination or visually depicted sexual maturation stages (10). The PDS consists of five items: breast development and menstruation in girls; voice changes and facial hair growth in boys; and body hair growth, growth spurt, and skin changes in both sexes. Response options are as follows: not yet started (1), barely started (2), definitely started (3), and seems complete (4). In addition, girls’ menarche is a binary variable: no (0) or yes (1). The PDS scores can be used to classify individuals into five pubertal categories according to algorithms developed by Petersen et al. (1988) (10) and described by Carskadon and Acebo (27)—prepubertal, early pubertal, midpubertal, late pubertal, and postpubertal (in alignment with the original Tanner categories). The content of the PDS and the algorithm are provided in Table 1.

Table 1

Table 1 Composition of the PDS and the computation of the PDS using the algorithm.

2.5 Statistical analysis

The weighted kappa (wk) statistic was performed using R4.2.2, and other statistical analyses were performed using SPSS 25. McDonald's omega (28) coefficients were used to measure the internal consistency and scale reliability of the PDS. The wk statistic and accuracy were used to evaluate the agreement between PE and RCIs and between PE and the PDS. The strength of agreement criteria for wk is as follows: <0.00, poor; 0.00–0.20, slight; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; >0.8, almost perfect (29). Agreement between PE and the RCI or the PDS was calculated using Crosstabs, and percent (%) accuracy calculated the precise agreement between the three measurements of pubertal development. The Kendall rank correlation was also used to measure the strength of the correlations between the three measurements of pubertal development. The Kendall rank correlation coefficient ranges between −1 (perfect inverse relationship between two variables) and +1 (perfect direct relationship), with zero indicating a lack of relationship. The t-test and chi-squared test were used to analyze the influencing factors of PDS concordance.

3 Results

3.1 Characteristics of study participants

At baseline, a total of 1,234 participants, including 539 girls and 695 boys, were recruited for this study. At the first follow-up visit, 1,429 participants, including 734 girls and 695 boys, had completed the survey. Age, BMI, body weight, body height, father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income of the study participants are given in Table 2.

Table 2

Table 2 Characteristics of study participants.

3.2 Comparison of PE and self-assessment by RCIs

Figure 1 shows the concordance of PE and self-assessment by RCIs for breast and pubic hair development in girls and genital and pubic hair development in boys.

Figure 1

Figure 1 Concordance of PE and RCI.

For girls, there was almost perfect concordance between PE and RCIs. At baseline, the wk (weighted kappa coefficient) was 0.860 (p < 0.001) for breast development and 0.887 (p < 0.001) for pubic hair development. At the first follow-up visit, the wk was 0.961 (p < 0.001) for breast development and 0.950 (p < 0.001) for pubic hair development.

For boys, there was almost perfect concordance between PE and RCIs at the first follow-up visit and fair concordance in genital development at baseline. At baseline, the wk was 0.285 (p < 0.001) for genital development and 0.331 (95% CI −0.157 to 0.818) for pubic hair development. At the first follow-up visit, the wk was 0.968 (p < 0.001) for genital development and 0.915 (p < 0.001) for pubic hair development.

The accuracy, underestimation, overestimation, and Kendall rank correlation coefficient between PE and self-assessment by RCIs for breast/genital development and pubic hair development are presented in Supplementary Table S2.

3.3 Comparison of PE and self-reporting by the PDS

The McDonald's omega coefficient for all PDS items was 0.426 in girls and 0.105 for boys.

For breast development in girls, the concordance between PE and self-reporting by the PDS was almost perfect at baseline (wk = 0.827, p < 0.001) and the first follow-up (wk = 0.891, p < 0.001). For pubic hair development in girls, there was slight concordance at baseline (wk = 0.104, p < 0.001) and the first follow-up (wk = 0.164, p < 0.001) (Figure 2).

Figure 2

Figure 2 Concordance of PE and PDS.

For genital development comparison in boys, there was slight concordance (wk = 0.024, p < 0.05) at baseline and fair concordance (wk = 0.242, p < 0.001) at the first follow-up. For pubic hair development, there was poor concordance (wk = 0.001, p > 0.05) at baseline and slight concordance (wk = 0.081, p < 0.001) at the first follow-up (Figure 2).

The accuracy, underestimation, overestimation, and Kendall rank correlation coefficient between PE and self-reporting by the PDS for breast/genital development and pubic hair development are presented in Supplementary Table S3.

3.4 Concordance between PE and PDS in age-stratified analysis

Age, BMI, father’s education, mother’s education, parental marital status, the number of left-behind children, and average monthly household income were included as influence factors in the analysis of the concordance between PE and the PDS. We found that age was the only factor affecting the poor concordance of the PDS in both genders (Supplementary Tables S4 and S5). The age of girls was categorized as <8 and ≥8 years old. The age of the boys was categorized as <9 and ≥9 years old (30). Ages classified as <8 years had slight concordance in girls for breast development, while ages classified as ≥8 years had perfect concordance. Ages classified as <8 and ≥8 years had slight concordance in girls for pubic hair development (Figure 3 and Supplementary Table S6). Ages classified as <9 and ≥9 years had slight concordance in boys for genital and pubic hair development (Figure 3 and Supplementary Table S7).

Figure 3

Figure 3 Concordance of PE and PDS in age-stratified analysis.

4 Discussion

To our knowledge, this is the first study to discuss the longitudinal reliability of pubertal development self-assessment using RCIs and the PDS and to compare the concordance of RCIs with the PDS through a longitudinal study. The current study has several key findings. First, our study suggests that self-assessment by RCIs for puberty development is a reliable and valid tool in longitudinal analysis. Second, the reliability and validity of the PDS were low, and the consistency of the PDS was not good. Third, the reliability of self-assessment by RCIs for puberty development is better than that by the PDS. Finally, the only factor affecting the poor concordance of the PDS in both sexes was age. These findings might assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool.

Our results demonstrate that RCIs are a reliable and valid pubertal development self-assessment tool. Like our results, studies from Thailand (31), Iran (32), and China (33, 34) have shown that such images can be used as a reliable tool to assess pubertal development in both sexes. Of these four studies, two included line drawings with brief explanatory text and two included colored images of Tanner stages; the colored images showed better concordance. Therefore, RCIs could be a better alternative pubertal assessment tool for extensive epidemiological research, (34) and might avoid the tension resulting from SMS examinations to improve the consent rate and reduce the dropout rate in cohorts.

Different from previous studies, we analyzed the dynamic changes in RCI self-rated reliability through longitudinal rather than cross-sectional data. Our longitudinal analysis suggested that it is more difficult for boys in early puberty to self-assess pubertal onset than girls and that boys tend to overestimate their genital development. Similarly, a Japanese study (35) showed that self-assessment of the onset of breast development using illustrations of Tanner stages in full color was consistent with assessment by PE in girls but not in boys. Moreover, a Danish study reported that boys overestimated their pubertal stage when using illustrations of pubertal stages (36). Yet, we noticed that the consistency of boys’ pubertal self-assessment using RCIs at the first follow-up visit was almost perfect. This might be because we explained puberty knowledge during our follow-up and boys became more knowledgeable about pubertal development, making the RCIs more reliable. We will continue to explore how sexual and reproductive health education enhances adolescents’ pubertal health and epidemiology studies into puberty. Therefore, for large-scale epidemiological studies requiring follow-up over many years, RCIs are a reliable alternative tool when SMS examinations are not feasible, saving manpower, budget, and time.

In addition to RCIs, we conducted the PDS reliability analysis using our cohort. Notably, our study suggests that the reliability and validity of the PDS are very low. Even so, the concordance of breast development self-assessment by the PDS was only almost perfect at baseline and the first follow-up visit, which might be because breast development is a representative indicator of pubertal development in girls (20). Studies from Bond et al. (15) and Pompeia et al. (14) reported that self-assessment of pubertal development using the PDS is useful when physical examination is not possible and images are unavailable. Also, the concordance of the PDS was between fair and moderate in the two studies. Such a conclusion could be challenged because the concordance of the PDS was not almost perfect or substantial. Another study (17) reported that the PDS was reliable and generally tracks with Tanner stages. However, it was only reliable when combining pubertal scores into three stages of development, and the original self-assessed Tanner stages by the PDS were fairly consistent. This means that the PDS is not applicable for measuring important indicators of pubertal development, such as pubertal onset and tempo, and cannot help doctors, healthcare providers, epidemiologists, and other researchers to accurately measure the pubertal development process. Hence, it is not known whether the concordance of the PDS is acceptable, and even if it is acceptable, the evaluation criteria are not uniform.

A study from Norris and Richter (18) did not recommend the PDS as a reliable self-assessment tool for pubertal development in South Africa, which is identical to our results. Another Chinese study (16) showed that Cronbach's alpha of the PDS was 0.80 for girls and 0.66 for boys in the Chinese version of the PDS. Although the study reported that the PDS was reliable, its reliability in boys was low. Also, there was only moderate concordance in self-rated pubertal development using the PDS. The translation and applicability of the PDS among Chinese children should be further investigated, combining our results of low reliability and validity and the influencing factors of PDS concordance. Currently, the PDS is widely used in large-scale epidemiological investigations (37–40). However, many studies have failed to find that the exact correspondence to Tanner stages between the PDS and PE is reliable (13, 17, 18). Therefore, the reliability of the PDS needs further study. Based on the results, our next step is to try out health education interventions and revise or develop a Pubertal Development Scale suitable for the local population.

5 Strengths and limitations

Our study had several strengths. First, we used a longitudinal study to assess the dynamic changes in the reliability of self-assessment tools for pubertal development. At present, most studies on the reliability of self-assessment tools for pubertal development are cross-sectional studies, which do not prove their applicability to long-term longitudinal studies. Second, we compared the agreement and reliability of self-assessment tools with the PDS. This comparison can help doctors, healthcare providers, epidemiologists, and other researchers decide a more appropriate pubertal development self-assessment tool. Third, current studies demonstrating good reliability of pubertal self-assessment tools predominantly involve white/Caucasian populations. There is limited research, including our study in China, on the reliability and validity of the PDS is relatively low, providing potential research directions in the region.

Several limitations should be acknowledged. First, although our study explored the reliability of pubertal self-assessment stools longitudinally, we conducted only two surveys that did not cover the whole of puberty, which limits the inferences of our results. Second, since each item can have an aggregate effect with the other items of the PDS and the Cronbach's alpha coefficient is very low, McDonald's omega was used to evaluate the internal consistency reliability of the PDS. However, the results showed that the reliability and validity of the PDS were very poor, which made the consistent results of the PDS in our study population unreliable. The possible reasons include the following: (1) Chinese children have relatively little understanding of pubertal development; (2) most of the children in this study are in the early stage of pubertal development (mainly stages 1, 2, and 3), with very few children in stages 4 and 5. In the next step of our research, we will conduct intervention studies on health education and revise or retranslate the Chinese version of the PDS.

6 Conclusions

The reliability of pubertal development self-assessment using RCIs is acceptable, while the reliability and validity of the PDS are low. This finding can assist doctors, healthcare providers, epidemiologists, and other researchers in deciding a more appropriate pubertal development self-assessment tool. Consequently, RCIs, rather than the PDS, are recommended as a reliable pubertal development self-assessment tool to measure pubertal development for large-scale epidemiological investigations and long-term longitudinal studies in China.

Data availability statement

The datasets presented in this article are not readily available because the data are stored and used among Chongqing Medical University research teams, and we are working on data sharing policies and a website for collaborators' data access. The datasets used or analyzed during the current study are available from the corresponding author on reasonable request. Requests to access the datasets should be directed to QL,bGl1cWluQGNxbXUuZWR1LmNu.

Ethics statement

The studies involving humans were approved by the Medical Ethics Review Committee of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

JL: Formal Analysis, Investigation, Writing – original draft, Writing – review & editing, Conceptualization, Data curation. DW: Writing – original draft, Writing – review & editing, Data curation, Formal Analysis, Investigation. YT: Data curation, Investigation, Writing – review & editing. YW: Data curation, Investigation, Writing – review & editing. QZ: Data curation, Investigation, Writing – review & editing. ZH: Data curation, Investigation, Writing – review & editing. HW: Conceptualization, Writing – review & editing. QL: Conceptualization, Funding acquisition, Investigation, Project administration, Supervision, Writing – review & editing.

Funding

The authors declare financial support was received for the research, authorship, and/or publication of this article.

This research was supported by the National Science Fund Project (Grant No. 81973067), the National Youth Science Fund Project (Grant No. 81502825), the Venture & Innovation Support Program for Chongqing Overseas Returnees, China (Grant No. cx2018105), the Basic and Frontier Research Project of Chongqing Science and Technology Commission (Grant No. cstc2013jcyjA10001), and the Chongqing Medical University Future Medical Innovation Team Support Program (Grant No. W0054).

Acknowledgments

The authors thank all the children and their parents for their participation and the teachers in four primary schools and eight middle schools for their kind support. The authors also thank all the students who participated in the data collection.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fped.2024.1380934/full#supplementary-material

Abbreviations

RCIs, realistic color image; PDS, Pubertal Development Scale; PE, physical examination; SMS, Tanner Sexual Maturation Scale.

References

1. Anderson GM, Hill JW, Kaiser UB, Navarro VM, Ong KK, Perry JRB, et al. Metabolic control of puberty: 60 years in the footsteps of Kennedy and Mitra’s seminal work. Nat Rev Endocrinol. (2024) 20(2):111–23. doi: 10.1038/s41574-023-00919-z

PubMed Abstract | Crossref Full Text | Google Scholar

2. Guo JZ, Wu QJ, Liu FH, Gao C, Gong TT, Li G. Review of Mendelian randomization studies on endometrial cancer. Front Endocrinol (Lausanne). (2022) 13:783150. doi: 10.3389/fendo.2022.783150

PubMed Abstract | Crossref Full Text | Google Scholar

3. Shamshirian A, Heydari K, Shams Z, Aref AR, Shamshirian D, Tamtaji OR, et al. Breast cancer risk factors in Iran: a systematic review & meta-analysis. Horm Mol Biol Clin Investig. (2020) 41(4):20200021. doi: 10.1515/hmbci-2020-0021

PubMed Abstract | Crossref Full Text | Google Scholar

4. Ren Y, Zou H, Zhang D, Han C, Hu D. Relationship between age at menarche and risk of glucose metabolism disorder: a systematic review and dose-response meta-analysis. Menopause. (2020) 27(7):818–26. doi: 10.1097/GME.0000000000001529

PubMed Abstract | Crossref Full Text | Google Scholar

5. Mendle J, Ryan RM, McKone K. Early menarche and internalizing and externalizing in adulthood: explaining the persistence of effects. J Adolesc Health. (2019) 65(5):599–606. doi: 10.1016/j.jadohealth.2019.06.004

PubMed Abstract | Crossref Full Text | Google Scholar

6. Zhang Z, Hu X, Yang C, Chen X. Early age at menarche is associated with insulin resistance: a systemic review and meta-analysis. Postgrad Med. (2019) 131(2):144–50. doi: 10.1080/00325481.2019.1559429

PubMed Abstract | Crossref Full Text | Google Scholar

7. Bubach S, De Mola CL, Hardy R, Dreyfus J, Santos AC, Horta BL. Early menarche and blood pressure in adulthood: systematic review and meta-analysis. J Public Health (Oxf). (2018) 40(3):476–84. doi: 10.1093/pubmed/fdx118

PubMed Abstract | Crossref Full Text | Google Scholar

8. Marshall WA, Tanner JM. Variations in the pattern of pubertal changes in boys. Arch Dis Child. (1970) 45(239):13–23. doi: 10.1136/adc.45.239.13

PubMed Abstract | Crossref Full Text | Google Scholar

9. Marshall WA, Tanner JM. Variations in pattern of pubertal changes in girls. Arch Dis Child. (1969) 44(235):291–303. doi: 10.1136/adc.44.235.291

PubMed Abstract | Crossref Full Text | Google Scholar

10. Petersen AC, Crockett L, Richards M, Boxer A. A self-report measure of pubertal status: reliability, validity, and initial norms. J Youth Adolesc. (1988) 17(2):117–33. doi: 10.1007/BF01537962

PubMed Abstract | Crossref Full Text | Google Scholar

11. Campisi SC, Marchand JD, Siddiqui FJ, Islam M, Bhutta ZA, Palmert MR. Can we rely on adolescents to self-assess puberty stage? A systematic review and meta-analysis. J Clin Endocrinol Metab. (2020) 105(8):2846–56. doi: 10.1210/clinem/dgaa135

PubMed Abstract | Crossref Full Text | Google Scholar

12. Desmangles JC, Lappe JM, Lipaczewski G, Haynatzki G. Accuracy of pubertal Tanner staging self-reporting. J Pediatr Endocrinol Metab. (2006) 19(3):213–21. doi: 10.1515/JPEM.2006.19.3.213

PubMed Abstract | Crossref Full Text | Google Scholar

13. Campisi SC, Humayun KN, Wasan Y, Soofi S, Islam M, Lou W, et al. Self-assessed puberty is reliable in a low-income setting in rural Pakistan. J Pediatr Endocrinol Metab. (2020) 33(9):1191–6. doi: 10.1515/jpem-2020-0246

PubMed Abstract | Crossref Full Text | Google Scholar

14. Pompéia S, Zanini GAV, Freitas RS, Inacio LMC, Silva FCD, Souza GR, et al. Adapted version of the Pubertal Development Scale for use in Brazil. Rev Saude Publica. (2019) 53:56. doi: 10.11606/s1518-8787.2019053000915

PubMed Abstract | Crossref Full Text | Google Scholar

15. Bond L, Clements J, Bertalli N, Evans-Whipp T, McMorris BJ, Patton GC, et al. A comparison of self-reported puberty using the Pubertal Development Scale and the Sexual Maturation Scale in a school-based epidemiologic survey. J Adolescence. (2006) 29(5):709–20. doi: 10.1016/j.adolescence.2005.10.001

PubMed Abstract | Crossref Full Text | Google Scholar

16. Chan NP, Sung RY, Nelson EA, So HK, Tse YK, Kong AP. Measurement of pubertal status with a Chinese self-report Pubertal Development Scale. Matern Child Health J. (2010) 14(3):466–73. doi: 10.1007/s10995-009-0481-2

PubMed Abstract | Crossref Full Text | Google Scholar

17. Koopman-Verhoeff ME, Gredvig-Ardito C, Barker DH, Saletin JM, Carskadon MA. Classifying pubertal development using child and parent report: comparing the Pubertal Development Scales to Tanner staging. J Adolesc Health. (2020) 66(5):597–602. doi: 10.1016/j.jadohealth.2019.11.308

PubMed Abstract | Crossref Full Text | Google Scholar

18. Norris SA, Richter LM. Are there short cuts to pubertal assessments? Self-reported and assessed group differences in pubertal development in African adolescents. J Adolesc Health. (2008) 42(3):259–65. doi: 10.1016/j.jadohealth.2007.08.009

PubMed Abstract | Crossref Full Text | Google Scholar

19. Curtis VA, Allen DB. Male pubertal timing—boys will be men, but when? JAMA Pediatr. (2019) 173(9):819–20. doi: 10.1001/jamapediatrics.2019.2306

PubMed Abstract | Crossref Full Text | Google Scholar

20. Eckert-Lind C, Busch AS, Petersen JH, Biro FM, Butler G, Bräuner EV, et al. Worldwide secular trends in age at pubertal onset assessed by breast development among girls: a systematic review and meta-analysis. JAMA Pediatr. (2020) 174(4):e195881. doi: 10.1001/jamapediatrics.2019.5881

PubMed Abstract | Crossref Full Text | Google Scholar

21. Lunddorf LLH, Ramlau-Hansen CH, Arendt LH, Patton GC, Sawyer SM, Dashti SG, et al. Characteristics of puberty in a population-based sample of Danish adolescents. J Adolesc Health. (2024) 74(4):657–64. doi: 10.1016/j.jadohealth.2023.10.005

PubMed Abstract | Crossref Full Text | Google Scholar

22. Cheng HL, Harris SR, Sritharan M, Behan MJ, Medlow SD, Steinbeck KS. The tempo of puberty and its relationship to adolescent health and well-being: a systematic review. Acta Paediatr. (2020) 109(5):900–13. doi: 10.1111/apa.15092

PubMed Abstract | Crossref Full Text | Google Scholar

23. Thomsen AML, Ramlau-Hansen CH, Olsen J, Brix N, Andersen AN, Lunddorf LLH, et al. The influence of parental age on timing of puberty: a study in the Danish national birth cohort. Scand J Public Health. (2022) 50(5):629–37. doi: 10.1177/14034948211019794

PubMed Abstract | Crossref Full Text | Google Scholar

24. Xi X, Wu D, Wu W, Zhou Y, Zhang Q, Wang Y, et al. The influence of the trajectory of obesity indicators on the age of pubertal onset and pubertal tempo in girls: a longitudinal study in Chongqing, China. Front Public Health. (2023) 11:1025778. doi: 10.3389/fpubh.2023.1025778

PubMed Abstract | Crossref Full Text | Google Scholar

25. Tanner JM. Normal growth and techniques of growth assessment. Clin Endocrinol Metab. (1986) 15(3):411–51. doi: 10.1016/S0300-595X(86)80005-6

PubMed Abstract | Crossref Full Text | Google Scholar

26. Carel JC, Leger J. Clinical practice. Precocious puberty. N Engl J Med. (2008) 358(22):2366–77. doi: 10.1056/NEJMcp0800459

PubMed Abstract | Crossref Full Text | Google Scholar

27. Carskadon MA, Acebo C. A self-administered rating scale for pubertal development. J Adolesc Health. (1993) 14(3):190–5. doi: 10.1016/1054-139X(93)90004-9

PubMed Abstract | Crossref Full Text | Google Scholar

28. Malkewitz CP, Schwall P, Meesters C, Hardt J. Estimating reliability: a comparison of Cronbach’s α, McDonald’s ωt and the greatest lower bound. Soc Sci Hum Open. (2023) 7(1):100368. doi: 10.1016/j.ssaho.2022.100368

Crossref Full Text | Google Scholar

29. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. (1977) 33(1):159–74. doi: 10.2307/2529310

PubMed Abstract | Crossref Full Text | Google Scholar

30. Lin S, Yang W, Yu J. Guideline for clinical diagnosis and treatment of pediatrics of traditional Chinese medicine sexual precocity (amendment). J Pediatr Tradit Chin Med. (2016) 12(03):1–5. doi: 10.16840/j.issn1673-4297.2016.03.01

Crossref Full Text | Google Scholar

31. Jaruratanasirikul S, Kreetapirom P, Tassanakijpanich N, Sriplung H. Reliability of pubertal maturation self-assessment in a school-based survey. J Pediatr Endocrinol Metab. (2015) 28(3–4):367–74. doi: 10.1515/jpem-2014-0053

PubMed Abstract | Crossref Full Text | Google Scholar

32. Rabbani A, Noorian S, Fallah JS, Setoudeh A, Sayarifard F, Abbasi F. Reliability of pubertal self assessment method: an Iranian study. Iran J Pediatr. (2013) 23(3):327–32.23795257

PubMed Abstract | Google Scholar

33. Chan NP, Sung RY, Kong AP, Goggins WB, So HK, Nelson EA. Reliability of pubertal self-assessment in Hong Kong Chinese children. J Paediatr Child Health. (2008) 44(6):353–8. doi: 10.1111/j.1440-1754.2008.01311.x

PubMed Abstract | Crossref Full Text | Google Scholar

34. Sun Y, Tao FB, Su PY. Self-assessment of pubertal Tanner stage by realistic colour images in representative Chinese obese and non-obese children and adolescents. Acta Paediatr. (2012) 101(4):e163–6. doi: 10.1111/j.1651-2227.2011.02568.x

PubMed Abstract | Crossref Full Text | Google Scholar

35. Saito-Abe M, Nishizato M, Yamamoto-Hanada K, Yang L, Fukami M, Ito Y, et al. Comparison of physician- and self-assessed pubertal onset in Japanese children. Front Pediatr. (2023) 11:950541. doi: 10.3389/fped.2023.950541

PubMed Abstract | Crossref Full Text | Google Scholar

36. Rasmussen AR, Wohlfahrt-Veje C, Tefre de Renzy-Martin K, Hagen CP, Tinggaard J, Mouritsen A, et al. Validity of self-assessment of pubertal maturation. Pediatrics. (2015) 135(1):86–93. doi: 10.1542/peds.2014-0793

PubMed Abstract | Crossref Full Text | Google Scholar

37. Li R, Lopez DA, Gupta M, Palermo TM. Pubertal development and pain incidence and characteristics in children: a 1-year prospective cohort study of a national sample. Pain. (2023) 164(12):2725–36. doi: 10.1097/j.pain.0000000000002969

PubMed Abstract | Crossref Full Text | Google Scholar

38. Sehovic E, Zellers SM, Youssef MK, Heikkinen A, Kaprio J, Ollikainen M. DNA methylation sites in early adulthood characterised by pubertal timing and development: a twin study. Clin Epigenetics. (2023) 15(1):181. doi: 10.1186/s13148-023-01594-7

PubMed Abstract | Crossref Full Text | Google Scholar

39. Castiello F, Suárez B, Beneito A, Lopez-Espinosa MJ, Santa-Marina L, Lertxundi A, et al. Childhood exposure to non-persistent pesticides and pubertal development in Spanish girls and boys: evidence from the INMA (environment and childhood) cohort. Environ Pollut. (2023) 316(Pt 2):120571. doi: 10.1016/j.envpol.2022.120571

PubMed Abstract | Crossref Full Text | Google Scholar

40. Torvik FA, Flatø M, McAdams TA, Colman I, Silventoinen K, Stoltenberg C. Early puberty is associated with higher academic achievement in boys and girls and partially explains academic sex differences. J Adolesc Health. (2021) 69(3):503–10. doi: 10.1016/j.jadohealth.2021.02.001

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: self-assessment, Tanner stages, pubertal, reliability, validity

Citation: Luo J, Wu D, Tian Y, Wang Y, Zhang Q, He Z, Wang H and Liu Q (2024) Validity of self-assessment pubertal Tanner stages by realistic color images and Pubertal Development Scale in a longitudinal cohort study. Front. Pediatr. 12:1380934. doi: 10.3389/fped.2024.1380934

Received: 2 February 2024; Accepted: 21 June 2024;
Published: 16 July 2024.

Edited by:

Tim S. Nawrot, University of Hasselt, Belgium

Reviewed by:

Elpis Hatziagorou, Aristotle University of Thessaloniki, Greece
Paul Anthony Camacho Lopez, Clínica FOSCAL, Colombia

© 2024 Luo, Wu, Tian, Wang, Zhang, He, Wang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qin Liu, bGl1cWluQGNxbXUuZWR1LmNu

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Validity of self-assessment pubertal Tanner stages by realistic color images and Pubertal Development Scale in a longitudinal cohort study

1 Introduction

2 Methods

2.1 Participants

2.2 Physical examination

2.3 Realistic color images

2.4 Pubertal Development Scale

2.5 Statistical analysis

3 Results

3.1 Characteristics of study participants

3.2 Comparison of PE and self-assessment by RCIs

3.3 Comparison of PE and self-reporting by the PDS

3.4 Concordance between PE and PDS in age-stratified analysis

4 Discussion

5 Strengths and limitations

6 Conclusions

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher's note

Supplementary material

Abbreviations

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good