- 1China Basketball College, Beijing Sport University, Beijing, China
- 2Athletic Performance College, Shanghai University of Sport, Shanghai, China
- 3College of Physical Education, Dalian University, Dalian, China
This study aimed to (1) use the clustering method to build a classification model based on the play-type data of basketball players, to classify native and foreign players into different offensive roles; (2) use the clustered offensive role model to investigate how different offensive roles influence team performance. The sample was drawn from 20 teams spanning five seasons (2017–2021) in the Chinese Basketball Association, comprising 823 native and 228 foreign players. The clustering results obtained fourteen offensive roles for native players and five for foreign players. Subsequent analyses revealed that the offensive roles of two native player clusters, namely N6 Spot-up Wings who Attack (OR = 3.281, p < 0.05) and N13 Bigs who Cut to the Rim (OR = 4.272, p < 0.05), significantly influenced team performance. Conversely, no significant impact was observed for foreign players. The findings of this study offer novel insights into player dynamics and offer coaches a fresh perspective on team composition.
1. Introduction
The primary objective of sports performance analysis is to enhance the comprehension of sports and consequently guide decision-making processes for individuals aiming to improve their athletic performance (O’donoghue, 2009). Moreover, the dynamic and intricate nature of sports requires a careful analysis through quantitative measurement techniques (Hughes and Franks, 2007). In earlier years, the book ‘Moneyball’ discussed the use of advanced statistical analysis to evaluate baseball players’ performance and worth. This challenged established scouting strategies that typically depend on subjective judgment and outdated data metrics (Lewis, 2004). Previous research has primarily concentrated on retrospective static analyses utilizing historical data and statistical methods to evaluate the performance of individual players or teams (Gomez-Ruano et al., 2020). For instance, distinguishing between winning and losing teams (Gómez et al., 2006; Çene, 2018), elucidating the significance of contributions made by starters and substitutes during games (Gòmez et al., 2009), analyzing variances in physiological indicators among players in distinct positions (Teramoto and Cross, 2018; Zarić et al., 2020), investigating gender disparities (Gómez et al., 2013), and conducting cross-sectional comparisons of performance data across different leagues (Sampaio et al., 2006).
However, over the past decade, researchers have increasingly adopted dynamic and intricate analysis techniques, especially in developing performance models (Gomez-Ruano et al., 2020). These advanced techniques have proven valuable in processing and analyzing the wealth of available sports data, facilitating the construction of more accurate and reliable performance models (Haghighat et al., 2013). For instance, decision trees have been utilized by researchers to identify patterns in player behavior and performance (Willoughby and Kostuk, 2005; Morgan et al., 2013). Logistic regression has been employed to model the probability of specific game outcomes or player actions (Zdravevski and Kulakov, 2009; Igiri and Nwachukwu, 2014). Support vector machines (SVM) have been used for tasks such as player classification or predicting player attributes (Mustafa et al., 2017; Pai et al., 2017). Random forests have shown promise in capturing complex relationships within sports data and making accurate predictions (Lin et al., 2014; Knoll and Stübinger, 2020). In addition to machine learning models, deep learning approaches, like neural networks, have gained popularity in sports analytics due to their ability to automatically learn and extract intricate patterns from large datasets (Haghighat et al., 2013). These models have demonstrated success in tasks such as player trajectory prediction and play sequence analysis (Kahn, 2003; Ivanković et al., 2010). Moreover, Ranking models, such as Elo Rating, TrueSkill scores, and PageRank algorithms, have been commonly used to assess player or team performance relative to their peers (Barrow et al., 2013; Ibstedt et al., 2019; Kovalchik, 2020). Furthermore, sports research has extensively explored the application of both quantitative and qualitative models. Rule-based models offer interpretability and have been employed to derive decision-making strategies in various sports contexts (Zdravevski and Kulakov, 2009; Trawiński, 2010). Time series models are often used to capture temporal dependencies in sports data, enabling trend analysis and forecasting player performance over time (Yong et al., 2020). Clustering models have been utilized to group players or teams based on similar characteristics, facilitating player profiling and performance comparison (Patel, 2017; Mateus et al., 2020). This combination of analytic techniques, encompassing both quantitative and qualitative variables, provides the foundation of data and information. Importantly, the synergy between these two types of data enables researchers to understand and predict athlete performance more comprehensively, offering valuable insight into the multifaceted drivers of sports performance (O’Donoghue, 2014).
Assessing the critical performance indicators of players and teams is an equally important aspect of sports analysis (Sarlis and Tjortjis, 2020). Previously, basketball data expert Dean Oliver presented a pioneering understanding of data analysis (Oliver, 2004), prompting scholars to subsequently develop diverse metrics for evaluating the game from multiple viewpoints (Cooper et al., 2009; Metulini and Gnecco, 2022). Given that the intricacy of match performance stems from the unique contextual setting in which each match takes place, situational elements like location, opponent, and stage of play exert behavioral influences on the performance of athletes and sports teams (McGarry et al., 2013). Consequently, researchers must pay meticulous attention to the impact of contextual variables when constructing statistical models and investigating critical competition indicators within diverse competitive contexts and conditions, thereby establishing a standardized profile of technical performance during competitions (Hughes et al., 2001; Zhang et al., 2017). Certain studies have consolidated performance indicators across various sports, encompassing game, tactical, technical, and biomechanical indicators on a macro level (Hughes and Bartlett, 2002). Basketball scholars have refined these indicators by introducing Team Performance Indicators (TPI, Yu et al., 2008), which have been instrumental in investigating different facets of basketball.
Basketball is a team sport where success primarily hinges on integrating players with complementary skills (Katzenbach and Smith, 2015). The study of team composition has consistently regarded player positions as a crucial factor, traditionally determined by height and weight considerations (Deman et al., 2001). In particular, taller or more robust players are often stationed near the basket, while smaller players typically exhibit more significant activity on the perimeter (Latin et al., 1994; Ostojic et al., 2006). The conventional approach categorizes players into five positions: Point Guard, Shooting Guard, Small Forward, Power Forward, and Center (Trninić et al., 1999). However, as players develop physically and technically, they can assume multiple roles on the court. Consequently, the traditional method tends to oversimplify the skill sets of high-level players, resulting in suboptimal utilization of team resources (Cheng, 2017). Scholars commonly employ cluster analysis in unsupervised learning to study player positions. Examples encompass hierarchical clustering (Whitehead, 2017a), topology (Alagappan, 2012), K-means and model-based clustering (Kalman and Bosch, 2020), and Gaussian mixture clustering (Wang et al., 2022). Additionally, discriminant analysis (DAs) has demonstrated its suitability for predicting player positions (Pion et al., 2018). However, the studies mentioned above primarily rely on basic statistics, such as box scores, which have inherent limitations (Shea and Baker, 2013). For instance, high scoring can be attributed to a player’s shooting technique or the ability of teammates to create favorable shooting opportunities. Similarly, the frequency of defensive rebounds can be influenced by rebounding technique or the number of missed shots by opponents (Remmert, 2003; Christmann et al., 2018). Certain researchers have investigated player behavior by considering interactive information, such as passing and pick-and-roll actions, consistently demonstrating that cooperative play between players yields superior results compared to individual actions (Courel-Ibáñez et al., 2013; Gómez et al., 2015; Marmarinos et al., 2016). Consequently, the study of interactive information data can illuminate the technical and tactical aspects and is more favorable than traditional statistics for analyzing and evaluating athletic performance, as well as assisting coaches in decision-making (Christmann et al., 2018).
Synergy, a sports data science company, offers play-type data and game footage access. They also provide advanced visualization tools for coaches and analysts to analyze opponents’ playing styles, monitor player development over time, and gain insights into player and team performance (Božović, 2021; Fu and Stasko, 2022). There are 11 standardized play types in offense, such as spot-up, transition, isolation, off-screen, and so on (Božović, 2021). Basketball offensive tactics are aimed at creating optimal shot selection within a 24-s period. The interplay of gaining separation from the defender(s) (e.g., to shoot or receive a pass) and gaining proximity (for a screen, handoff, cut, or drive) is characteristic of the basketball game (Christmann et al., 2018). These patterns or actions are designed by the coaches, heavily practiced by the players, and then repeated in the game. Understanding how players and teams create successful scoring opportunities is, therefore, critical both practically and theoretically (Matulaitis and Bietkis, 2021). For example, poor pick-and-roll use is recognized as a significant cause of team failure (Vaquera et al., 2013); the most effective finishing moves in the Euroleague are isolation, pick-and-roll, spot-up, and cut (Matulaitis and Bietkis, 2021), while the finishing actions that contribute the least to scoring are off-screen and hand-off (Zukolo et al., 2019). The applicability of these data presents an excellent tool for detailed analysis and creating scout reports (Božović, 2021). Scholars have utilized this data to investigate team styles, player roles, and the influence of various game types on team performance (McBasketball, 2017; Božović, 2021), thereby providing basketball statisticians with a novel perspective for player observation. It is worth noting that player play type is a relatively understudied topic, with minimal research focused on it. Moreover, such studies are particularly scarce for high-level Asian leagues. Therefore, building on prior research, we postulate that classifying players’ play-type data will reveal contrasting offensive roles between native and foreign players, which, in turn, will have distinct impacts on team performance.
Given the above considerations, the aims of this study are as follows: (1) use the clustering method to build a classification model based on the play-type data of basketball players, to classify native and foreign players into different offensive roles; (2) use the clustered offensive role model to investigate how different offensive roles influence team performance.
2. Materials and methods
2.1. Sample and variables
The research data was obtained from the sports data website Synergy,1 and we gained the license to use play-type related data and video. Table 1 presents the category of 11 play types in basketball, these indicators are considered standardized and cover all scoring attempts in the game of basketball, and they were able to translate the team’s tactical decisions into countable data (McBasketball, 2017; Božović, 2021). Play-type data was collected for players from 20 teams during five CBA seasons, from 2017 to 2022, encompassing a total of 4,475 games including regular season, playoff qualifying rounds, and playoffs. The raw data consisted of 1,454 native players and 265 foreign players. To ensure the dependability of the clustering and subsequent analysis, players who had less than 3 % of the team’s total scoring attempts were excluded. Consequently, our final sample included 823 native players (n = 823) and 228 foreign players (n = 228).
2.2. Statistical analysis
The initial step involved K-means clustering in categorizing native and foreign players into distinct groups based on their play type data (Zhang et al., 2016; Patel, 2017). The elbow method was employed to determine the optimal number of clusters for the two groups. The clustering results were assessed using the silhouette coefficient (Zhou and Gao, 2014), which ranges between-1 and 1. A higher value closer to 1 indicates superior clustering results, while a negative value indicates poor results. After the clustering results were obtained, the dataset was depersonalized using the t-SNE method, and the results were then visualized (Van der Maaten and Hinton, 2008; Patel, 2017). The process above was implemented using the scikit-learn machine learning framework in Python.
In the second step, non-parametric tests were employed to examine the potential differences in the number of distinct offensive roles between teams that reached the semi-finals and those that did not. Effect sizes (ES) were calculated using Cohen’s d, and their interpretation followed the established criteria: 0.10 denoted a small effect, 0.30 indicated a medium effect, and 0.50 marked a large effect (Volker, 2006). Additionally, we constructed a logistic regression model to investigate the impact of each offensive role on the team’s performance. The number of distinct offensive roles within the team served as the independent variable, while whether the team reached the semi-finals was the dependent variable. Based on their rankings in the season, all teams were categorized as either non-semi-finalists or semi-finalists (code: 0, 1). The threshold for determining statistical significance was set at α = 0.05. The above analyses were conducted using IBM SPSS Statistics for Mac, version 24.0 (Armonk, NY: IBM Corp.).
2.3. Reliability and validity of data
To establish the validity of the play type data obtained from Synergy, a random selection of five games per season was observed by three analysts with over 5 years of experience in basketball video analysis. This observed data was then combined with the information provided by Synergy to conduct an intra-group correlation analysis using the Two-Way Mixed-Effects Model (Koo and Li, 2016), ultimately showing a high degree of consistency (ICC = 0.99). The local research institutional review board formally approved all procedures.
3. Results
3.1. Players offensive role definition
Before clustering, the “elbow method” is employed to determine the optimal number of clusters. The primary concept behind this method involves identifying the inflection point where the sum of squared errors (SSE) within a cluster decline significantly. This inflection point represents the optimal number of clusters (Zhang et al., 2016; Cui, 2020). According to Figure 1, the optimal number of clusters for native players is three, while for foreign players, it is two.
Figure 1. The elbow method determines the optimal number of clusters for a player’s offensive role. Subfigure (A) demonstrates a declining inflection point in SSE for native players when the number of clusters is three; for foreign players, Subfigure (B) demonstrates a declining inflection point in SSE when the number of clusters is two.
Based on the play type percentages depicted in Figure 2, the players can be macroscopically categorized as ball-handlers (C1, C4), wings (C0), and big men (C2, C3) (Whitehead, 2017a; Diambra, 2018). The specific definitions of these three types of roles are as follows:
Ball-Handlers: These roles have a high percentage of pick-and-roll ball-handlers and transition.
Wings: These roles have a high percentage of spot-up and transition.
Big men: These roles have a high percentage of cut, pick-and-roll man, post-up, and putbacks.
Nevertheless, in the context of basketball development, limiting the definition to only three roles deviates from the sport’s current developmental trend (Bianchi et al., 2017; Whitehead, 2017a). Consequently, we used K-means clustering to generate fourteen offensive roles (N0-N13) for native players and five (F0-F4) for foreign players. According to Figures 3, 4, the silhouette coefficient calculated for clustering native players is 0.20, indicating a moderate level of both cohesion and separation among the clusters. Put simply, the clusters formed for native players exhibit a certain level of internal similarity while maintaining reasonable distinction from each other. On the other hand, the silhouette coefficient for clustering foreign players is 0.24, indicating an enhanced clustering quality for this group. The clusters formed for foreign players demonstrate improved internal homogeneity and clearer boundaries between them (Zhou and Gao, 2014).
Figure 3. Visualization of clustering results for native players. Subplot (A) shows that the silhouette coefficients of all clusters exceed the mean value. Subfigure (B) transforms eleven play-type variables into two dimensions by the t-SNE dimensionality reduction technique, and the numbers in Subfigure (B) correspond to the Cluster Label in Subplot (A).
Figure 4. Visualization of clustering results for foreign players. Subplot (A) shows that the silhouette coefficients of all clusters exceed the mean value. Subfigure (B) transforms eleven play-type variables into two dimensions by the t-SNE dimensionality reduction technique, and the numbers in Subfigure (B) correspond to the Cluster Label in Subplot (A).
Figure 5 displays the distribution of player clusters based on play type percentages. Similar to other studies that have analyzed player positions and roles using clustering methods, we identify players by observing how they are represented in different types of play data (Lutz, 2012; Wang et al., 2022). The subsequent descriptions outline the offensive roles associated with each offensive role:
N0, “Primary Ball Handlers”: This group consists of native players with the highest percentage of pick-and-roll ball handlers.
N1, “Mobile Bigs who Cut”: This category comprises big men with exceptional mobility and a cutting style of scoring.
N2, “Impact Wings”: These players exhibit the highest percentage of participation in transition offenses.
N3, “Stationary Spot-up Wings”: This offensive role demonstrates the highest percentage of spot-up plays among all players.
N4, “Stationary Tall Wings”: These players exhibit a slightly lower percentage of spot-ups than N3 players but a higher percentage of roll-man plays and putbacks.
N5, “Ball Handlers who Share the Load”: This offensive role demonstrates a slightly lower percentage of pick-and-roll ball-handler than the N0 players.
N6, “Spot-up Wings who Attack”: These players possess a high spot-up percentage but primarily rely on cuts, roll-man, and putbacks to score.
N7, “Bigs who Roll to the Rim”: These players play a significant role in the pick-and-roll offense, frequently rolling to the basket to score after setting a screen.
N8, “Mobile Bigs who Post-up at the Rim”: These players exhibit the highest post-up percentage among native players and actively contribute to pick-and-roll plays and cuts for scoring opportunities.
N9, “Spot-up Wings who Transition”: This offensive role predominantly engages in spot-up and transition plays, while other play types are less prevalent.
N10, “Tall Wings who Pick-and-pop”: These players are characterized by a high spot-up and roll-man percentage, indicating their inclination toward catch-and-shoot plays following a pick.
N11, “Wings who Work Around Screen”: This offensive role excels at scoring by utilizing screens set by teammates on the court, leading to a high off-screen and handoff percentage.
N12, “Ball-handler with Off-ball Duties”: This offensive role exhibits a lower pick-and-roll ball-handler rate than N0 and N5 players but demonstrates a higher spot-up percentage.
N13, “Bigs who Cut to the Rim”: These players exhibit the highest percentage of cuts and putbacks while displaying the lowest percentage of spot-up plays.
F0, “Bigs with Skills Everywhere”: These skilled bigs employ a diverse array of plays to attempt scoring, including posting up inside, spotting up outside, and executing cuts or rolls to the basket.
F1, “Domineering Ball Handlers”: These players possess the highest percentage of pick-and-roll ball handlers and exhibit much isolation plays.
F2, “Wings who Initiate Offense”: These players demonstrate a particular percentage of pick-and-roll ball-handling and isolation plays. Despite not being the primary ball handlers on the court, they can initiate attacks with the ball.
F3, “Ball Stopping Mobile Bigs”: This offensive role is distinguished by a high percentage of spot-ups and isolation plays, often resulting in individual finishes for possessions.
F4, “Bigs with Skills in the Post”: This offensive role demonstrates the highest post-up percentage, indicating a greater emphasis on low-post offense.
3.2. Analysis of the influence of offensive roles on team performance
We initially examined if there existed differences in the diversity of offensive roles within the teams. Non-parametric test results (Table 2) indicate that categories N10 Tall Wings who Pick-and-pop (p < 0.01, ES = 0.84), N13 Bigs who Cut to the Rim (p < 0.05, ES = 0.62), and F4 Bigs with Skills in the Post (p < 0.05, ES = 0.67) exhibited significant disparities among the teams that advanced to the semi-finals and those that did not.
Further, we constructed a logistic regression model using the number of offensive roles and the team’s performance. Table 3 displays the logistic regression model’s indicators. For native players, significantly positive effects were observed for N6 Spot-up Wings who Attack (OR = 3.281, p < 0.05) and N13 Bigs who Cut to the Rim (OR = 4.272, p < 0.05). The offensive roles of foreign players did not significantly impact the team’s quality.
4. Discussion
The aim of this study was to (1) use the clustering method to build a classification model based on the play-type data of basketball players, to classify native and foreign players into different offensive roles; (2) employ the classified offensive role model to examine the impact of diverse offensive roles on team performance.
Performance modeling is a valuable tool with diverse applications, including predicting game results (Zdravevski and Kulakov, 2009), identifying tactical strategies (Tian et al., 2019), aiding in player draft and recruitment processes (Berri et al., 2011), and evaluating player and team performance (Terner and Franks, 2021). Several studies have utilized this approach to categorize players’ positions and roles and forecast their performance (Rangel et al., 2019; Wang et al., 2022). However, comparative studies between native and foreign players have rarely employed similar approaches. Most studies have consistently demonstrated the superior capabilities of foreign players compared to native players, suggesting their need for a more significant on-court role (Guimarãe et al., 2018; Gasperi et al., 2020). Performance modeling helps mitigate these biases and preconceptions, enabling a fair and data-driven assessment of players from different backgrounds. This is especially crucial for foreign players, as their style and skill sets may differ from native players due to cultural, training, and developmental factors (Ozmen, 2012). Nevertheless, it is essential to recognize the limitations of performance modeling. The reliability of predictions can be influenced by the quality, completeness, and accuracy of the data on which the models rely (Fu and Stasko, 2022). Additionally, basketball is a complex and dynamic game influenced by various situational factors, teamwork, and coaching strategies, which the models may not fully capture. Caution should be exercised by coaches and researchers when interpreting and relying solely on predictions generated by performance modeling, acknowledging the critical role of human judgment and expertise in the decision-making process (Remmert, 2003; Lamas et al., 2015).
To accurately reflect the contemporary dynamics of the sport, it is necessary to reassess the categorization of player roles in modern basketball. Conventional classifications, including point guard, shooting guard, small forward, power forward, and center, no longer encompass versatility and blurred positional boundaries (Latin et al., 1994; Ostojic et al., 2006). Contemporary players possess a broader spectrum of skills and physical conditioning, allowing them to assume multiple roles on the court. This transformation is propelled by the emphasis on versatility, floor spacing, and the ability to create mismatches (Bianchi et al., 2017; Whitehead, 2017b). In their study of the Brazilian professional basketball league, Rangel et al. noted that versatility players showed an upward trend over nine seasons and presented a higher frequency at the power forward and point guard positions (Rangel et al., 2019). Trninić et al. proposed multidimensional criteria for the various positions and roles of the game of basketball, and their findings showed that the ability to play multiple positions is crucial for players at positions 2 and 3, while players at position 1 are more expert players (Trninić et al., 2000). In basketball, certain guards assume the role of ball handlers within their team, which forwards or centers may also take on. Conversely, some guards do not fulfill the ball handler role, and this principle can be extended to other positions on the court (Sailofsky, 2018). This coincides with our findings: our clustering results yielded 8 categories of wings and 7 categories of big men, reflecting the importance of forward players playing multiple roles in the game; there were only 4 categories of ball-handlers, suggesting that combing the offense and creating scoring chances are their main responsibilities. In addition, the study by Rangel et al. noted that the roles of foreign players showed higher diversity (Rangel et al., 2019). In our study, except for F1 players who are required to be the team’s primary ball-handler, the other types of players showed diversity in their scoring styles. Although the diversity of player roles is a growing trend in basketball, having more than three such “glue guys” proved to be detrimental to team performance in the study by Lutz (2012). Therefore, coaches should try to balance the playing positions and tactical roles of players when constructing lineups.
Our study employed the elbow method to categorize players into three macro-level roles: ball handlers, wings, and big men. Previous studies have reported similar findings, corroborating and reinforcing our findings (Bianchi et al., 2017; Whitehead, 2017a). These findings offer valuable insights for analyzing player roles and conducting further research on the evolution of basketball tactics. Subsequent categorization revealed 14 offensive roles for native players and five for international players. Several previous studies have also created specific labels for basketball players’ roles in the game. For example, Alagappan et al. expanded the traditional five positions to 13 (Alagappan, 2012), and Lutz et al. defined players into 10 different clusters (Lutz, 2012). This categorization, however, fails to fully transcend the conventional positional constraints due to the substantial variation in statistical metrics across different player positions. For instance, guards tend to have higher counts of assists and successful three-point shots, whereas centers excel in rebounds (Sampaio et al., 2006). There are also clusters of a few players, such as “one-of-a-kind,” which are individuals with outstanding talent and statistics, For example, James Harden and Russell Westbrook, who perform well in a variety of statistical data (Alagappan, 2012; Lutz, 2012). While these players are not necessarily guaranteed to win (Lutz, 2012), they are key players that every team pursues. Stated differently, these player clusters might lack representativeness as not all teams can boast of having such players. Consequently, the delineation of players according to roles, rather than rigid positions, facilitates adaptability and flexibility in response to team strategies and individual proficiencies (Lorenzo et al., 2019; Zhang et al., 2019). In the course of a game, adept teams swiftly modify their successive offensive maneuvers in response to varying defensive shifts, whereas alterations in defensive strategies minimally affect offensive actions (Christmann et al., 2018). Hence, characterizing player roles through play-type data will better mirror a player’s skill set and designated role.
An analysis of the offensive role composition in different teams demonstrated significant variations among three distinct offensive role categories, all falling within the classification of big players (N10, N13, F4). Prior studies on player classification consistently demonstrate the relatively low number of big men. Furthermore, the rarity of big men with specific skills like long-range shooting or proficiency in the low post hampers their availability on most teams (Çene, 2018). Consequently, teams without such players must adjust their tactics or select alternative types of big men as substitutes (Wang et al., 2022). The logistic regression results revealed a positive impact of the two offensive roles (N6 and N13) on a team’s advancement to the semi-finals. Consistent with previous studies on play types, successful teams were found to exhibit more cuts, low-post offense, spot-up shots, and free throws (Zukolo et al., 2019; Matulaitis and Bietkis, 2021), which aligns with the characteristics of the two identified offensive roles. Furthermore, the significance of the off-ball movement was emphasized. Additionally, offensive rebounding emerged as a crucial aspect of performance. Offensive rebounding creates secondary scoring opportunities and diminishes the opponent’s chances of transitioning into offense (Zukolo et al., 2019). Ball handlers play a pivotal role within the team, as over half of the set-offense are initiated through ball screens (Gómez et al., 2015; Vaquera et al., 2016). However, research has indicated that lineups with more than two high-usage ball handlers tend to underperform, decreasing team efficiency (Kalman and Bosch, 2020; Wang et al., 2022). The same finding was also seen in the study by Lutz et al. who noted that ball handlers have a negative impact on a team’s point differential (Lutz, 2012). Transition offense, while efficient, does not exhibit significant correlations with game outcomes, likely due to its relatively small proportion within overall gameplay (Selmanovi, 2015; Suárez-Cadenas et al., 2016; Matulaitis and Bietkis, 2021). Consequently, players specializing solely in transition offense and spot-up shooting are considered highly interchangeable. Moreover, players proficient in scoring from screens (N11) heavily rely on the team’s ability to create space for them and face heightened defensive pressure (McIntyre et al., 2016). Notably, off-ball screens and hand-offs contribute the least to scoring; losing teams tend to have more attacks (Suárez-Cadenas et al., 2016; Zukolo et al., 2019).
Finally, our study indicates that the offensive role of foreign players has no substantial impact on team performance. These findings align with prior research that found no significant difference in efficiency between foreign players of top teams and average (non-top) teams (Ozmen, 2012; Wang et al., 2022). Conversely, native players of top teams are more efficient than native players of teams outside the top tier. Native players in the CBA league exhibit distinct offensive characteristics, with spot-up, transition, and cutting actions comprising a significant portion of their gameplay. These offensive strategies prioritize teamwork and have been proven effective in several studies (Christmann et al., 2018; Zukolo et al., 2019; Matulaitis and Bietkis, 2021). However, they rely on teammate support for execution, making these offensive roles often perceived as scoring beneficiaries (Whitehead, 2017b; Zukolo et al., 2019). In contrast, pick-and-roll ball handlers, isolation, and post-ups were more prevalent among foreign players. While some studies disagree on the efficiency of these playing styles (Selmanovi, 2015; Christmann et al., 2018; Zukolo et al., 2019), they tend to emphasize individual ability. Consequently, these offensive roles are regarded as scoring creators (Whitehead, 2017b; Zukolo et al., 2019), particularly in teams lacking strong scoring capabilities, where the role of these players becomes even more vital (Arcidiacono et al., 2017). Conversely, foreign players have demonstrated superior talent to domestic players in the league (Ozmen, 2012; Özmen, 2019; Gasperi et al., 2020). Due to the distinct playing time regulations concerning foreign players in the CBA league, teams are constrained to field a single foreign player during each quarter. This stipulation has led most teams to prioritize the scoring ability of these foreign recruits. Lutz et al. pointed out in their study that some player classifications such as aggressive bigs, big bodies, and perimeter scorers, have a negative impact on the game score difference (Lutz, 2012). Also, it was mentioned in their study that big players are generally unfavorable to the outcome of the game (Lutz, 2012), which diverges from our findings. Our explanation is that similar to the roles of native and foreign players in other leagues, native players play a secondary role in their team’s offense, relying on their teammates to create chances while focusing more on defense and other secondary responsibilities and playing a lesser role in the creative aspects of the game (Guimarãe et al., 2018; Özmen, 2019); while some studies have also pointed out that big players who focus on defense have a positive impact on the outcome of the game (Lutz, 2012; Wang et al., 2022), which support our research findings. These insights prompt coaches to recognize that with all teams having the funds to recruit talented foreign players, it is vital to develop native players, especially big men with a great sense of teamwork and defensive ability.
5. Limitations and future directions
This study has certain limitations. Firstly, similar to other studies employing clustering for player role classification, the final clustering results are inherently subjective, leading to the possibility of misclassifying players into different roles, which is an unavoidable issue in clustering studies. Secondly, basketball is a multifaceted game with various dimensions, where player roles can significantly fluctuate depending on team strategies, game situations, and opponent circumstances. This study solely analyzed players’ offensive types while neglecting relevant defensive metrics. Some players renowned for their defensive prowess may have been overlooked, potentially oversimplifying players’ roles and disregarding the diversity of their on-court contributions. Lastly, when studying the impact of offensive roles on team performance, we solely employed the number of players as the independent variable due to sample size limitations; other variables were not considered, such as players’ playing time and efficiency values across different play types, which constrained our ability to investigate further the influence of players’ roles on team performance.
Regarding future research directions, several areas warrant exploration. First, player roles evolve in response to coaching decisions, formation adjustments, and player development. Investigating the patterns and trends of these role changes is one of the areas we intend to explore. Second, incorporating a larger sample size and additional influential variables concerning the impact of player roles on team performance would enhance model stability, thereby facilitating the further investigation of the matter. Lastly, future research should integrate advanced machine learning techniques like deep or reinforcement learning to capture player roles’ intricate relationships and temporal dynamics. These approaches are adept at handling large-scale data, identifying complex patterns and dependencies, and yielding a more nuanced comprehension of offensive roles.
6. Conclusion
In this study, we utilize play-type data with interaction information to categorize native and foreign players in the CBA league into 14 and five distinct offensive roles, respectively, employing a clustering approach. These results provide a unique perspective for understanding and observing the player’s role. Further research found that big players made a difference in team composition. The offensive roles of two categories of native players, whose attributes emphasized both teamwork and off-ball movement, showed a significant influence on team performance. In contrast, no such impact was found for foreign players. Consequently, basketball coaches and managers should consider optimizing the composition of individual player clusters during the team-building process and prioritize the development of native players.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.
Author contributions
RC: Data curation, Software, Visualization, Writing – original draft. MZ: Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing. XX: Methodology, Software, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1256796/full#supplementary-material
Footnotes
References
Alagappan, M. (2012). From 5 to 13: redefining the positions in basketball. in MIT sloan sports analytics conference, Boston, MA, USA. February, 2016.
Arcidiacono, P., Kinsler, J., and Price, J. (2017). Productivity spillovers in team production: evidence from professional basketball. J. Labor Econ. 35, 191–225. doi: 10.1086/687529
Barrow, D., Drayer, I., Elliott, P., Gaut, G., and Osting, B. (2013). Ranking rankings: an empirical comparison of the predictive power of sports ranking methods. J. Quant. Analysis Sports 9, 187–202. doi: 10.1515/jqas-2013-0013
Berri, D. J., Brook, S. L., and Fenn, A. J. (2011). From college to the pros: predicting the NBA amateur player draft. J. Prod. Anal. 35, 25–35. doi: 10.1007/s11123-010-0187-x
Bianchi, F., Facchinetti, T., and Zuccolotto, P. (2017). Role revolution: towards a new meaning of positions in basketball. Electronic J. App. Stat. Analysis 10, 712–734.
Božović, B. (2021). The use of “synergy sports technology” for the collection of basketball game statistics. In Sinteza 2021-international scientific conference on information technology and data related research Belgrade: Singidunum University, 272–276.
Çene, E. (2018). What is the difference between a winning and a losing team: insights from Euroleague basketball. Int. J. Perform. Anal. Sport 18, 55–68. doi: 10.1080/24748668.2018.1446234
Cheng, A. (2017). Using machine learning to find the 8 types of players in the NBA. Fastbreak Data. 9. Available at: https://medium.com/fastbreak-data/classifying-the-modern-nba-player-with-machine-learning-539da03bb824.
Christmann, J., Akamphuber, M., Müllenbach, A. L., and Güllich, A. (2018). Crunch time in the NBA–the effectiveness of different play types in the endgame of close matches in professional basketball. Int. J. Sports Sci. Coach. 13, 1090–1099. doi: 10.1177/1747954118772485
Cooper, W. W., Ruiz, J. L., and Sirvent, I. (2009). Selecting non-zero weights to evaluate effectiveness of basketball players with DEA. Eur. J. Oper. Res. 195, 563–574. doi: 10.1016/j.ejor.2008.02.012
Courel-Ibáñez, J., Suárez, E., Ortega Toro, E., Piñar López, M. I., and Cárdenas Vélez, D. (2013). Is the inside pass a performance indicator? Observational analysis of elite basketball teams. Revista de Psicología del Deporte 22, 0191–0194.
Cui, M. (2020). Introduction to the k-means clustering algorithm based on the elbow method. Account. Audit. Finance 1, 5–8.
Deman, B., Trnini, S., and Dizdar, D. (2001). Expert model of decision-making system for efficient orientation of basketball players to positions and roles in the game–empirical verification. Coll. Antropol. 25, 141–152.
Diambra, N. J. (2018). Using topological clustering to identify emerging positions and strategies in NCAA men’s basketball Master's Thesis, Knoxville, TN, University of Tennessee.
Fu, Y., and Stasko, J. (2022). Supporting data-driven basketball journalism through interactive visualization. in Proceedings of the 2022 CHI conference on human factors in computing systems, 1–17.
Gasperi, L., Conte, D., Leicht, A., and Gómez-Ruano, M.-Á. (2020). Game related statistics discriminate national and foreign players according to playing position and team ability in the Women’s basketball euro league. IJERPH 17:5507. doi: 10.3390/ijerph17155507
Gómez, M.-Á., Battaglia, O., Lorenzo, A., Lorenzo, J., Jiménez, S., and Sampaio, J. (2015). Effectiveness during ball screens in elite basketball games. J. Sports Sci. 33, 1844–1852. doi: 10.1080/02640414.2015.1014829
Gómez, M.-A., Lorenzo, A., Ibañez, S.-J., and Sampaio, J. (2013). Ball possession effectiveness in men’s and women’s elite basketball according to situational variables in different game periods. J. Sports Sci. 31, 1578–1587. doi: 10.1080/02640414.2013.792942
Gòmez, M.-Á., Lorenzo, A., Ortega, E., Sampaio, J., and Ibàñez, S.-J. (2009). Game related statistics discriminating between starters and nonstarters players in Women’s National Basketball Association League (WNBA). J. Sports Sci. Med. 8, 278–283.
Gómez, M. A., Lorenzo, A., Sampaio, J., and Ibáñez, S. J. (2006). Differences in game-related statistics between winning and losing teams in women’s basketball. J. Hum. Mov. Stud. 51, 357–369.
Gomez-Ruano, M.-A., Ibáñez, S. J., and Leicht, A. S. (2020). Performance analysis in sport. Lausanne: Frontiers Media SA
Guimarãe, E., Santos, A., Santos, E., Tavares, F., and Janeira, M. A. (2018). National players vs. foreign players: what distinguishes their game performances? A study in the portuguese basketball league. RICYDE. Revista Internacional de Ciencias del Deporte 14, 374–381.
Haghighat, M., Rastegari, H., Nourafza, N., Branch, N., and Esfahan, I. (2013). A review of data mining techniques for result prediction in sports. Advan. Comp. Sci. Int. J. 2, 7–12.
Hughes, M. D., and Bartlett, R. M. (2002). The use of performance indicators in performance analysis. J. Sports Sci. 20, 739–754. doi: 10.1080/026404102320675602
Hughes, M., Evans, S., and Wells, J. (2001). Establishing normative profiles in performance analysis. Int. J. Perform. Anal. Sport 1, 1–26. doi: 10.1080/24748668.2001.11868245
Hughes, M., and Franks, I. (2007). The essentials of performance analysis: an introduction. United Kingdom: Routledge.
Ibstedt, J., Rådahl, E., and Turesson, E. (2019). Application and further development of trueskill™ ranking in sports.
Igiri, C. P., and Nwachukwu, E. O. (2014). An improved prediction system for football a match result. IOSR J. Engin. 4, 2278–8719.
Ivanković, Z., Racković, M., Markoski, B., Radosav, D., and Ivković, M. (2010). Analysis of basketball games using neural networks. 2010 11th international symposium on computational intelligence and informatics (CINTI) Manhattan: IEEE, 251–256.
Kalman, S., and Bosch, J. (2020). NBA lineup analysis on clustered player tendencies: a new approach to the positions of basketball & modeling lineup efficiency of soft lineup aggregates. in 14th annual MIT sloan sports analytics conference.
Katzenbach, J. R., and Smith, D. K. (2015). The wisdom of teams: creating the high-performance organization. Brighton Harvard Business Review Press.
Knoll, J., and Stübinger, J. (2020). Machine-learning-based statistical arbitrage football betting. KI-Künstliche Intelligenz 34, 69–80. doi: 10.1007/s13218-019-00610-4
Koo, T. K., and Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med. 15, 155–163. doi: 10.1016/j.jcm.2016.02.012
Kovalchik, S. (2020). Extension of the Elo rating system to margin of victory. Int. J. Forecast. 36, 1329–1341. doi: 10.1016/j.ijforecast.2020.01.006
Lamas, L., Santana, F., Heiner, M., Ugrinowitsch, C., and Fellingham, G. (2015). Modeling the offensive-defensive interaction and resulting outcomes in basketball. PLoS One 10:e0144435. doi: 10.1371/journal.pone.0144435
Latin, R. W., Berg, K., and Baechle, T. (1994). Physical and performance characteristics of NCAA division I male basketball players. J. Strength Cond. Res. 8, 214–218.
Lin, J., Short, L., and Sundaresan, V. (2014). Predicting national basketball association winners. CS 229 FINAL PROJECT, 1–5.
Lorenzo, J., Lorenzo, A., Conte, D., and Giménez, M. (2019). Long-term analysis of elite basketball players’ game-related statistics throughout their careers. Front. Psychol. 10:421. doi: 10.3389/fpsyg.2019.00421
Lutz, D. (2012). A cluster analysis of NBA players. In Proceedings of the MIT Sloan Sports Analytics Conference, Boston, MA, USA. Retrieved February, 2016.
Marmarinos, C., Apostolidis, N., Kostopoulos, N., and Apostolidis, A. (2016). Efficacy of the “pick and roll” offense in top level European basketball teams. J. Hum. Kinet. 51, 121–129. doi: 10.1515/hukin-2015-0176
Mateus, N., Esteves, P., Gonçalves, B., Torres, I., Gomez, M. A., Arede, J., et al. (2020). Clustering performance in the European basketball according to players’ characteristics and contextual variables. Int. J. Sports Sci. Coach. 15, 405–411. doi: 10.1177/1747954120911308
Matulaitis, K., and Bietkis, T. (2021). Prediction of offensive possession ends in elite basketball teams. IJERPH 18:1083. doi: 10.3390/ijerph18031083
McGarry, T., O’Donoghue, P., de Eira Sampaio, A. J., and Sampaio, J. (2013). Routledge handbook of sports performance analysis. Routledge, London UK.
McIntyre, A., Brooks, J., Guttag, J., and Wiens, J. (2016). Recognizing and analyzing ball screen defense in the NBA. Proceedings of the MIT sloan sports analytics conference, Boston, MA, USA, 11–12.
Metulini, R., and Gnecco, G. (2022). Measuring players’ importance in basketball using the generalized Shapley value. Ann. Oper. Res. 325, 441–465. doi: 10.1007/s10479-022-04653-z
Morgan, S., Williams, M. D., and Barnes, C. (2013). Applying decision tree induction for identification of important attributes in one-versus-one player interactions: a hockey exemplar. J. Sports Sci. 31, 1031–1037. doi: 10.1080/02640414.2013.770906
Mustafa, R. U., Nawaz, M. S., Lali, M. I. U., Zia, T., and Mehmood, W. (2017). Predicting the cricket match outcome using crowd opinions on social networks: a comparative study of machine learning methods. Malays. J. Comput. Sci. 30, 63–76. doi: 10.22452/mjcs.vol30no1.5
Oliver, D. (2004). Basketball on paper: Rules and tools for performance analysis. Dulles, VA Potomac Books, Inc.
Ostojic, S. M., Mazic, S., and Dikic, N. (2006). Profiling in basketball: physical and physiological characteristics of elite players. J. Strength Cond. Res. 20, 740–744. doi: 10.1519/00124278-200611000-00003
Ozmen, M. U. (2012). Foreign player quota, experience and efficiency of basketball players. J. Quant. Analysis Sports 8:1. doi: 10.1515/1559-0410.1370
Özmen, M. U. (2019). Short-term impact of a foreign player quota liberalisation policy on domestic player performance: evidence from a regression discontinuity design. Int. J. Sport Policy Politics 11, 39–55. doi: 10.1080/19406940.2018.1488758
Pai, P.-F., Chang Liao, L.-H., and Lin, K.-P. (2017). Analyzing basketball games by a support vector machines with decision tree model. Neural Comput. Applic. 28, 4159–4167. doi: 10.1007/s00521-016-2321-9
Patel, R. (2017). Clustering professional basketball players by performance. University of California, Los Angeles.
Pion, J., Segers, V., Stautemas, J., Boone, J., Lenoir, M., and Bourgois, J. G. (2018). Position-specific performance profiles, using predictive classification models in senior basketball. Int. J. Sports Sci. Coach. 13, 1072–1080. doi: 10.1177/1747954118765054
Rangel, W., Ugrinowitsch, C., and Lamas, L. (2019). Basketball players’ versatility: assessing the diversity of tactical roles. Int. J. Sports Sci. Coach. 14, 552–561. doi: 10.1177/1747954119859683
Remmert, H. (2003). Analysis of group-tactical offensive behavior in elite basketball on the basis of a process orientated model. Eur. J. Sport Sci. 3, 1–12. doi: 10.1080/17461390300073311
Sampaio, J., Janeira, M., Ibáñez, S., and Lorenzo, A. (2006). Discriminant analysis of game-related statistics between basketball guards, forwards and centres in three professional leagues. Eur. J. Sport Sci. 6, 173–178. doi: 10.1080/17461390600676200
Sarlis, V., and Tjortjis, C. (2020). Sports analytics—evaluation of basketball players and team performance. Inf. Syst. 93:101562. doi: 10.1016/j.is.2020.101562
Selmanovi, A. (2015). Basic characteristics of offensive modalities in the Euroleague. Acta Kinesiologica 9, 83–87.
Shea, S. M., and Baker, C. E. (2013). Basketball analytics: objective and efficient strategies for understanding how teams win. Scotts Valley, California: CreateSpace Independent Publishing Platform. Advanced Metrics.
Suárez-Cadenas, E., Courel-Ibáñez, J., Cárdenas, D., and Perales, J. C. (2016). Towards a decision quality model for shot selection in basketball: an exploratory study. Span. J. Psychol. 19:E55. doi: 10.1017/sjp.2016.53
Teramoto, M., and Cross, C. L. (2018). Importance of team height to winning games in the National Basketball Association. Int. J. Sports Sci. Coach. 13, 559–568. doi: 10.1177/1747954117730953
Terner, Z., and Franks, A. (2021). Modeling player and team performance in basketball. Annual Rev. Stat. Applicat. 8, 1–23. doi: 10.1146/annurev-statistics-040720-015536
Tian, C., De Silva, V., Caine, M., and Swanson, S. (2019). Use of machine learning to automate the identification of basketball strategies using whole team player tracking data. Appl. Sci. 10:24. doi: 10.3390/app10010024
Trawiński, K. (2010). A fuzzy classification system for prediction of the results of the basketball games. in International conference on fuzzy systems (IEEE), 1–7.
Trninić, S., Dizdar, D., and Dežman, B. (2000). Empirical verification of the weighted system of criteria for the elite basketball players quality evaluation. Coll. Antropol. 24, 443–465.
Trninić, S., Dizdar, D., and Jaklinović-Fressl, Ž. (1999). Analysis of differences between guards, forwards and centres based on some anthropometric characteristics and indicators of playing performance in basketball. Kinesiology 31, 28–34.
Van der Maaten, L., and Hinton, G. (2008). Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605.
Vaquera, A., Cubillo, R., García-Tormo, J. V., and Morante, J. C. (2013). Validation of a tactical analysis methodology for the study of pick and roll in basketball. Revista de psicología del deporte 22, 277–281.
Vaquera, A., García-Tormo, J. V., Ruano, M. G., and Morante, J. C. (2016). An exploration of ball screen effectiveness on elite basketball teams. Int. J. Perform. Anal. Sport 16, 475–485. doi: 10.1080/24748668.2016.11868902
Volker, M. A. (2006). Reporting effect size estimates in school psychology research. Psychol. Sch. 43, 653–672. doi: 10.1002/pits.20176
Wang, X., Han, B., Zhang, S., Zhang, L., Lorenzo Calvo, A., and Gomez, M.-Á. (2022). The differences in the performance profiles between native and foreign players in the Chinese basketball association. Front. Psychol. 12:788498. doi: 10.3389/fpsyg.2021.788498
Whitehead, T. (2017a). Nylon Calculus: Defining 23 offensive roles using the NBA’s play-type data. Available at: https://fansided.com/2018/10/15/nylon-calculus-defining-23-offensive-roles-nba-play-type-data/.
Whitehead, T. (2017b). Nylon Calculus: Ranking the best and worst scorers in every offensive role. Available at: https://fansided.com/2017/08/09/nylon-calculus-ranking-best-worst-scorers-every-offensive-role/.
Willoughby, K. A., and Kostuk, K. J. (2005). An analysis of a strategic decision in the sport of curling. Decis. Anal. 2, 58–63. doi: 10.1287/deca.1050.0032
Yong, W., Lingyun, P., and Jia, W. (2020). Statistical analysis and ARMA modeling for the big data of marathon score. Science. Sports 35, 375–385. doi: 10.1016/j.scispo.2020.01.009
Yu, K., Su, Z., and Zhuang, R. (2008). An exploratory study of long-term performance evaluation for elite basketball players. Int. J. Sports Sci. Engineer. 2, 195–203.
Zarić, I., Kukić, F., Jovićević, N., Zarić, M., Marković, M., Toskić, L., et al. (2020). Body height of elite basketball players: do taller basketball teams rank better at the FIBA world cup? Int. J. Environ. Res. Public Health 17:3141. doi: 10.3390/ijerph17093141
Zdravevski, E., and Kulakov, A. (2009). System for prediction of the winner in a sports game. in International conference on ICT innovations (Heidelberg: Springer), 55–63.
Zhang, S., Lorenzo, A., Gómez, M.-A., Liu, H., Gonçalves, B., and Sampaio, J. (2017). Players’ technical and physical performance profiles and game-to-game variation in NBA. Int. J. Perform. Anal. Sport 17, 466–483. doi: 10.1080/24748668.2017.1352432
Zhang, S., Lorenzo, A., Zhou, C., Cui, Y., Gonçalves, B., and Angel Gómez, M. (2019). Performance profiles and opposition interaction during game-play in elite basketball: evidences from National Basketball Association. Int. J. Perform. Anal. Sport 19, 28–48. doi: 10.1080/24748668.2018.1555738
Zhang, L., Lu, F., Liu, A., Guo, P., and Liu, C. (2016). Application of K-means clustering algorithm for classification of NBA guards. Int. J. Sci. Engineer. Applicat. 5, 1–6. doi: 10.7753/IJSEA0501.1001
Zhou, H. B., and Gao, J. T. (2014). Automatic method for determining cluster number based on silhouette coefficient. Adv. Mater. Res. 951, 227–230. doi: 10.4028/www.scientific.net/AMR.951.227
Keywords: team performance, cluster analysis, Chinese Basketball Association, play type, player roles
Citation: Chen R, Zhang M and Xu X (2023) Modeling the influence of basketball players’ offense roles on team performance. Front. Psychol. 14:1256796. doi: 10.3389/fpsyg.2023.1256796
Edited by:
Miguel-Angel Gomez-Ruano, Universidad Politécnica de Madrid, SpainReviewed by:
Leonardo Lamas, University of Brasilia, BrazilEnrique Alonso, Polytechnic University of Madrid, Spain
Iker Madinabeitia, University of Granada, Spain
Copyright © 2023 Chen, Zhang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mingxin Zhang, emhhbmdtaW5neGluQHN1cy5lZHUuY24=