Spatial patterns of phylogenetic diversity and endemism in the Western Ghats, India: A case study using ancient predatory arthropods

Abstract The Western Ghats (WG) mountain chain in peninsular India is a global biodiversity hotspot, one in which patterns of phylogenetic diversity and endemism remain to be documented across taxa. We used a well‐characterized community of ancient soil predatory arthropods from the WG to understand diversity gradients, identify hotspots of endemism and conservation importance, and highlight poorly studied areas with unique biodiversity. We compiled an occurrence dataset for 19 species of scolopendrid centipedes, which was used to predict areas of habitat suitability using bioclimatic and geomorphological variables in Maxent. We used predicted distributions and a time‐calibrated species phylogeny to calculate taxonomic and phylogenetic indices of diversity, endemism, and turnover. We observed a decreasing latitudinal gradient in taxonomic and phylogenetic diversity in the WG, which supports expectations from the latitudinal diversity gradient. The southern WG had the highest phylogenetic diversity and endemism, and was represented by lineages with long branch lengths as observed from relative phylogenetic diversity/endemism. These results indicate the persistence of lineages over evolutionary time in the southern WG and are consistent with predictions from the southern WG refuge hypothesis. The northern WG, despite having low phylogenetic diversity, had high values of phylogenetic endemism represented by distinct lineages as inferred from relative phylogenetic endemism. The distinct endemic lineages in this subregion might be adapted to life in lateritic plateaus characterized by poor soil conditions and high seasonality. Sites across an important biogeographic break, the Palghat Gap, broadly grouped separately in comparisons of species turnover along the WG. The southern WG and Nilgiris, adjoining the Palghat Gap, harbor unique centipede communities, where the causal role of climate or dispersal barriers in shaping diversity remains to be investigated. Our results highlight the need to use phylogeny and distribution data while assessing diversity and endemism patterns in the WG.

geoclimatic processes, there is also a contemporary seasonality gradient along the WG, with northern latitudes showing greater temperature and precipitation seasonality than the southern latitudes (Bose et al., 2019;Joshi & Karanth, 2013;Page & Shanker, 2020).
Studies based on distributions of plant and animal species from this region have revealed a decreasing trend in diversity from the southern to the northern WG (plants: Davidar et al., 2005, Page & Shanker, 2020snails: Aravind et al., 2005; frogs: Daniels, 1992, Aravind & Gururaja, 2011. The southern WG has also been associated with high endemism (Aravind & Gururaja, 2011;Daniels, 1992;Divya et al., 2020). However, these patterns are based on species distributions and have not been investigated using explicit phylogenetic data, with the exception of recent studies on the diversity of woody plants (Bose et al., 2019;Divya et al., 2020).
While taxonomic diversity (TD) is one of the most commonly used measures to characterize a community, it treats all species as independent units, which may not be true. To address this, phylogenetic diversity (PD) was proposed to explicitly incorporate the evolutionary history for each species, which would reflect the accumulated evolutionary history of a community (Faith, 1992). Further, the geographic distribution of a species along with phylogenetic divergence can be incorporated for all the taxa in a given region through phylogenetic endemism (PE), a metric that weights the branch lengths of each lineage by its respective geographic range (Rosauer et al., 2009).
The use of PD and PE allows us to assess the roles of ecological, historical, and evolutionary processes that structure communities, and their usefulness has been demonstrated in multiple biodiverse and complex landscapes (Azevedo et al., 2020;Fenker et al., 2020;Mishler et al., 2014), but remains limited in Asian tropical forests (Bose et al., 2019;Divya et al., 2020;Tamma & Ramakrishnan, 2015).
In such regions, including the WG, it is important to document the patterns of both phylogenetic diversity and endemism to understand the ecological and evolutionary processes shaping biodiversity and to identify areas of conservation importance.
While most existing global macroecological studies focus on plants (Massante et al., 2019), birds (Jetz et al., 2014), mammals (Safi et al., 2011), and herpetofauna (Fritz & Rahbek, 2012), arthropods have been largely ignored (Beck & McCain, 2020). Among arthropods, predatory soil-dwelling communities have been particularly neglected in macroecological studies (Finch et al., 2008), as they typically consist of many cryptic species occurring in low abundance, making them difficult to detect. This is coupled with a lack of taxonomic expertise in identifying them to the species-level, as well as the presence of many undescribed species/lineages. In many ecosystems, predatory soil arthropods are likely to be the oldest lineages and play an important role in maintaining the ecosystem, but their diversity patterns remain poorly understood.
Centipedes (class: Chilopoda) are one such group of soil arthropods, which represent one of the four main myriapod lineages, with a 420-million-year(Ma)-old fossil history, making them some of the oldest living terrestrial predators (Edgecombe & Giribet, 2019).
The centipede family Scolopendridae from the WG offers a unique opportunity for conducting macroecological studies as it is wellcharacterized taxonomically and phylogenetically. It is among the oldest (Late Cretaceous-100 Ma) soil arthropod communities in the WG and is one of the most diverse centipede groups in tropical Asian forests. Globally, the family Scolopendridae has more than 400 species, of which ~35 species belonging to seven genera occur in the WG.
The WG scolopendrid community has been extensively studied using both morphology and molecular data, resulting in the discovery of many endemic species and radiations (Joshi & Edgecombe, 2018;Joshi et al., 2020). These studies are accompanied by detailed K E Y W O R D S biodiversity hotspots, centipedes, diversity gradients and endemism, peninsular India, species richness primary distribution data spanning large latitudinal (8°N -20°N) and elevational gradients (100 -2,400 meters from mean sea level (msl)).
It is noteworthy that molecular data were integrated with morphology and species ecology in these studies, as the latter often receive lesser attention in the current barcoding era (Padial et al., 2010). The scolopendrid centipedes of the WG vary in their endemicity patterns, ranging from narrow-range endemics to species with wide distributions across the WG or exceptionally widespread distribution into SE Asia.
In this study, we focus on a diverse and predominantly tropical subfamily, Otostigminae (Figure 1). Among the three genera within Otostigminae studied here, Digitipes and Ethmostigmus occur in wet forests, where Digitipes is largely restricted to the WG, while Ethmostigmus has dispersed to the wet forests of the Eastern Ghats in the past (Joshi & Edgecombe, 2019). In contrast, Rhysida is widely distributed across various habitat types throughout peninsular India.
In terms of biogeographic history, there are instances of out of India dispersals across genera, but the origin of most species is in the Indian subcontinent/WG. There is evidence for dispersals and range expansions from the southern WG in Digitipes (Joshi & Karanth, 2013), a likely origin of Ethmostigmus species in south/central WG (Joshi & Edgecombe, 2019), and out of India dispersals in Rhysida (Joshi et al., 2020). The evidence from these larger phylogenies indicates that the WG species evolved in situ rather than through dispersals from NE India or SE Asia. We used this group of centipedes for assessing hypotheses related to the patterns of diversity and endemism for predatory soil arthropods in tropical forests at a community level.
Globally, such datasets are rare, most of them consisting of island communities (Caribbean: Crews & Esposito, 2020;Azores: Borges & Hortal, 2009;Galapagos: Peck, 2006;Hawaii: Gillespie, 2002). F I G U R E 1 (a) Bayesian phylogeny for Western Ghats Otostigminae (Joshi et al., 2020) (b) A detailed map of the distribution records across peninsular India with a focus on scolopendrid centipede communities of the Western Ghats Our aim in this study is to understand how the diversity of the centipede community is structured in the WG. Based on distribution and biogeographic studies of plants and animals from this region (plants: Davidar et al., 2005, Page & Shanker, 2020snails: Aravind et al., 2005;frogs: Aravind & Gururaja, 2011, Daniels, 1992, we expect a latitudinal diversity gradient (LDG) in which diversity increases from higher to lower latitudes in the WG. Additionally, the SWG refuge hypothesis provides expectations of ancient and high diversity in the southern WG and Nilgiris as a result of climatic stability allowing more time for speciation, while central and northern WG taxa have fewer and relatively younger (<65 Ma) lineages. These two hypotheses are not mutually exclusive and provide similar expectations, where the causal process for the LDG is generally associated with climatic stability (among other drivers), while the SWG refuge hypothesis specifically originates from geoclimatic processes related to Cretaceous volcanism in this landscape. To explore these questions, we used species distribution models to map the spatial patterns of diversity and endemism instead of relying only on point locations from sampling surveys. We then used predictions from these models to compare patterns of taxonomic diversity and endemism with phylogenetic diversity and endemism to identify areas with unique diversity. We asked the following specific questions: 1. How is centipede diversity distributed in the WG?
Based on the LDG and SWG refuge hypotheses, we expect to see a decreasing gradient in diversity from southern to northern WG.

Are there hotspots within the hotspot?
Since diversity may not be uniformly distributed given the climatic and topographic heterogeneity in the WG, we examined whether there are areas represented by disproportionately high diversity and endemism within the WG biodiversity hotspot.
3. What are the patterns of species turnover across the WG?
We assessed whether taxonomic and phylogenetic compositions are unique to each of the biogeographic subregions in the WG.

| Species distribution models
Primary location data (n = 100) for 19 species in the three genera of the centipede family Scolopendridae (Subfamily Otostigminae): Digitipes Attems, 1930 (five species), Ethmostigmus Pocock, 1898 (five species), and Rhysida Wood, 1862 (nine species) were obtained by systematic sampling across the Western Ghats (WG) from 2008 to 2010, spanning its latitudinal and elevational gradients (Joshi & Edgecombe, 2013Joshi & Karanth, 2012;Joshi et al., 2020) (see Appendix S1). These data were supplemented with opportunistic sampling which continued to 2018. These locations spanned the extent of peninsular India, with a focus on the wet forests of the WG (Figure 1b). Since it is challenging to identify centipede species in the field, specimens associated with the primary location data were collected and identified in the laboratory based on microscopic examination of morphological characters. Species identity was also assessed through molecular phylogenetic and species delimitation analyses (Joshi & Edgecombe, 2013Joshi & Karanth, 2012;Joshi et al., 2020). State forest department permits were obtained to collect centipedes in protected areas, and specimens were preserved in 70% ethanol. A few secondary locations (n = 10) were obtained from published sources, where we were certain about the species identification based on morphological characters described in the source literature (Jangi & Dass, 1984).
It is difficult to determine true absence for a group such as centipedes due to their low abundance, morphologically cryptic nature, and lack of systematic information on their distribution in less explored areas such as the WG. Therefore, we chose to model species distributions using Maxent version 3.4.1 (Phillips et al., 2006), which uses a presence-background approach to predict species distributions that have been shown to perform well across species, regions (Elith et al., 2006;Phillips & Dudík, 2008), and a range of sample sizes (Hernandez et al., 2006;Wisz et al., 2008). Maxent compares the environment at presence locations against background locations drawn from the model extent to arrive at a model of relative suitability for a species based on the underlying environmental variables (Elith et al., 2011;Merow et al., 2013). We ran Maxent models for each species for the model extent of peninsular India (8° -24°N, 68° -91°E) at 30 arc second resolution (0.0083 × 0.0083 degree resolution, 0.93 × 0.93 km at equator).
In these models, we used presence locations from each species (mean = 9.9 presence locations per species, range = 3-42 presence locations per species) and 10,000 background locations selected probabilistically from a bias layer. The bias layer was derived from a model built using a pooled dataset of presence locations including all species, 19 environmental variables and elevation from WorldClim (Fick & Hijmans, 2017) and a soil type layer (ATREE Spatial Archive, 2020) as predictor variables, 10,000 random background locations, and default Maxent parameters. Higher values of habitat suitability predictions (ranging from 0 to 1) obtained from this model represent areas with environments which are more likely to have been sampled, thus representing sampling bias across the model extent (Phillips et al., 2009).
The predictor variables for species-specific Maxent models consisted of six primary environmental layers and elevation from the WorldClim database (Fick & Hijmans, 2017) and a soil type layer (ATREE Spatial Archive, 2020). This reduced subset of WorldClim environmental variables (annual mean temperature, maximum temperature of the warmest month, minimum temperature of the coldest month, annual precipitation, precipitation of the wettest month, and precipitation of the driest month) has been recommended for species lacking ecological information or in community-level studies involving models of several species (Low et al., 2021). The use of these primary variables reduces complexity in the predictor dataset which can lead to model overfitting (Zeng et al., 2016) and avoids the inclusion of several correlated composite variables, while still allowing the use of different kinds of predictors (Low et al., 2021).
For each species, we built separate models using six different combinations of predictor transformations known as feature classes (LQH, LQ, QH, L, Q, H, where L-linear, Q-quadratic, H-hinge), each of which was tuned using ten different regularization parameters (0.5 to 5 with intervals of 0.5; Maxent default is 1), which smoothens model predictions (Elith et al., 2011). This was done as Maxent defaults for choosing feature classes and regularization parameters might not be appropriate across all species, and species-specific model fitting and tuning are recommended to build simpler models with better transferability (Hallgren et al., 2019;Low et al., 2021;Radosavljevic & Anderson, 2014).
To select the combination of feature classes and regularization multipliers in the best performing models, we carried out model evaluation using a cross-validation approach. Test and training datasets were obtained using four masked geographically structured partitions (Radosavljevic & Anderson, 2014) for species with 20 or more presence locations and k-1 jackknifing (Shcheglovitova & Anderson, 2013) for species with fewer presence locations. Average evaluation metrics calculated across partitions included measures of model transferability -OR MTP (omission rate of test presences in model predictions using a threshold of minimum training presence) and AUC DIFF (differ-

ence between training and test Area Under the Receiver Operator
Curve, which assesses model overfitting), and model discriminatory ability -AUC TEST (interpreted as the probability that the model ranks a randomly picked presence location higher in habitat suitability than a randomly picked background location, Fielding & Bell, 1997;Low et al., 2021). For each species, models with AUC TEST >0.6 were compared sequentially to select those with minimum OR MTP , followed by minimum AUC DIFF and finally maximum AUC TEST . For each species, the best performing model was then used to obtain predictions of habitat suitability across the model extent of peninsular India.
We additionally evaluated two other predictor datasets-1. all WorldClim variables + soil type (21 predictors in total) and 2. ecologically relevant WorldClim variables + soil type (7 predictors in total), along with an alternate AICc-based model selection procedure, details of which are provided in Appendix S2.

| Diversity and endemism measures
The diversity and endemism measures derived from predicted distributions at the scale of peninsular India were cropped for the WG for further analysis, since this biodiversity hotspot is the focus of our study where systematic sampling was undertaken. We aggregated continuous Maxent predictions of relative habitat suitability to obtain maps at a scale of 0.83 × 0.83 degrees (93 × 93 km at equator).
We assigned the maximum value of habitat suitability among the underlying cells to the larger aggregated cell and applied a threshold of maximum sum of sensitivity and specificity (Liu et al., 2005) to convert it into a presence-absence map for each species.
We used these binary maps to calculate taxonomic diversity (TD) and weighted endemism (WE) along with their phylogenetically informative counterparts-phylogenetic diversity (PD) and phylogenetic endemism (PE) for each cell in the model extent. As compared to taxonomic indices of diversity and endemism, the phylogenetic indices provide additional information on evolutionary relationships between species present within a community, helping to distinguish between closely and distantly related species. The diversity indices enable us to test the latitudinal diversity gradient hypothesis in the WG, where we expect to observe increasing diversity with decreasing latitude. Combined with the endemism indices, they help to test predictions from the southern WG refuge hypothesis from which we would expect to see high diversity and endemism within the southern WG. Additionally, both the indices of endemism allow us to identify hotspots consisting of range-restricted species within the WG, while PE additionally identifies evolutionarily unique and range-restricted species.
TD is calculated by stacking species distributions and summing species presences for each cell, whereas WE is calculated by scaling species presence with its range size (number of cells in the predicted map in which a species is present) and summing this across species found in a cell (Crisp et al., 2001). A species time tree for the subfamily Otostigminae based on a combined dataset of mitochondrial and nuclear markers (Joshi et al., 2020) was used to calculate PD and PE. PD is calculated by summing up branch lengths in the minimum spanning path derived from the larger phylogenetic tree, which includes all the species present in a cell (Faith, 1992). PE additionally scales each branch length in the minimum spanning path with its range size prior to summation across the lineages in a cell (Rosauer et al., 2009). Both PD and PE are presented as proportion of total tree length (range: 0 -1) ( Table 1).
To compare and identify differences in the relative distribution of evolutionary ages among lineages present within different communities, we used relative phylogenetic diversity (RPD) and relative phylogenetic endemism (RPE) ( Table 1). These indices compare the observed values of PD and PE with those obtained from a phylogenetic tree with equal branch lengths, which allows us to understand whether there is an over-representation of evolutionarily old or young lineages within a community. This information can provide insights into the biogeographic history or ecological processes operating in a region, for example, it can help to distinguish between centers of neo-and paleo-endemism (Mishler et al., 2014).
RPD and RPE were calculated as the ratio of PD and PE derived from the actual phylogenetic tree over the same indices derived from a phylogenetic tree where species relationships remain the same but lineages have equal branch lengths. An RPD or RPE value of 1 indicates that the lineages present in a cell have equal branch lengths.
Values larger than 1 indicate regions harboring species with longer branch lengths in the phylogenetic tree, and values smaller than 1 indicate regions harboring species with shorter branch lengths in the phylogenetic tree (Mishler et al., 2014).
We generated null distributions of phylogenetic diversity and endemism measures for comparison with observed values by random assignment of species occurrences to grid cells in the model extent without replacement, while keeping the taxonomic diversity and range size of each species constant (Mishler et al., 2014).
This procedure randomizes the species identities in a grid cell, and in doing so picks a random set of tip labels from the phylogenetic tree for each grid cell as compared to observed species identities (Mishler et al., 2014). The diversity and endemism indices and their corresponding null distributions were calculated using Biodiverse 3.1 (Laffan et al., 2010).
We used measures of beta diversity to assess whether taxonomic and phylogenetic composition varies between the different biogeographic subregions recognized along the WG. To identify patterns of taxonomic beta diversity, we used the Simpson dissimilarity index, which describes the variation in species composition due to species turnover alone (Baselga, 2010). We also calculated its phylogenetic counterpart in the form of PhyloSor Turn , which measures the loss of branch lengths between communities not explained by differences in phylogenetic diversity (Leprieur et al., 2012). The taxonomic and phylogenetic indices of turnover (Table 1) were used for cluster analysis using the UPGMA algorithm (Michener & Sokal, 1957) to group subregions in the WG (we applied cutoffs to obtain k = 4 clusters) based on patterns of species composition. Beta diversity and phylogenetic beta diversity indices were calculated using the "betapart" package (Baselga, 2010) in R 3.6.1 (R Core Team, 2019).   (Mishler et al., 2014) Ratio of observed phylogenetic diversity over the same index calculated using an identical phylogenetic tree but with equal branch lengths.

| Maxent predictions for the scolopendrid community
Values significantly larger than 1 indicate the overrepresentation of lineages with long branch lengths.
6 RPE Relative Phylogenetic Endemism (Mishler et al., 2014) Ratio of observed phylogenetic endemism over the same index calculated using an identical phylogenetic tree but with equal branch lengths.
Values significantly larger than 1 indicate the overrepresentation of lineages with restricted range sizes and long branch lengths.
7 Pairwise Simpson's dissimilarity index (Lennon et al., 2001;Simpson, 1943) The lower of the species numbers unique to each grid cell divided by sum of this value with the number of common species between grid cells.
Compositional differences between communities explained by species replacement alone, where higher values indicate greater dissimilarity.

8
PhyloSor Turn Phylogenetic Turnover (Leprieur et al., 2012) The lower of the sums of branch lengths unique to each grid cell divided by the above value added to the sum of branch lengths common between grid cells.
Evolutionary dissimilarity between communities explained by loss of branch lengths not explained by differences in phylogenetic diversity, where higher values indicate greater dissimilarity. than 1 was selected, and for 17 species, a simpler subset of fea-

| Patterns of diversity and endemism
The latitudinal trends in taxonomic and phylogenetic diversity were found to be broadly concordant within the WG (Figure 2a

| Patterns of species composition
Simpson dissimilarity and PhyloSor Turn , which describe the turnover component of taxonomic and phylogenetic beta diversity, indicated that grid cells within the southern WG and Nilgiris, along with a few cells on the eastern edge of the WG showed relatively greater compositional similarity. These cells differed in composition to most regions in the central and northern WG, which grouped separately.
This was evident when the first three principal components of the dissimilarity matrix were plotted in RGB color space and also when thresholds were applied to the UPGMA tree of dissimilarity to retrieve four clusters (Figure 3a, 3b).

| How is centipede diversity distributed in the WG?
This is one of the first studies to encompass the entire stretch of the Western Ghats (WG) to systematically evaluate the patterns of diversity, endemism, and composition using primary distribution data to model species distributions along with a robust, dated species-level phylogeny. Measures of taxonomic and phylogenetic diversity show support for predictions from the latitudinal diversity gradient (LDG) and southern WG refuge hypotheses, where we expect centipede diversity to decrease from lower to higher latitudes in the WG.
The decline in diversity with increasing latitude is a well-known pattern across taxa and regions (Hillebrand, 2004;ants: Economo et al., 2018;angiosperms: Kerkhoff et al., 2014;mammals: Rolland et al., 2014). Various explanations have been proposed to explain LDG, where higher species richness at lower latitudes has been attributed to greater time available for speciation, higher speciation rates, or lower extinction rates (Mittelbach et al., 2007). The causal drivers of variation in speciation and extinction rates can be related to geographic area, productivity, time, climatic stability, temperature, and biological interactions (Fine, 2015). The southern WG may have acted as a refuge during Cretaceous volcanism (ca 65 Ma), which is thought to be associated with widespread extinctions of plant and animal species in the northern WG (Joshi & Karanth, 2013 and references therein). This, along with climatic stability over long periods F I G U R E 3 Patterns of (a) Simpson dissimilarity (species turnover) and (b) phylogenetic turnover (PhyloSor Turn ) across the Western Ghats, India (maps in 0.83° × 0.83° grid cells). The left panel is compositional dissimilarity between grid cells represented as distance in color space where similar colors represent similar composition, and each plot needs to be viewed independently. Each plot was obtained by projecting the first three principal component axes of pairwise dissimilarity measures in the RGB scale. The center panel is a UPGMA tree generated from pairwise dissimilarity measures of composition, where the colors on the branches represent the different k = 4 clusters (dashed line representing the cutoff). The clusters recovered from the UPGMA tree are represented spatially in the panel on the right of time as inferred from past vegetation patterns (Divya et al., 2020;Prasad et al., 2009), could have led to the persistence and diversification of ancient centipede lineages in the southern WG. Our findings support the expectations of the southern WG refuge hypothesis, in that the southern WG has higher taxonomic and phylogenetic diversity represented by relatively older lineages as compared to the other WG subregions. The decline in diversity in the higher latitudes of the WG can be additionally related to an increase in temperature and precipitation seasonality, which is associated with the Miocene emergence of the Indian monsoon (Gunnell, 1997) and has been implicated in the decline in taxonomic (Page & Shanker, 2020) and phylogenetic diversity in plants (Bose et al., 2019).

| Are there hotspots within the hotspot?
We find that weighted and phylogenetic endemism broadly converged in the WG, where regions of high endemism represented by relatively older lineages were detected in the southern WG.
Interestingly, plateaus with stunted forests in the northern WG also showed the presence of unique endemic lineages.

| Southern Western Ghats-hotspot within a hotspot
The examination of relative phylogenetic diversity and endemism patterns allowed us to identify ancient, evolutionarily, and geographically unique lineages in the southern WG, which highlights this subregion as a hotspot of conservation importance within the larger WG biodiversity hotspot. While high taxonomic diversity in the southern WG has been documented in plants (Page & Shanker, 2020) and animals (Aravind & Gururaja, 2011;Aravind et al., 2005;Daniels, 1992), an understanding of both diversity and endemism patterns from a phylogenetic perspective remains limited (Divya, 2019). Our current results, using three genera within Otostigminae, align with the inferences drawn from the single genus Digitipes (Joshi & Karanth, 2013), where biogeographic analysis suggested high phylogenetic diversity and endemism in the southern WG.

| Less diverse but unique northern Western Ghats
The northern WG had the lowest phylogenetic diversity, but there was an increase in phylogenetic endemism from central WG to some cells in the northern WG. This is related to the presence of narrow range endemics such as Rhysida crassispina, R. sada, and Ethmostigmus sahyadrensis in this subregion. The centipede community found here also had significantly high relative phylogenetic endemism, which was smaller in magnitude than the southern WG, lateritic rock that has further undergone various levels of erosion (Watve, 2013). They show a diversity of unique seasonal microhabitats (Thorpe et al., 2018) and have a distinct vegetation consisting of several endemic herbaceous species that show adaptations to surviving in poor soil conditions (Joshi & Janarthanam, 2004;Lekhak & Yadav, 2012).
Unfortunately, the network of protected areas in the northern WG is not as extensive as in the southern WG, though they consist of areas identified to be of high conservation value (Das et al., 2006;Watve, 2013). The protected areas in the northern WG are small in size and consist of fragmented forests with high anthropogenic disturbance located in the vicinity of urban centers (Gadgil, 2011;Thorpe & Watve, 2015). Apart from centipedes, there have been records of other range-restricted species on these plateaus across different taxa (snails: Aravind et al., 2005; plants: Lekhak & Yadav, 2012;Shigwan et al., 2020;amphibians: Katwate et al., 2013). The present study highlights the need to systematically understand the evolutionarily unique species found in these plateaus across different taxonomic groups and identify key areas of conservation importance.

| What are the patterns of species turnover across the WG?
Taxonomic and phylogenetic turnover in centipedes revealed two major clusters, which were largely restricted to either the southern WG and Nilgiris, or to the central and northern WG, suggesting that there might be possible species replacement around the Palghat Gap. The Palghat Gap is a 30-km-wide valley interrupting the WG mountain chain, which has been identified as an important dispersal barrier for plants and animals based on distribution patterns (Subramanyam & Nayar, 1974) and genetic analyses (e.g. , Joshi & Karanth, 2013;Robin et al., 2015;Vidya et al., 2005). There is increasing evidence that biogeographic barriers in addition to the climatic barriers can shape community dynamics across tropical areas. In the tropical Andes, river valleys and elevation have been shown to drive distribution and phylogenetic breaks in endemic bird taxa. These barriers are found to encompass areas with high richness of narrowly distributed species (Hazzi et al., 2018).
Major rivers also demarcate bioregions which explain distribution patterns of anurans in Amazonia, followed in importance by the climatic and topographic variation seen in this region (Godinho & da Silva, 2018). In the Australian monsoon tropics, biogeographic barriers have shaped the distribution patterns in plants and several animal taxa (Edwards et al., 2017). Our results recommend the simultaneous assessment of geoclimatic factors while examining patterns of diversity and endemism in the WG, given its complex geological past and the contemporary gradient in temperature and precipitation seasonality.

| Centipede distributions within and outside the Western Ghats
The predictions of the species distribution models correspond to the known habitat affiliations of the centipede genera studied here, where Digitipes and Ethmostigmus are found in wet forests, while

| Limitations of a presence-background modeling approach
Maxent is a presence-background approach for species distribution modeling that has been shown to perform consistently well across a range of sample sizes as compared to other model algorithms (Wisz et al., 2008). Studies have also shown that Maxent models can provide useful predictions with even 5-10 presence locations and that prediction accuracy improves for species with small range sizes related to strong environmental gradients (Hernandez et al., 2006).
This modeling approach has also been shown to be less sensitive to prediction inaccuracies arising from predictor complexity (De Marco & Nóbrega, 2018) and tolerates correlations in predictor variables (Elith et al., 2011). However, Maxent lacks information on prevalence (proportion of presence locations in the model extent), which is necessary to calculate the conditional probability of presence given the environment at a location (Merow et al., 2013). In the absence of this information, Maxent provides predictions of habitat suitability that cannot be compared across models differing in their background. In addition to this, model evaluation is difficult when absence data are not available, as true absences and false presences in model predictions cannot be accurately estimated (Leroy et al., 2018). Despite these caveats, we believe that our use of model transferability measures to arrive at the optimal model and obtain habitat suitability predictions are helpful in surveying potential areas of distribution, which can be used to build a robust presence-absence dataset for future work.
To summarize, we demonstrated the use of primary distribution data along with species distribution modeling and a detailed species-level phylogeny to understand diversity gradients and identify hotspots of endemism in an ancient soil arthropod group within tropical wet forests. We discuss our results in the light of past climatic stability, contemporary patterns in seasonality as well as geography-factors shown to influence diversity and distribution patterns in the WG and globally. Our results from soil arthropods highlight the need for macroecological analyses on a diverse range of taxa to understand diversity and endemism patterns and evaluate their generality in these diverse tropical forests. These would also allow us to compare the relative importance of geological processes and climatic variables in shaping these patterns across a spectrum of life-history traits and evolutionary histories. This approach involving both ecological and evolutionary factors also promises to be useful in identifying areas of endemism across taxa within the biodiversity hotspot, an important exercise for identifying areas of conservation importance.

ACK N OWLED G M ENTS
D. K. Bharti was supported during this study by a start-up grant to Jahnavi Joshi from CSIR-Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad, India. We would like to thank Dr Rohit Naniwadekar, Dr Navendu Page, and Abhishek Gopal for insightful discussions and comments on the paper. We also thank two anonymous reviewers for their detailed comments on the manuscript.

CO N FLI C T O F I NTE R E S T
The authors declare no conflict of interest.

O PE N R E S E A RCH BA D G E S
This article has been awarded Open Data Badge. All materials and data are publicly accessible via the Open Science Framework at https://github.com/bhart idk/centi pede_diver sity_endemism.

DATA AVA I L A B I L I T Y S TAT E M E N T
The location data used in this study are provided in Appendix S1.