
Node, place, ridership, and time model for rail-transit stations: a case study.

Ahad Amini Pishro1, Qihong Yang2, Shiquan Zhang3, Mojdeh Amini Pishro4, Zhengrui Zhang1, Yana Zhao1, Victor Postel5, Dengshi Huang4, WeiYu Li4.   

Abstract

Nowadays, Transit-Oriented Development (TOD) plays a vital role for public transport planners in developing potential city facilities. The importance of this concept means that effective TOD parameters such as network accessibility (node value) and station-area land use (place value) should be considered in city development projects. To manage the coordination between these two factors, we need to consider ridership and peak and off-peak hours as essential enablers in our investigations. To this end, we conducted our research on Chengdu rail-transit stations as a case study to propose our Node-Place-Ridership-Time (NPRT) model. We applied Multiple Linear Regression (MLR) to examine the impacts of node value and place value on ridership. Finally, the K-Means and Cube methods were used to classify the stations based on the NPRT model results. This research indicates that our NPRT model could provide more accurate results than the previous models for evaluating rail-transit stations.
© 2022. The Author(s).

Year:  2022        PMID: 36167963      PMCID: PMC9515214          DOI: 10.1038/s41598-022-20209-4

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.996


Introduction

Public transit has become a logical substitute for private transportation, helping to reduce drawbacks such as air pollution and traffic congestion. Transport planners rely on high-speed trains, subways, and bus rapid transit (BRT) systems to implement Transit-Oriented Development (TOD) concepts in their cities. Policymakers, governments, and municipal mayors look forward to providing better access to public transport systems in high-density cities. Thus, comprehensive models are essential for transport planners. In the past, researchers investigated potential approaches to matching rail-transit supply and demand. Network accessibility and land use have been considered the root factors of the Node-Place (NP) model[1,2]. Researchers define node value based on transport access, network design and structure, and other related network variables. In contrast, they determine place value by assessing the number, diversity, and interaction of urban economic, social, or cultural activities. The NP model is a regional-scale model that concentrates on rail-transit networks and stations to classify TOD typologies. One of the fundamental ideas of this model is that improving the transport provision of a location makes it more accessible and creates the conditions for its development. In turn, the development of the location increases the demand for transport and encourages further development of the transport system. The relationship between node value and place value was mentioned in past research[3]. However, there appear to be two more dimensions that are as necessary as node and place values and have not been considered yet. Ridership refers to the use of transit centers and infrastructure by public transport passengers, which relates to land use and time. To obtain a comprehensive model, we need to collect the data at peak and off-peak hours, since the mentioned parameters and ridership are functionally linked with time.
Analyzing the coordination between node, place, ridership, and time (NPRT) values can help transport planners better understand this model. As an Asian city and a developing area in China, Chengdu benefits from multiple subway lines, high-speed trains, BRT lines, and monorails. This motivated us to select Chengdu as our case study and to propose a comprehensive model for city planners and the municipal government. It would benefit policymakers to apply an extensive model and to know the interaction between NPRT values when preparing strategic transit plans. Thus, we aimed to add ridership and time values as third and fourth dimensions to the previous node-place model and proposed a new model to evaluate transit stations. The remainder of this paper derives our proposed NPRT model and checks the accuracy of existing models. In the next section, the previous node-place model and current research works are reviewed. Section “Methodology and data” covers the methodological approaches and data acquisition. Section “Results and discussion” presents the results, model evaluation, and discussion. Finally, we provide the central conclusion of this research in “Conclusion”.

Literature review

Peter Calthorpe introduced the Transit-Oriented Development (TOD) concept in his book "The Next American Metropolis"[4]. TOD refers to mixed-use and walkable neighborhoods that provide easy access to public transit for people[5]. TOD neighborhoods include transit stations, public centers, high-density residential and commercial buildings, and walkable streets. TOD classification means grouping station areas by their shared functional and morphological characteristics. Distinguishing the types of TOD is a significant concern described in Calthorpe's book[4]. He defined neighborhood TOD and urban TOD according to the spatial orientation of the area functions. Knowing the importance of accessible stations in TOD neighborhoods, many researchers have studied how stations can be made efficient and reachable. A Node-Place (NP) model was proposed in 1999 to categorize and evaluate public transit stations using node value and place value[1]. As mentioned before, balancing land use with transportation is the principal aim of the NP model. This model is conveyed in a two-dimensional diagram, as shown in Fig. 1. In this diagram, the station-area land use corresponds to the x-axis (Place). The place content of an area indicates how human interaction is affected by the diversity of urban activities. The y-axis (Node) represents the accessibility of the node, which relates to people and their interaction. Based on this diagram, five possible situations can be found.
Figure 1

The Node-place model and five ideal–typical situations for a location[1].

The middle diagonal area indicates the "Balance" area (1): if the node value and place value are similar and equally strong, the station is considered an accessible or balanced station. In the Balance zone, infra-systems and land use match each other without any stress on the environment or the system. The "Stress" area (2) shows that the diversity of transportation and activities is over-configured, and the vital node has maximal physical human interaction, making a substantial place value. A station located in the Stress zone has numerous facilities whose potential is already realized, providing more efficient land use. The "Dependence" area (3) represents stations where the node and place values are matched but under-configured. In this zone, the demand for public transportation is deficient. There are enough free spaces, but due to the low demand for public transit, there is no reasonable need for infra-system developments. A station is an "Unbalanced node" (4) if transportation facilities are more available than urban activities. In this area, the land use facilities are relatively lower than the public transit supply, leading to jammed traffic, massive transit lines, and environmental degradation. A station is considered an "Unbalanced place" (5) if the opposite holds: land use activities are more available than the supply of public transportation systems. A review of the previous research shows that a two-dimensional (node-place) model cannot cover all aspects of the analysis of a station. According to the node-place model, increasing or decreasing the node and/or place value(s) would bring an unbalanced station into the balance area[6,7]. However, more dimensions are needed for a comprehensive model, since a station with balanced node and place values cannot be considered efficient or advantageous if it does not have a good ridership value.
Moreover, the coordination between node, place, and ridership values might not remain consistently constructive across peak and off-peak hours. Therefore, the relationship between node, place, and ridership values should be defined as a function of time to obtain a comprehensive model with practical values. Engineers and transport planners should design structures and networks for the critical situation. Thus, time is as essential as node, place, and ridership values. This research considered all four values (node, place, ridership, and time) to create our NPRT model for evaluating the efficiency of rail-transit stations. The node value of a station in the node-place model proposed by Bertolini[1] is defined as the station's network accessibility, including daily service frequency, the number of stations within 45 min of travel, and the number of accessible directions at the station. Other researchers added further indexes to measure the station's node value: proximity to the CBD area by Chorus and Bertolini[8] and a congestion index by Olaru et al.[9]. At a station, network accessibility includes two significant factors: the opportunities accessible from the station and the transport possibilities for reaching those opportunities[10-13]. Zhejing Cao et al. recently added accessible opportunities and network centrality to the node value[14]. Bertolini measured the place value in his proposed node-place model by the station-area land use and the number of residents and employees in economic areas[1]. After Bertolini, other researchers added more indicators to the place value, such as population density, land prices, unemployment rate, number of flats, and core urban area[15,16]. Although the density and diversity of activities are primary factors in place value measurement, it seems necessary to consider other essential indicators such as parking areas, feeder buses at stations, and walking areas.
The built environment features were also included in the place value by Zhejing Cao et al.[14]. Moreover, they studied ridership as the third dimension of the node-place model and created the node-place-ridership model. A comprehensive study of the subway stations and CBDs in Chengdu showed that the previous node-place and node-place-ridership models could not provide a fair and balanced classification of stations. In most case study locations, many people need to change lines, go to work, or return from work on weekdays. For example, the subway stations "South Railway Station" and "Chunxi Road" face a shortage of trains and of space for riders in the mornings from 6:00 to 9:00 and in the evenings from 17:00 to 20:00. This is due to the high frequency of morning and evening trips to and from the working destinations that can be reached via these stations. Moreover, Chunxi Road is one of the CBDs in Chengdu. There are many shopping malls, offices, consulates, visa centers, and training schools around Chunxi Road station. In the previous node-place and node-place-ridership models, time was not considered a leading dimension, so the reason for these unbalanced stations cannot be explained. These subway stations were designed and categorized without considering time as a significant factor. The classification of a station might change from balanced to unbalanced several times during the day. A fair comparison and investigation of the existing models showed that it is vital to provide a new and more accurate model that city planners, traffic policymakers, and governments can apply to classify stations based on the needs and demands of society.
The main contribution and novelty of this research are the Node-Place-Ridership-Time (NPRT) method and the Cube model with 27 classes, which provide accurate classifications for rail-transit stations during different time-spans. The NPRT model provides a new contribution to the TOD concept, leading to a more progressive and beneficial policy for cities. To obtain the NPRT model, we added a fourth dimension, time, to the node-place-ridership model to evaluate and classify the transit stations. The coordination between ridership and time influences the classification of stations and shows which stations are balanced and which are unbalanced. Without a comprehensive model, we could not establish the relative position of a transit station in the urban regional network. Such a model would assist city planners and governments in updating their applied policies.

Methodology and data

Approach

We consider all of Chengdu city as our case study. First, the Chengdu transit system and the study area are presented. Next, we provide a list of node, place, and ridership indicators. To make our research more accurate, we divide time into four classes to determine the effect of time on ridership at peak, off-peak, weekend, and other regular hours. Different data resources were used to collect the information for our target variables. We apply the Min–Max Normalization method to normalize our data. Afterward, we apply Information Entropy Weighting (IEW) to combine all indicators and create composite node and place value indexes. Then, to investigate the relationship between the four facets of node, place, ridership, and time, we apply the Multiple Linear Regression (MLR) method. This method is used to investigate the relations between different parameters and factors in scientific research works[17,18]. Afterward, we propose a comprehensive Node-Place-Ridership-Time (NPRT) model. To evaluate our work, we apply the NP and NPR models proposed by previous researchers alongside our proposed NPRT model. In this comparison, we use the same database and check the accuracy of all models.
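As a minimal sketch of the Min–Max Normalization step mentioned above, the following Python function rescales one indicator to the range [0, 1]. The function name and the sample entrance counts are illustrative assumptions; the counts were chosen only to span the N1 range reported in Table 3 (minimum 2, maximum 10).

```python
def min_max_normalize(values):
    """Rescale a list of numbers so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant indicator carries no contrast; map everything to 0 (assumed convention).
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical entrance/exit counts for three stations (not taken from the data set).
example_counts = [2, 5, 10]
normalized_counts = min_max_normalize(example_counts)
```

With these illustrative counts, a station with 5 entrances and exits maps to (5 − 2) / (10 − 2) = 0.375.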

K-Means method

K-Means clustering is one of the most popular unsupervised machine learning algorithms and an extensively used technique for cluster analysis. The goal of this algorithm is to find groups in the data, with the number of groups represented by the variable $K$. The algorithm works iteratively to assign each data point to one of $K$ groups based on the provided features. Data points are clustered based on feature similarity. The steps of K-Means are as follows:
Step 1: Give the parameter $K$, which is the number of groups we want the points to be assigned to.
Step 2: Randomly select $K$ points as the initial cluster centers $\mu_1, \mu_2, \ldots, \mu_K$.
Step 3: Calculate the distance between each point and each cluster center, then assign the point to its nearest center, based on the squared Euclidean distance:
$$c_i = \arg\min_{k \in \{1, \ldots, K\}} \lVert x_i - \mu_k \rVert^2 \qquad (1)$$
where $x_i$ is the point that needs to be assigned to one group.
Step 4: After assigning all the points to the groups, recompute the coordinates of each cluster center, which means replacing the cluster center with the new cluster center:
$$\mu_k = \frac{1}{|C_k|} \sum_{x_i \in C_k} x_i \qquad (2)$$
where $C_k$ means the $k$-th group, $|C_k|$ means the number of points in $C_k$, and $x_i$ means a point in $C_k$.
Step 5: Repeat Step 3 and Step 4 until a stopping criterion is met (i.e., no data points change clusters, the sum of the distances is minimized, or some maximum number of iterations is reached).
The value of $K$ indicates the number of clusters and is pre-defined. In this research, we used the Elbow method to select $K$ for the K-Means algorithm[19]. Based on Fig. 2, we can find the value of $K$ at the elbow of the curve, after which the Sum of Squared Errors no longer decreases sharply.
Figure 2

Elbow method and parameter K for the K-Means algorithm.
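The five K-Means steps and the Sum of Squared Errors used by the Elbow method can be sketched in plain Python as follows. This is an illustrative implementation under stated assumptions, not the code used in the study (which relied on "sklearn.cluster.KMeans"): the function names, the `seed` parameter, and the rule of keeping the old center when a cluster becomes empty are all assumptions.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, k, max_iter=100, seed=0):
    """Cluster `points` (a list of equal-length tuples) into `k` groups."""
    rng = random.Random(seed)
    # Step 2: pick k distinct points as the initial cluster centers.
    centers = [tuple(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(max_iter):
        # Step 3: assign each point to its nearest center.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centers[c]))
            for p in points
        ]
        # Step 5: stop when no point changes cluster.
        if new_labels == labels:
            break
        labels = new_labels
        # Step 4: move each center to the mean of its assigned points.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # assumed rule: an empty cluster keeps its old center
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers


def sum_squared_errors(points, labels, centers):
    """Within-cluster sum of squared errors, the quantity plotted by the Elbow method."""
    return sum(squared_distance(p, centers[lab]) for p, lab in zip(points, labels))
```

To apply the Elbow method with this sketch, one would run `k_means` for a range of `k` values, record `sum_squared_errors` for each, and look for the value of `k` after which the curve flattens.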

Cube method

To classify our stations, we also applied the Cube method, which has three dimensions: node value, place value, and ridership with respect to time. According to the Cube method, there are three main layers on each of the node, place, and ridership measurement values: Low Balanced (LB), Balanced (B), and High Balanced (HB), as shown in Fig. 3. The combination of layers on the node, place, and ridership values leads to 27 classes. Class 1 denotes stations that are LB in all three values, while class 27 represents stations that are HB in all three values. Class 14 means the station is balanced in all three dimensions during the defined time-span. This three-dimensional illustration conveys the coordination between the mentioned values and layers more clearly than the previous models.
Figure 3

Low Balanced (LB), Balanced (B), and High Balanced (HB) classes of the Cube Method.
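A possible encoding of the 27 Cube classes is sketched below. The text only fixes class 1 (LB in all three values), class 14 (B in all three), and class 27 (HB in all three); the layer thresholds of 1/3 and 2/3 on normalized values and the ordering of the intermediate classes (node as the most significant dimension) are assumptions made for illustration.

```python
def layer(value, low=1 / 3, high=2 / 3):
    """Map a normalized value in [0, 1] to a layer: 0 = LB, 1 = B, 2 = HB.

    The thresholds are hypothetical; the paper does not state them in this section.
    """
    if value < low:
        return 0
    if value < high:
        return 1
    return 2


def cube_class(node, place, ridership):
    """Return a class number from 1 to 27 for one station and one time-span.

    Assumed ordering: node layer is the most significant digit, ridership the least.
    """
    return 1 + 9 * layer(node) + 3 * layer(place) + layer(ridership)
```

Under this encoding, any ordering of the three dimensions reproduces classes 1, 14, and 27 as described; only the numbering of the mixed classes depends on the assumed ordering.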

Moreover, the Cube method also shows the density of stations in or around critical classes. Therefore, policymakers and city planners can easily understand whether their plans need to be revised to improve the efficiency of LB and HB stations. An accurate classification result from 1 to 27 would prove the efficiency of this method, as shown in Appendixes B and C. To understand the relationship between node, place, ridership, and time values, we applied the K-Means method[19], using the "sklearn.cluster.KMeans" class of the Python scikit-learn library, and also the Cube method to the NP, NPR, and our proposed NPRT models to classify Chengdu rail-transit stations, as shown in Appendixes B to F. As shown in Fig. 4, our research has four main steps. In each step, we apply specific methods to prepare, compose, and analyze our data in order to arrive at our NPRT model. Table 1 summarizes the application of all the methods used in this research work.
Figure 4

Research structure and applied methods.

Table 1

Description of applied methods.

Method | Application
Min–Max Normalization | For every feature, the minimum value of that feature is transformed into 0, the maximum value is transformed into 1, and every other value is transformed into a decimal between 0 and 1
IEW | Information Entropy Weighting (IEW) is used to combine all indicators and generate a composite node value index and place value index
MLR | Multiple Linear Regression (MLR) is used to model the linear relationship between the ridership and the node and place variables
MSE | Mean Squared Error (MSE) is the average squared difference between the estimated and actual values. The MSE is a measure of the quality of an MLR equation
R2 | In statistics, the coefficient of determination, denoted R2 or r2 and pronounced "R squared", is the proportion of the variance in the dependent variable that is predictable from the independent variables. R2 gives some information about the goodness of fit of an MLR equation
Adjusted R2 | Adjusted R2 is a particular form of R2, the coefficient of determination. R2 shows how well terms (data points) fit a curve or line. Adjusted R2 also indicates how well terms fit a curve or line but adjusts for the number of terms in a model
VIF | The variance inflation factor (VIF) measures multicollinearity among the variables of a multiple regression
F-Test | An F-test is any statistical test in which the test statistic has an F-distribution under the null hypothesis. It is most often used when comparing statistical models fitted to a data set to identify the model that best fits the population from which the data were sampled
T-Test | The T-Test is used to judge the significance of each independent variable. If it is significant, the variable significantly impacts the model
Elbow Method | The K-value (number of clusters) is a pre-defined parameter. We search for the optimal K-value using the Elbow method, at the point where the decrease in distortion (i.e., the within-cluster sum of squared errors) begins to level off
K-Means | We apply the K-Means method to cluster all stations by their node value, place value, and ridership
Cube | We apply the Cube method to cluster all stations by their node value, place value, and ridership
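The goodness-of-fit measures listed in Table 1 can be illustrated with a short Python sketch. The function names are hypothetical, and the formulas are the standard textbook definitions of MSE, R2, and adjusted R2 rather than code taken from the study.

```python
def mean_squared_error(actual, predicted):
    """Average squared difference between actual and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def r_squared(actual, predicted):
    """Share of the variance in `actual` that is explained by `predicted`."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


def adjusted_r_squared(r2, n_samples, n_predictors):
    """Penalize R2 for the number of predictors in the model."""
    return 1.0 - (1.0 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)
```

As a plausibility check, with 17 predictors (8 node and 9 place indicators) and an assumed sample of 202 stations (a figure not stated in this section), `adjusted_r_squared(0.4393, 202, 17)` gives about 0.3875, which is consistent with the IT1 column of Table 9.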

The case study area and Chengdu rail transit network

The Chengdu Metro system is considered the rapid rail-transit network of the capital city of Sichuan province, China, with a daily passenger flow of 5,906,123 rides. The system includes twelve subway lines and one light rail line, operated by Chengdu Rail Transit Group Company. Table 2 presents brief information about Chengdu subway lines. Figure 5 presents the Chengdu rail-transit stations.
Table 2

Chengdu metro lines.

Metro line | Operation date | Newest extension | Length (km) | Stations
1 | 2010 | 2018 | 40.99 | 35
2 | 2012 | 2014 | 42.32 | 32
3 | 2016 | 2018 | 49.89 | 37
4 | 2015 | 2017 | 43.28 | 30
5 | 2019 |  | 49.02 | 41
6 | 2020 |  | 68.88 | 56
7 | 2017 |  | 38.61 | 31
8 | 2020 |  | 29.1 | 25
9 | 2020 |  | 22.18 | 13
10 | 2017 | 2019 | 37.972 | 16
17 | 2020 |  | 26.15 | 9
18 | 2020 |  | 69.39 | 12
Tram R2 | 2018 | 2019 | 39.3 | 35
Figure 5

Chengdu rail-transit network and stations.


Node, place, ridership, and time indicators

Node indicators

We measure a station's node value by four facets: station facility, accessible transits, accessible destinations, and network centrality. Eight node indicators under these four facets are presented in Table 3.
Table 3

Node and place indicators.

Dimension | Branch | Indicator | Max | Mean | Min
Node value | Station facility | N1. Number of entrances and exits in each metro station (unit) | 10.0000 | 4.6535 | 2.0000
Node value | Accessible transits | N2. Number of metro stations that one station can reach within 20 min (unit) | 88.0000 | 41.9257 | 8.0000
Node value | Accessible transits | N3. Number of stations to CBD (Chunxi Road) (unit) | 23.0000 | 10.0792 | 0.0000
Node value | Accessible transits | N4. Number of stations to CBD (3rd Tianfu Street) (unit) | 33.0000 | 14.0446 | 0.0000
Node value | Accessible destinations | N5. Distance to CBD (Chunxi Road) (km) | 45.3230 | 13.6033 | 0.0000
Node value | Accessible destinations | N6. Distance to CBD (3rd Tianfu Street) (km) | 43.7770 | 18.3596 | 0.0000
Node value | Network centrality | N7. Degree centrality | 6.0000 | 2.4653 | 2.0000
Node value | Network centrality | N8. Closeness centrality (1/1000 km) | 0.0004 | 0.0003 | 0.0001
Place value | Design | P1. Average price of office land inside the 1000 m-radius catchment area (CNY/m2) | 74,000.0000 | 11,118.2658 | 5550.0000
Place value | Density | P2. Number of offices within 1000 m (unit) | 197.0000 | 26.7673 | 0.0000
Place value | Design | P3. Average price of commercial land inside the 1000 m-radius catchment area (CNY/m2) | 50,480.0000 | 21,285.5855 | 8571.0000
Place value | Density | P4. Number of shops within 1000 m (unit) | 397.0000 | 117.4554 | 1.0000
Place value | Design | P5. Average price of residential land inside the 1000 m-radius catchment area (CNY/m2) | 42,663.3077 | 18,405.8081 | 8423.0000
Place value | Density | P6. Number of residences within 1000 m (unit) | 552.0000 | 110.7970 | 1.0000
Place value | Diversity | P7. Number of public facilities (parks, cultural facilities, schools, hospitals) inside the 1000 m-radius catchment area (unit) | 41.0000 | 10.9208 | 0.0000
Place value | Design | P8. Number of parking lots inside the 500 m-radius catchment area (unit) | 132.0000 | 21.4851 | 0.0000
Place value | Design | P9. Number of bus stops inside the 500 m-radius catchment area (unit) | 26.0000 | 7.3515 | 1.0000
The station facility is measured by the number of entrances and exits (N1) in each metro station. The accessible transits are measured by the number of metro stations that one station can reach within 20 min (N2), the number of stations to the CBD at Chunxi Road (N3), and the number of stations to the CBD at 3rd Tianfu Street (N4). There are two CBDs in Chengdu: Chunxi Road and 3rd Tianfu Street. Therefore, we calculate the number of stations and the distance to both Chunxi Road and 3rd Tianfu Street. The accessible destinations are measured by the distance to the CBDs at Chunxi Road (N5) and 3rd Tianfu Street (N6). The network centrality consists of degree centrality (N7) and closeness centrality (N8). Based on graph modeling, we applied the network centrality to capture the impedance of a station in the transit network[20]. To translate the Chengdu rail-transit network into a graph $G = (V, E)$, we assign a set $V$ of vertices to indicate our stations, and the set of edges $E$ represents the station linkages. The transit traveling distance is used to weight the edges[21]. We measure the degree centrality (N7) of a transit station by the number of links connected to station $i$ in Eq. (3), wherein $a_{ij}$ represents the linkage between station $i$ and station $j$ (1 if the two stations are directly linked and 0 otherwise), and $N$ shows the number of all stations in set $V$[22]:
$$DC_i = \sum_{j = 1,\, j \neq i}^{N} a_{ij} \qquad (3)$$
Closeness centrality reflects the node's proximity and reachability within the network component. We measure the closeness centrality (N8) of station $i$ by the inverse of the sum of the shortest transit distances from station $i$ to all other stations in set $V$ in Eq. (4), wherein $d_{ij}$ denotes the shortest transit distance between station $i$ and station $j$:
$$CC_i = \frac{1}{\sum_{j \in V,\, j \neq i} d_{ij}} \qquad (4)$$
Table 4 presents the node indicator values of some stations.
Table 4

The values of node indicators in each subway station normalized by Min–Max Normalization method.

Subway station | N1 | N2 | N3 | N4 | N5 | N6 | N7 | N8
Weijianian | 0.3750 | 0.4375 | 0.3043 | 0.5455 | 0.1974 | 0.4479 | 0.0000 | 0.6289
Shengxian Lake | 0.2500 | 0.5625 | 0.2609 | 0.5152 | 0.1640 | 0.4133 | 0.0000 | 0.7216
North Railway Station | 0.5000 | 0.8875 | 0.2174 | 0.4848 | 0.1289 | 0.3769 | 0.5000 | 0.8593
Renmin Rd. North | 0.6250 | 0.8250 | 0.1739 | 0.4545 | 0.1028 | 0.3499 | 0.5000 | 0.8778
Wenshu Monastery | 0.5000 | 0.7875 | 0.1304 | 0.4242 | 0.0730 | 0.3191 | 0.0000 | 0.9332
Luomashi | 0.3750 | 1.0000 | 0.0870 | 0.3939 | 0.0535 | 0.2989 | 0.5000 | 0.9769
Tianfu Square | 1.0000 | 0.9750 | 0.0435 | 0.3636 | 0.0311 | 0.2756 | 0.5000 | 1.0000
Jinjiang Hotel | 0.2500 | 0.8250 | 0.0870 | 0.3333 | 0.0494 | 0.2566 | 0.0000 | 0.9720
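The two network centrality indicators (N7 and N8) described above can be sketched on a toy network as follows. The four-station graph, its distances, and the function names are hypothetical and serve only to illustrate the definitions: degree centrality counts the links of a station, and closeness centrality is the inverse of the summed shortest transit distances to all other stations.

```python
import heapq

# Hypothetical network: station -> {neighbouring station: track distance in km}.
TOY_NETWORK = {
    "A": {"B": 2.0},
    "B": {"A": 2.0, "C": 3.0, "D": 1.0},
    "C": {"B": 3.0},
    "D": {"B": 1.0},
}


def degree_centrality(graph, station):
    """Number of links directly connected to `station`."""
    return len(graph[station])


def shortest_distances(graph, source):
    """Dijkstra's algorithm: shortest distance from `source` to every reachable station."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale queue entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def closeness_centrality(graph, station):
    """Inverse of the sum of shortest distances from `station` to all other stations."""
    dist = shortest_distances(graph, station)
    total = sum(d for s, d in dist.items() if s != station)
    return 1.0 / total if total > 0 else 0.0
```

In this toy network, the interchange station "B" has the highest degree and the highest closeness, since every other station is reached from it without a transfer through another node.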

Place indicators

We use a 500-m and 1000-m radius to define the transit catchment area in Chengdu, considering the low-density context of some areas. We measure the station's place value by three facets: design, density, and diversity. Nine place indicators under three facets are presented in Table 3. Table 5 provides the place indicators values of some stations.
Table 5

The values of place indicators in each subway station normalized by Min–Max Normalization method.

Subway station | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9
Weijianian | 0.0564 | 0.0254 | 0.3667 | 0.1313 | 0.2113 | 0.0381 | 0.1463 | 0.0076 | 0.2800
Shengxian Lake | 0.0476 | 0.0152 | 0.3073 | 0.2449 | 0.2000 | 0.1162 | 0.2195 | 0.0379 | 0.0000
North Railway Station | 0.0737 | 0.1523 | 0.3318 | 0.5783 | 0.2018 | 0.2976 | 0.2195 | 0.2045 | 0.2800
Renmin Rd. North | 0.0747 | 0.3249 | 0.3494 | 0.6162 | 0.1987 | 0.4446 | 0.5854 | 0.2348 | 0.2400
Wenshu Monastery | 0.0798 | 0.5533 | 0.5013 | 0.5581 | 0.2449 | 0.6588 | 0.3659 | 1.0000 | 0.2400
Luomashi | 0.0798 | 0.6650 | 0.3337 | 0.6919 | 0.4187 | 0.8857 | 0.5854 | 0.4242 | 0.2400
Tianfu Square | 0.1032 | 0.8426 | 0.3615 | 0.6970 | 0.5505 | 0.6642 | 0.4634 | 0.5758 | 0.3200
Jinjiang Hotel | 0.0682 | 0.5838 | 0.3828 | 0.5707 | 0.6387 | 0.5245 | 1.0000 | 0.5530 | 0.2800
The design is measured by the average price of office land inside the 1000 m-radius catchment area (P1), the average price of commercial land inside the 1000 m-radius catchment area (P3), the average price of residential land inside the 1000 m-radius catchment area (P5), the number of parking lots inside the 500 m-radius catchment area (P8), and the number of bus stops inside the 500 m-radius catchment area (P9). The density is measured by the number of offices within 1000 m (P2), the number of shops within 1000 m (P4), and the number of residences within 1000 m (P6). The diversity consists of the number of public facilities (parks, cultural facilities, schools, hospitals) inside the 1000 m-radius catchment area (P7).

Ridership and time indicators

Because the NPR model does not consider the effect of time and ignores the difference between departing and arriving ridership, we record the tapped-in departure trips and the tapped-out arrival trips separately and construct an NPRT model by considering different conditions. As mentioned before, ridership has a direct relationship with time. Therefore, we categorized the passenger traffic into two groups: inbound traffic (I) and outbound traffic (O). We also divided the time into peak hours, off-peak hours, regular hours, and weekends (T1 to T4). This gives eight different conditions. IT1 means inbound traffic during working hours, IT2 means inbound traffic during off-hours, IT3 means inbound traffic during the rest of the working day, and IT4 means inbound traffic on the two weekend days. OT1 means the ridership of passengers leaving the station during working hours, OT2 means the ridership of passengers leaving the station during off-hours, OT3 means the ridership of passengers leaving the station during the rest of the working day, and OT4 means the ridership of passengers leaving the station on the two weekend days. The definition and time-span of each class from IT1 to OT4 are given in Table 6.
Table 6

Time class definition for the NPRT model.

Time | Definition | Days | Hours | Max | Mean | Min
IT1 | Inbound traffic during working hours | Monday to Friday | 6:00–9:00 | 27,654.3478 | 4451.6051 | 53.2609
IT2 | Inbound traffic during off-hours | Monday to Friday | 17:00–20:00 | 46,668.6087 | 4281.0174 | 113.6957
IT3 | Inbound traffic during the rest of the day | Monday to Friday | 9:00–17:00/20:00–23:00 | 54,702.0870 | 5156.2546 | 140.3043
IT4 | Inbound traffic on two days of the weekend | Saturday & Sunday | 6:00–23:00 | 51,955.6250 | 4607.0829 | 122.3750
OT1 | Passengers leaving the station during working hours | Monday to Friday | 6:00–9:00 | 56,982.3478 | 5456.5258 | 151.3913
OT2 | Passengers leaving the station during off-hours | Monday to Friday | 17:00–20:00 | 26,532.4783 | 4367.8530 | 69.4348
OT3 | Passengers leaving the station during the rest of the day | Monday to Friday | 9:00–17:00/20:00–23:00 | 33,976.5217 | 4064.4983 | 72.0000
OT4 | Passengers leaving the station on both days of the weekend | Saturday & Sunday | 6:00–23:00 | 55,496.8750 | 4607.0829 | 126.6250
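The eight conditions of Table 6 can be expressed as a small lookup, sketched here in Python. The function names, the weekday encoding (0 for Monday through 6 for Sunday), and the treatment of each interval as half-open (for example, 9:00 sharp falls into T3 rather than T1) are assumptions; the table itself does not specify how boundary times are handled.

```python
def time_class(weekday, hour):
    """Return "T1".."T4" for a day (0 = Monday .. 6 = Sunday) and an hour of day.

    `hour` may be fractional (7.5 means 7:30). Returns None outside 6:00-23:00.
    """
    if not 6 <= hour < 23:
        return None  # outside the time-spans defined in Table 6
    if weekday >= 5:
        return "T4"  # Saturday and Sunday, 6:00-23:00
    if hour < 9:
        return "T1"  # weekdays, 6:00-9:00
    if 17 <= hour < 20:
        return "T2"  # weekdays, 17:00-20:00
    return "T3"      # weekdays, 9:00-17:00 and 20:00-23:00


def condition(direction, weekday, hour):
    """Combine direction ("I" for inbound, "O" for outbound) with the time class."""
    t = time_class(weekday, hour)
    return None if t is None else direction + t
```

For example, a tap-in on a Monday at 7:30 falls under IT1, and a tap-out on a Saturday at 10:00 falls under OT4.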
Table 7 shows the ridership values of some stations during the eight time-spans mentioned above.
Table 7

Ridership during different time, normalized by Min–Max Normalization method.

Subway station | IT1 | IT2 | IT3 | IT4 | OT1 | OT2 | OT3 | OT4
Weijianian | 0.2506 | 0.0231 | 0.0409 | 0.0509 | 0.0499 | 0.1558 | 0.0233 | 0.0407
Shengxian Lake | 0.1132 | 0.0221 | 0.0371 | 0.0323 | 0.0417 | 0.0813 | 0.0343 | 0.0292
North Railway Station | 0.2507 | 0.1292 | 0.1828 | 0.1682 | 0.2040 | 0.2335 | 0.1712 | 0.1671
Renmin Rd. North | 0.1900 | 0.1732 | 0.1661 | 0.1483 | 0.1550 | 0.2221 | 0.2250 | 0.1387
Wenshu Monastery | 0.1748 | 0.1385 | 0.1501 | 0.1166 | 0.1415 | 0.1799 | 0.2142 | 0.1127
Luomashi | 0.1584 | 0.3132 | 0.2707 | 0.1533 | 0.2191 | 0.2026 | 0.5660 | 0.1559
Tianfu Square | 0.1006 | 0.4388 | 0.3528 | 0.2690 | 0.3215 | 0.2206 | 0.6221 | 0.2636
Jinjiang Hotel | 0.0574 | 0.1339 | 0.0879 | 0.0539 | 0.0742 | 0.0789 | 0.2019 | 0.0534
As for data sources and processing, the numbers of entrances and exits, offices, shops, residences, parking lots, and bus stops were acquired from Amap (https://www.amap.com/) and SOSO (https://map.qq.com/). The number of stations that one station can reach within 20 min, the number of stations to the CBDs, the distance to the CBDs, and the closeness centrality were acquired and calculated via the API of the Chengdu Metro website (https://www.chengdurail.com/index_en.html). The degree centrality was obtained from the 2021 map of the Chengdu Metro network shown in Fig. 5. The average prices of office, commercial, and residential land were acquired from Anjuke (https://chengdu.anjuke.com/) and Fang (https://cd.newhouse.fang.com/). We collected the ridership of all stations from Chengdu Metro. Each station counts both tapped-in departure trips and tapped-out arrival trips for its ridership statistics. All the data were acquired in March 2021.

Information entropy weighting (IEW)

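To compose the indicators for our data analysis, we applied Information Entropy Weighting (IEW)[23] to provide a composite node or place value index. We use the IEW method to integrate N1–N8 into one node value and P1–P9 into one place value. First, the decision matrix $X$ should be constructed, as shown in Eq. (5). It consists of $m$ stations and $n$ node value indicators, and $x_{ij}$ indicates the value of indicator $j$ at station $i$:
$$X = (x_{ij})_{m \times n} = \begin{pmatrix} x_{11} & \cdots & x_{1n} \\ \vdots & \ddots & \vdots \\ x_{m1} & \cdots & x_{mn} \end{pmatrix} \qquad (5)$$
We apply Eq. (6) to normalize the decision matrix:
$$x'_{ij} = \frac{x_{ij} - \min_{i} x_{ij}}{\max_{i} x_{ij} - \min_{i} x_{ij}} \qquad (6)$$
Then, $p_{ij}$ computes the proportion of station $i$ for indicator $j$:
$$p_{ij} = \frac{x'_{ij}}{\sum_{i=1}^{m} x'_{ij}} \qquad (7)$$
We can calculate the entropy value $e_j$ of indicator $j$ in Eq. (8), knowing that if $p_{ij} = 0$, then $p_{ij} \ln p_{ij} = 0$:
$$e_j = -\frac{1}{\ln m} \sum_{i=1}^{m} p_{ij} \ln p_{ij} \qquad (8)$$
In the next step, we need to calculate the imbalance coefficient $d_j$ using Eq. (9):
$$d_j = 1 - e_j \qquad (9)$$
$w_j$ is the weight of indicator $j$, which can be extracted from Eq. (10):
$$w_j = \frac{d_j}{\sum_{j=1}^{n} d_j} \qquad (10)$$
Then, to compose the node value index $NV_i$ for station $i$, we can apply Eq. (11):
$$NV_i = \sum_{j=1}^{n} w_j \, x'_{ij} \qquad (11)$$
Afterward, we need to normalize the node value index between 0 and 1. In Eq. (12), $NV$ indicates the array of node value indexes, $m$ is the number of stations, and $i$ is the target station:
$$NV_i^{*} = \frac{NV_i - \min_{1 \le k \le m} NV_k}{\max_{1 \le k \le m} NV_k - \min_{1 \le k \le m} NV_k} \qquad (12)$$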
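The entropy weighting procedure can be sketched in Python as follows. This follows the standard entropy weight method (Min–Max normalization, proportions, entropy, weights, weighted sum, final rescaling); the function names and the handling of constant indicators, which are given zero weight, are assumptions rather than details confirmed by the text.

```python
import math


def min_max(values):
    """Min-Max normalization of one column; a constant column maps to zeros."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) if hi > lo else 0.0 for v in values]


def entropy_weights(matrix):
    """Weights for the indicators (columns) of a stations-by-indicators matrix.

    Requires at least two stations (rows), since the entropy is scaled by ln(m).
    """
    m = len(matrix)
    columns = [min_max(col) for col in zip(*matrix)]
    diversities = []
    for col in columns:
        total = sum(col)
        if total == 0:
            diversities.append(0.0)  # assumed: a constant indicator carries no information
            continue
        entropy = 0.0
        for value in col:
            p = value / total
            if p > 0:  # convention: p * ln(p) = 0 when p = 0
                entropy -= p * math.log(p)
        diversities.append(1.0 - entropy / math.log(m))
    d_sum = sum(diversities)
    if d_sum == 0:
        return [1.0 / len(columns)] * len(columns)
    return [d / d_sum for d in diversities]


def composite_index(matrix):
    """Weighted sum of normalized indicators per station, rescaled to [0, 1]."""
    weights = entropy_weights(matrix)
    columns = [min_max(col) for col in zip(*matrix)]
    raw = [sum(w * col[i] for w, col in zip(weights, columns)) for i in range(len(matrix))]
    return min_max(raw)
```

Indicators whose values differ more strongly between stations receive a lower entropy and therefore a larger weight in the composite index.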

Results and discussion

Equations

We obtain the equations through Multiple Linear Regression (MLR). Table 8 lists the constants and variable coefficients of our equations. The results of our MLR models are presented in Table 9.
Table 8

Constants and variable coefficients of MLR models.

| Coefficient | IT1 | IT2 | IT3 | IT4 | OT1 | OT2 | OT3 | OT4 |
|---|---|---|---|---|---|---|---|---|
| α | 0.5098 | 0.1951 | 0.2225 | 0.2439 | 0.2318 | 0.4979 | 0.1893 | 0.2196 |
| β1 | −0.0409 | −0.0596 | −0.0915 | −0.0747 | −0.0809 | −0.0804 | −0.0572 | −0.0722 |
| β2 | −0.223 | 0.0848 | −0.005 | −0.0224 | −0.0312 | −0.081 | 0.0797 | −0.0076 |
| β3 | 0.169 | 0.0997 | 0.0111 | 0.0109 | 0.0087 | 0.1586 | 0.1433 | 0.0077 |
| β4 | −0.3049 | −0.1247 | −0.1031 | −0.1239 | −0.1118 | −0.304 | −0.1265 | −0.1063 |
| β5 | −0.6797 | −0.3089 | −0.2937 | −0.3087 | −0.3025 | −0.7084 | −0.315 | −0.281 |
| β6 | 0.0204 | 0.0581 | 0.087 | 0.0893 | 0.0874 | 0.0922 | 0.0217 | 0.0831 |
| β7 | 0.1211 | 0.1132 | 0.1715 | 0.1486 | 0.1571 | 0.1573 | 0.1334 | 0.141 |
| β8 | −0.2113 | −0.2221 | −0.1939 | −0.2033 | −0.1928 | −0.3575 | −0.184 | −0.2047 |
| γ1 | 0.0825 | −0.0686 | −0.0762 | −0.0621 | −0.0591 | 0.0345 | −0.086 | −0.0592 |
| γ2 | −0.2814 | 0.4859 | 0.2112 | 0.1221 | 0.1617 | −0.0831 | 0.6999 | 0.1378 |
| γ3 | 0.0568 | 0.0161 | 0.0242 | 0.0254 | 0.0356 | 0.073 | 0.0187 | 0.032 |
| γ4 | 0.4188 | 0.1068 | 0.1719 | 0.2367 | 0.2271 | 0.4613 | 0.017 | 0.2269 |
| γ5 | 0.1175 | 0.0382 | 0.0354 | 0.0244 | 0.0272 | −0.0449 | 0.0201 | 0.0275 |
| γ6 | −0.1733 | −0.1741 | −0.1119 | −0.1041 | −0.1129 | −0.1663 | −0.2469 | −0.1006 |
| γ7 | 0.0711 | −0.0558 | −0.0085 | −0.028 | −0.0204 | 0.0016 | −0.0084 | −0.0298 |
| γ8 | 0.1597 | 0.0447 | 0.0686 | 0.0485 | 0.0642 | 0.1113 | 0.0908 | 0.0427 |
| γ9 | 0.1273 | 0.0234 | 0.0305 | 0.0196 | 0.0311 | 0.1049 | 0.0523 | 0.0187 |
Table 9

MLR models results.

| Statistic | IT1 | IT2 | IT3 | IT4 | OT1 | OT2 | OT3 | OT4 |
|---|---|---|---|---|---|---|---|---|
| Adjusted R2 | 0.3875 | 0.5946 | 0.3221 | 0.2689 | 0.3284 | 0.3499 | 0.6981 | 0.2983 |
| R2 | 0.4393 | 0.6289 | 0.3794 | 0.3307 | 0.3852 | 0.4049 | 0.7236 | 0.3576 |
| MSE | 0.0090 | 0.0053 | 0.0081 | 0.0068 | 0.0067 | 0.011 | 0.0065 | 0.0059 |
| F-Test | 8.4800 **** | 18.339 **** | 6.6173 **** | 5.3487 **** | 6.7814 **** | 7.3642 **** | 28.34 **** | 6.0256 **** |
| T-Constant | 4.2463 **** | 2.122 ** | 1.9528 * | 2.3366 ** | 2.2334 ** | 3.7494 **** | 1.8495 * | 2.2587 ** |
| T-N1 | −0.9934 | −1.8903 * | −2.3418 ** | −2.0869 ** | −2.2731 ** | −1.7656 * | −1.6297 | −2.1656 ** |
| T-N2 | −1.6851 * | 0.8367 | −0.0398 | −0.1947 | −0.2727 | −0.5534 | 0.7064 | −0.0709 |
| T-N3 | 1.5681 | 1.2079 | 0.1085 | 0.1163 | 0.0934 | 1.3304 | 1.5596 | 0.0882 |
| T-N4 | −3.0495 *** | −1.6286 | −1.0865 | −1.4253 | −1.2935 | −2.7489 *** | −1.4841 | −1.3129 |
| T-N5 | −3.6314 **** | −2.155 ** | −1.6533 * | −1.8969 * | −1.8695 * | −3.4217 **** | −1.974 ** | −1.8538 * |
| T-N6 | 0.1772 | 0.659 | 0.7963 | 0.8922 | 0.8782 | 0.7241 | 0.2211 | 0.8913 |
| T-N7 | 2.7893 *** | 3.4046 **** | 4.1622 **** | 3.9367 **** | 4.1857 **** | 3.2756 **** | 3.6041 **** | 4.0104 **** |
| T-N8 | −1.0942 | −1.5018 | −1.058 | −1.2109 | −1.1549 | −1.6737 * | −1.1177 | −1.309 |
| T-P1 | 0.9914 | −1.0764 | −0.9648 | −0.8583 | −0.8215 | 0.3748 | −1.2122 | −0.8785 |
| T-P2 | −3.9816 **** | 8.9774 **** | 3.1487 *** | 1.9871 ** | 2.6466 *** | −1.063 | 11.616 **** | 2.4077 ** |
| T-P3 | 0.7343 | 0.2718 | 0.3297 | 0.3777 | 0.5324 | 0.8533 | 0.2836 | 0.5109 |
| T-P4 | 4.7953 **** | 1.5968 | 2.0739 ** | 3.1172 *** | 3.0079 *** | 4.7753 **** | 0.2283 | 3.2082 *** |
| T-P5 | −2.0838 ** | 0.8846 | 0.6615 | 0.4977 | 0.558 | −0.7199 | 0.4181 | 0.6022 |
| T-P6 | −1.9438 * | −2.5499 ** | −1.3225 | −1.343 | −1.4648 | −1.6863 * | −3.2483 **** | −1.3933 |
| T-P7 | 1.0671 | −1.0935 | −0.1344 | −0.4833 | −0.3542 | 0.0217 | −0.1479 | −0.5523 |
| T-P8 | 1.7577 * | 0.6424 | 0.7956 | 0.614 | 0.8174 | 1.1075 | 1.1723 | 0.5803 |
| T-P9 | 2.4117 ** | 0.5789 | 0.6088 | 0.4271 | 0.6815 | 1.7967 * | 1.1622 | 0.4375 |

Variance Inflation Factor (VIF) = 10.7532.

If p value < 0.001 ⇒ ****; p value < 0.01 ⇒ ***; p value < 0.05 ⇒ **; p value < 0.1 ⇒ *.

The general format of our MLR equations is as follows:

$$R = \alpha + \sum_{i=1}^{8} \beta_i N_i + \sum_{j=1}^{9} \gamma_j P_j \tag{13}$$

where $\alpha$ is the equation constant, and $\beta_i$ and $\gamma_j$ are the coefficients of the node value indicators $N_i$ and the place value indicators $P_j$, respectively. To better understand how a station's node value and place value impact its ridership at different times, we analyze our eight MLR models below.

From the parameters of the eight MLR models, we can see that the number of entrances and exits, the number of stations to CBD (3rd Tianfu Street), the distance to CBD (Chunxi Road), closeness centrality, and the number of residences within 1000 m are negatively associated with ridership in all time-spans. The number of stations to CBD (Chunxi Road), the distance to CBD (3rd Tianfu Street), degree of centrality, the average price of commercial land, the number of shops within 1000 m, and the number of parking lots and bus stops inside the 500 m-radius catchment area are positively associated with ridership in all time-spans. The number of stations that one station can reach within 20 min is positively associated with ridership of stations in off-peak hours and with ridership of exiting stations in other hours on working days, and is negatively associated with ridership at other times. The average price of office land and the number of public facilities are positively associated with ridership of entering stations in peak hours and with ridership of exiting stations in off-peak hours, and are negatively associated with ridership at other times.

The number of offices within 1000 m and the average price of residential land are negatively associated with ridership of entering stations in peak hours and with ridership of exiting stations in off-peak hours, and are positively associated with ridership at other times. The distance to CBD (Chunxi Road) is significantly negatively associated with ridership of entering stations in peak hours and ridership of exiting stations in off-peak hours. The number of offices within 1000 m is significantly positively associated with ridership of entering stations in off-peak hours and ridership of exiting stations in other hours on working days. The number of shops is strongly positively associated with ridership of entering stations in peak hours and ridership of exiting stations in off-peak hours.

Using Table 8 in the MLR Eq. (13), we have eight equations, from $R_{IT1}$ to $R_{OT4}$. For instance, the equation of inbound traffic during working hours from 6:00 a.m. to 9:00 a.m. (IT1) would be as follows:

$$R_{IT1} = 0.5098 - 0.0409 N_1 - 0.223 N_2 + 0.169 N_3 - 0.3049 N_4 - 0.6797 N_5 + 0.0204 N_6 + 0.1211 N_7 - 0.2113 N_8 + 0.0825 P_1 - 0.2814 P_2 + 0.0568 P_3 + 0.4188 P_4 + 0.1175 P_5 - 0.1733 P_6 + 0.0711 P_7 + 0.1597 P_8 + 0.1273 P_9$$

All variables have been 0–1 normalized by Min–Max Normalization for the model input, shown in Appendix A. The variance inflation factor (VIF) is approximately equal to 10, indicating no severe multicollinearity. The adjusted R2 and R2 are greater than 0.25, showing that the results are promising in model fitting. When using 0.05 as a significance level threshold, the F-test shows that our MLR models are significant.

The T-test shows that the number of stations to CBD (3rd Tianfu Street), the distance to CBD (Chunxi Road), degree of centrality, the number of offices within 1000 m, the number of shops within 1000 m, the average price of residential land, and the number of bus stops are significant in equation IT1. The distance to CBD (Chunxi Road), degree of centrality, the number of offices, and the number of residences are significant in equation IT2. The number of entrances and exits, degree of centrality, the number of offices, and the number of shops are significant in equations IT3, IT4, and OT1. The number of stations to CBD (3rd Tianfu Street), the distance to CBD (Chunxi Road), degree of centrality, and the number of shops are significant in equation OT2. The distance to CBD (3rd Tianfu Street), the number of offices, and the number of residences are significant in equation OT3. The number of entrances and exits, degree of centrality, the number of offices, and the number of shops are significant in equation OT4.
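As a worked illustration of how Table 8 feeds the general MLR form, the sketch below evaluates the IT1 equation for a set of normalized inputs. The coefficients are copied from the IT1 column of Table 8; the function name `predict_ridership` and the example input vectors are hypothetical and do not describe any real station. Inputs are assumed to be already Min–Max normalized to [0, 1], and the result is a raw linear prediction, so it is not guaranteed to stay inside [0, 1].

```python
# IT1 column of Table 8: constant, node coefficients (beta1..beta8),
# and place coefficients (gamma1..gamma9).
ALPHA_IT1 = 0.5098
BETA_IT1 = [-0.0409, -0.223, 0.169, -0.3049, -0.6797, 0.0204, 0.1211, -0.2113]
GAMMA_IT1 = [0.0825, -0.2814, 0.0568, 0.4188, 0.1175,
             -0.1733, 0.0711, 0.1597, 0.1273]

def predict_ridership(node, place,
                      alpha=ALPHA_IT1, beta=BETA_IT1, gamma=GAMMA_IT1):
    """Evaluate alpha + sum(beta_i * N_i) + sum(gamma_j * P_j)."""
    if len(node) != len(beta) or len(place) != len(gamma):
        raise ValueError("expected 8 node and 9 place indicator values")
    node_term = sum(b * x for b, x in zip(beta, node))
    place_term = sum(g * x for g, x in zip(gamma, place))
    return alpha + node_term + place_term

# Hypothetical inputs: every indicator at its minimum, then only the
# fifth node indicator raised to its maximum.
baseline = predict_ridership([0.0] * 8, [0.0] * 9)
only_n5 = predict_ridership([0, 0, 0, 0, 1.0, 0, 0, 0], [0.0] * 9)
```

With all indicators at zero the prediction equals the constant 0.5098; raising only the fifth node indicator to 1 lowers it by 0.6797, which reflects the large negative IT1 coefficient β5.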

Methods and classification results

The coordination between ridership and time influences the classification of stations and determines which stations are balanced and which are unbalanced. Using the node value, place value, and ridership extracted in four time-spans, the K-Means method produced five classes. Figure 6 summarizes the classification results extracted from the K-Means method for our proposed NPRT model.
Figure 6

Number of stations in K-Means classification method for NPRT model.
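For readers unfamiliar with K-Means, the sketch below shows the basic assignment and update loop (Lloyd's algorithm) applied to [node, place, ridership] vectors. It is only an illustration under stated assumptions: Euclidean distance, caller-supplied initial centers, and K = 2 on four stations, whereas the study uses K = 5 on all stations and does not describe its implementation. The four vectors are the IT1 values reported for Chunxi Road, Financial City, Southwest Jiaotong University, and Xipu; the function name `kmeans` is hypothetical, and `math.dist` requires Python 3.8 or later.

```python
import math

def kmeans(points, centroids, max_iter=100):
    """Lloyd's algorithm; `centroids` holds the initial cluster centers."""
    centroids = [tuple(c) for c in centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            min(range(len(centroids)),
                key=lambda k: math.dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers are too
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old center if a cluster is empty
                centroids[k] = tuple(sum(col) / len(members)
                                     for col in zip(*members))
    return labels, centroids

# IT1 [node, place, ridership] vectors for four stations.
stations = [
    (0.5185, 0.8886, 0.1691),  # Chunxi Road
    (0.0605, 0.4265, 0.054),   # Financial City
    (0.5718, 0.4296, 0.089),   # Southwest Jiaotong University
    (0.4523, 0.2703, 1.0),     # Xipu
]
# Assumption: start from the first and last station as initial centers.
labels, centers = kmeans(stations, [stations[0], stations[3]])
```

In this toy run the three low-ridership stations share one cluster and Xipu, with its IT1 ridership of 1.0, forms the other, which shows how the ridership dimension alone can separate stations.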

In each model from IT1 to OT4 (see Appendices B and F and Fig. 6), the NPRT values show that a station can be balanced or unbalanced; can have low, medium, high, or extremely high ridership; and can be under stress or dependent. For example, in the model IT1, Xipu station, with values of [0.4523, 0.2703, 1.0] for the node value, place value, and ridership, is categorized in class 4, with high ridership and balanced, while Chunxi Road station, with values of [0.5185, 0.8886, 0.1691], is in class 5, a low-ridership and unbalanced place. By comparison, in the IT4 model (weekends), Xipu station falls into class 4 with medium ridership and balanced, with NPR values of [0.4523, 0.2703, 0.4807]. Chunxi Road station in the same model has values of [0.5185, 0.8886, 1.0], falling into class 5, with extremely high ridership and an unbalanced place. Therefore, although the node and place values are essential factors in our classification model, the ridership at different time-spans can significantly change the results. As already mentioned, in both the K-Means and Cube methods, the concept of ridership is influenced by time. The relationship between ridership and time is also supported by the results of the Cube method. The number of stations in each class of the Cube method is presented in Fig. 7. Based on Fig. 3, class 1 has low node, place, and ridership values, while class 27 comprises high node, place, and ridership values.
Figure 7

Number of stations in Cube classification method for NPRT model.

According to Appendices C and F and Fig. 7, class 2 includes 52.8% to 58.04% of Chengdu rail-transit stations. In contrast, some other classes, such as class 1, contain 0% of the stations. This difference is not only due to low or high node and place values; time, as a significant factor in ridership, also causes these differences. Some stations have good node and place values. If we treated the ridership as a constant value during our investigation, the results would differ from reality: a station can be balanced at one hour but unbalanced at another. For instance, in the OT1 model, Chunxi Road, with NPR values of [0.5185, 0.8886, 1.0], is classified in class 27 (high node, place, and ridership values), whereas in the IT1 model the same station is in class 9, [0.5185, 0.8886, 0.1691], with high node and place values but low ridership. Chunxi Road is one of the CBDs of Chengdu. It is surrounded by many shopping malls, companies, institutions, consulates and visa centers, and headquarters. During the time-span OT1, 6:00–9:00 a.m. on weekdays (Table 6), the number of passengers going to work in the Chunxi Road area is considerably higher than the number of people traveling from this location to other parts of the city (IT1 model). Therefore, we can see the influence of time on the ridership and, consequently, on the classification of a station.

To compare our proposed NPRT model with the NP model by Bertolini[1] and the NPR model by Zhejing Cao[14], we also applied the NP and NPR models to our case study area and rail-transit stations, as presented in Appendices D to F. Given the many subway stations in Chengdu and the large number of classification results, we provide the results of four stations in Table 10 as a sample. Table 10 shows the station classifications resulting from the K-Means method and the Cube model, using NP, NPR, and NPRT.

The results show that Chunxi Road was classified as an unbalanced station with high NPR values over different time-spans, while Financial City is a balanced station during OT3 (class 14 of the Cube model). According to our previous discussion of Fig. 2, there are five classes in the K-Means method (K = 5). In comparison, the Cube model provides 27 classes, leading to more accurate classifications. For instance, in Table 10, the K-Means method gives the Chunxi Road station the same result (class 5) during the time-spans IT1 and IT2, whereas the Cube method puts this station in class 9 during IT1 and class 27 during IT2. The results for the Xipu station show the same pattern for the time-spans IT1, IT2, OT3, and OT4.
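The 27 Cube classes can be read as three levels (low, medium, high) on each of the node, place, and ridership axes. The sketch below shows one way to turn three levels into a class number. The indexing formula is an inference, not something stated in the text: it was chosen because it matches the description of class 1 (all low), class 14 (fully balanced), and class 27 (all high), and because it reproduces the Cube classes reported for the four sample stations when their levels are assigned as in the comments. The cut points that separate low, medium, and high are defined in Fig. 3 and are not reproduced here, so the function `cube_class` (a hypothetical name) takes levels rather than raw values.

```python
def cube_class(node_level, place_level, ridership_level):
    """Map three levels (0 = low, 1 = medium, 2 = high) to a class in 1..27.

    Assumed ordering: node varies fastest, then place, then ridership.
    """
    for lvl in (node_level, place_level, ridership_level):
        if lvl not in (0, 1, 2):
            raise ValueError("each level must be 0, 1, or 2")
    return 1 + node_level + 3 * place_level + 9 * ridership_level

# Levels below are inferred from the reported classes, not taken from
# the paper's thresholds.
chunxi_it1 = cube_class(2, 2, 0)     # high node, high place, low ridership
chunxi_ot1 = cube_class(2, 2, 2)     # same station, high ridership
financial_ot3 = cube_class(1, 1, 1)  # medium on all three axes
xipu_it1 = cube_class(2, 0, 2)       # high node, low place, high ridership
```

Under this assumed ordering, changing only the ridership level shifts the class by 9 or 18, which is why Chunxi Road jumps from class 9 to class 27 between time-spans while its node and place values stay fixed.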
Table 10

Stations classification results (NP, NPR, NPRT).

StationMethodNode value (N)Place value (P)Ridership value (R)Time span (T)Class type
Chunxi RoadNP0.51850.88864
NPR [K–Means]0.51850.88861.05
NPRT [K-Means]0.51850.88860.1691IT15
NPRT [Cube]9
NPRT [K-Means]0.51850.88861.0IT25
NPRT [Cube]27
NPRT [K-Means]0.51850.88861.0IT35
NPRT [Cube]27
NPRT [K-Means]0.51850.88861.0IT45
NPRT [Cube]27
NPRT [K-Means]0.51850.88861.0OT15
NPRT [Cube]27
NPRT [K-Means]0.51850.88861.0OT25
NPRT [Cube]27
NPRT [K-Means]0.51850.88860.853OT35
NPRT [Cube]27
NPRT [K-Means]0.51850.88861.0OT45
NPRT [Cube]27
Financial CityNP0.06050.4265 –4
NPR [K-Means]0.06050.42650.20962
NPRT [K-Means]0.06050.42650.054IT13
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.3215IT23
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.1412IT32
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.0671IT42
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.1212OT12
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.0852OT23
NPRT [Cube]5
NPRT [K-Means]0.06050.42650.5194OT33
NPRT [Cube]14
NPRT [K-Means]0.06050.42650.0747OT42
NPRT [Cube]5
Southwest Jiaotong UniversityNP0.57180.42962
NPR [K-Means]0.57180.42960.08284
NPRT [K-Means]0.57180.42960.089IT14
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0678IT24
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0658IT34
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0533IT44
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0669OT14
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0982OT24
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.1058OT34
NPRT [Cube]6
NPRT [K-Means]0.57180.42960.0555OT44
NPRT [Cube]6
XipuNP0.45230.27032
NPR [K-Means]0.45230.27030.54614
NPRT [K-Means]0.45230.27031.0IT14
NPRT [Cube]21
NPRT [K-Means]0.45230.27030.2357IT24
NPRT [Cube]3
NPRT [K-Means]0.45230.27030.4422IT34
NPRT [Cube]12
NPRT [K-Means]0.45230.27030.4807IT44
NPRT [Cube]12
NPRT [K-Means]0.45230.27030.4997OT14
NPRT [Cube]12
NPRT [K-Means]0.45230.27030.8806OT25
NPRT [Cube]21
NPRT [K-Means]0.45230.27030.2356OT34
NPRT [Cube]3
NPRT [K-Means]0.45230.27030.4351OT44
NPRT [Cube]12
Moreover, the results show that a classification used to check station efficiency would not be accurate without considering the relationship between ridership value and time. For example, comparing Chunxi Road station in three different models, we can see that the node value, place value, and ridership in the NP, NPR, and NPRT IT1 models are [0.5185, 0.8886, –], [0.5185, 0.8886, 1.0], and [0.5185, 0.8886, 0.1691], respectively. This station's ridership is extremely high in the NPR model but low in the NPRT IT1 model. The same holds for some other stations, such as North Railway Station, Wenshu Monastery, Tianfu Square, Sichuan Gymnasium, Hi-Tech Zone, Financial City, and Century City. Therefore, as one of the most important factors in policymaking, the ridership should be considered with respect to the critical time-spans, from IT1 to OT4.

The periods IT1 to OT4, the NPRT method, and the Cube model can assist policymakers and city planners in updating their applied policies. We can consider the Chunxi Road station as an example. The NPR values at the time-span OT1, [0.5185, 0.8886, 1.0], and its class 27 would let the municipal government know that this location needs some charter trains at the time-span OT1. The charter trains would travel directly between the most frequently demanded stations and Chunxi Road. The Chengdu Metro Co. can calculate the frequency and number of required charter trains by knowing the number of riders during the critical time-span. Moreover, since the Chunxi Road station is located at the junction of lines 2 and 3, the Chengdu Metro Co. would be able to find the Low Balanced (LB) stations on lines 2 and 3 at the time-spans T1. Therefore, every second train can stop at the LB stations during the time-span T1. Southwest Jiaotong University is almost a steady station (class 6). Since this station is located in the Jinniu district, the Jinniu municipal government would be able to apply the results of this study in their potential plans to move the classification of this station toward class 14, which corresponds to a fully balanced station. These are some examples of potential revised policies based on this study's innovation in developing the station classifications. With the 27 classes from the Cube model and the NPRT method, governments can access accurate classification results of the stations during the critical time-spans T1 to T4 to implement appropriate policies and enhance the rail-transit network efficiency.

Conclusion

In this research, we conducted a case study on Chengdu rail-transit stations to present the relationship between node, place, and ridership. Since the number of riders is not constant during the day and over the week, we divided our investigation into four time-spans. The results showed that ridership has a direct relationship with time, so we included this factor in our study. After collecting the data and providing all the influential parameters, Multiple Linear Regression (MLR) was applied to create our Node-Place-Ridership-Time (NPRT) equations. MLR is a constructive method to model the coordination and relationship between the effective parameters of the NPRT model. We developed our classifications using the K-Means and Cube methods and analyzed the results. Stations with exemplary node and place values are not necessarily balanced or efficient, since the ridership and time-span also play essential roles. Policymakers, city planners, and governments need to apply NPRT models to analyze the efficiency of transit stations. Compared with the Node-Place (NP) and Node-Place-Ridership (NPR) models presented by previous researchers, our proposed NPRT model provides more accurate results.

Possible directions for future studies

This research investigated the impact of node, place, and time values on ridership to present the NPRT model for classifying rail-transit stations. However, the effect of ridership on node and place values, which leads to a bi-directional relationship between the dependent and independent variables, remains an open question for future studies. Moreover, the effect of the economy, ecology, and sociodemographic characteristics (such as transit mode share, household going-out rate, and age composition) on the NPRT model would be essential for future studies.