Political regimes are a means of society in sovereign states to decide policies and alternate their political leaders. A society is said to have a democratic regime when they can determine a policy in a collective consensus by delegating political representatives through a regular competitive election, while others have autocratic regimes when these methods of policy and leaders selection are absent (e.g. Cheibub, Ghandi, and Vreeland 2010).
The choice of political regimes have been a great concerned in social science literature. Some argue that a feature in political regimes determine whether a country is more likely to go to a war with other countries (e.g. Weeks 2012) and other suggest that when political leaders can stay in power as much as they want, they would have less incentive to produce policies that generate welfare to the many (e.g. Sen 1999).
The purpose of this report is to examine a latent process of which a society to choose democracy as the mode of their political regime. The unit of analysis is a society with sovereign states, that is a centralized organization in a specific world’s territory that can project internal order and command external respects from other states through the monopolization of military technology (e. g. Weber [1918] 1946 and Mann 1984).
In a game-theoretic literature, it is common to model a sovereign state as a political economy with two players: elites and the mass (e. g. Boix 2003; Acemoglu and Robinson 2006). Elites want to defend their privileged and wealth by controlling the state administrations, hence preferring to establish autocracy, while the mass want a redistribution of wealth and power by expanding participation in policy-making process and leadership selection, hence preferring a democracy.
According to one of these model (e.g. Acemoglu and Robinson 2006), elite will repress the mass until the cost of repression marginally increase due to revolutionary threats. At the time of marginal increased in the cost of repression, elites will compromise with the mass and establish a democratic regime.
Exploiting insights from this literature, in the following report, I will model a democratization, a shifting from autocratic to democratic regime, as a partially observed Markov model (POMP) with a discrete compartment population.
The model developed in this report suggests that democracies are a result of negotiation (henceforth \(N\)) between powerful elites (henceforth \(P\)) and the mass when the threats of revolution increases (henceforth \(R\)). This idea describes in the following diagram. Denote that the covariate in this model is \(S\), which is the number of sovereign states.
The following report utilizes the updated version of Boix, Miller, and Rosato’s (2013) dataset on the classification of political regimes from 1800 and 2020. Classifying and measuring what type of a political regime a society creates has been discussed in length in the literature (e.g. Little and Meng 2023).
Some employ a continuous index to capture a latent variable that can only be evaluated subjectively by countries experts (see Coppedge et. al. 2023), other prefer a dichotomous classification, e. g. democracy and autocracies, by examining an observable characteristic like constitution and the presence of regular election to determine a class of political regime (e. g. Cheibub, Ghandi, and Vreeland 2010).
Boix, Miller, and Rosato’s dataset follows the latter school of thought. This dataset classifies a country as a democracy when the legislative branches of the government is elected directly by the majority of male population and the executive branch is responsible fully to the parliament and/or the voters. This dataset is chosen because it provides a simpler discrete classification of regimes with a longest period, that is, from 1800 to 2020.
The dataset is downloaded through \(\texttt{democracyData}\) package in \(\texttt{R}\), developed by xmarquez’s account on Github. The dataset is a panel with \(222\) units of observations and \(19.755\) year-observations. Almost all unit of observations are classified as sovereigns but in some periods of observation they were not. Excluding the period of units when they are not sovereign, this gives \(19.130\) year-observations in the dataset.
The following report will use an aggregate number of democracies in sovereign states in a given year. Consider \(Y_{it}\) as a binomial classification of political regime of unit \(i\) in the year \(t\) with \(N\) as the total number of unit observations at time \(t\). This quantity is mathematically expressed as follow.
\[ Z(t) = \sum_{i = 1}^N Y_{it} \] In particular, the following report will count the change in the aggregate number of democracies in time \(t\), which is expressed in the following quantities.
\[ \Delta Z(t) = \max \left(0, Z(t) - Z(t - 1)\right) \]
The realized values of the quantities \(Z(t)\) and \(\Delta Z(t)\) in the dataset are visualized respectively in the graph on the right and the left panel below. Notice that the negative value on the right panel indicates that at that given year there were democracies transiting to autocracies or losing sovereignty, hence no longer exist in the dataset as a democratic country.
Therefore, the research question to solve in the following report is below:
The estimation of parameters in the latent process will be estimated through the particle filter algorithm with multiple number of iteration and a different arbitrarily starting initial values with POMP package developed by King, Nguyen, and Ionides (2015). The next section of this report will discuss the parameters of the latent observations will be estimated in the simulation study.
The latent process in this model starts from the covariate \(S(t)\) or the number of sovereign states in year \(t\). This is counted as an aggregate number of democracies and autocracies at year t. The next step of the latent process is to capture the subset of sovereign states that have elites with powerful endowment, \(P(t)\), at the time \(t\).
Elites may inherit sophisticated military technologies from the previous foreign domination or simply receive exogenous endowment, such as oils or natural resources, that make them powerful enough to not establish democracies. The transition from \(S(t)\) to \(P(t)\) is captured as a binomial approximation with exponential transition probabilities, as suggested in Euler solutions to stochastic dynamic (King, Ionides, and Wheeler 2024):
\[ \tilde{N}_{SP}(t + \delta) + \tilde{N}_{SP}(t) + \text{Binomial}\left[\tilde{S}(t), 1 - \text{exp}\left( -\beta \frac{\tilde{R}(t)}{\tilde{S}(t)} \Delta t \right) \right] \] Drawing insight from SIER model in epidemology research (see King, Ionides, and Wheeler 2024), I consider \(P(t)\) as a latency period before revoluationary threats occur. Observing powerful elites after the state’s sovereignty is established, the mass soon prepares for the revolution to negotiate with the elites with the following expected value.
\[ \beta \frac{R(t)}{\zeta(t)} \] Notice that \(\beta\) and \(R(t)\) are the parameters to be estimated, while \(\zeta(t)=S(t)\) is the covariate smoothed with a cubic spline. The initial value of covariates \(S(0)\) is fixated at 23 to follow the number of sovereign states in the dataset when \(t=0\).
The rate of transition from \(P(t)\) to \(R(t)\) is captured by the coefficient \(\mu_{PR}\) in the following Euler stochastic binomial approximation. \(R(0)\) is fixated at two to represent the sovereign states where revolutions had unfolded, the United States and France, in the earliest year in the report.
\[ \tilde{N}_{PR}(t + \delta) = \tilde{N}_{PR}(t) + \text{Binomial}\left[\tilde{P}(t), \mu_{PR} \Delta t \right] \] Finally, the transition from \(R(t)\) to \(N(t)\) is depicted in the equality below with the rate of change \(\mu_{RN}\). The initial values of \(N(0)\), is fixated at one to represent the first sovereign state, the United States, that was classified as a democracy in the earliest date of observation
\[ \tilde{N}_{RN}(t + \delta) = \tilde{N}_{RN}(t) + \text{Binomial}\left[\tilde{R}(t), \mu_{RN} \Delta t \right] \] This counting process is implemented in the following \(\texttt{Csnippets}\), where \(\texttt{tot_sov}\) is the covariate from the column in the dataset.
sprn_step <- Csnippet("
double dN_SP = rbinom(S,1-exp(-Beta* N/tot_sov*dt));
double dN_PR = rbinom(P,1-exp(-mu_PR * dt));
double dN_RN = rbinom(R,1-exp(-mu_RN * dt));
S -= dN_SP;
P += dN_SP - dN_PR;
R += dN_PR - dN_RN;
N += dN_RN;"
)
sprn_init <- Csnippet("
S = 23;
P = 1;
R = 2;
N = 1;"
)
Similar to SEIR model, the dataset will be modeled as a negative binomial with overdispersion parameter \(k\), which also will be estimated in the simulation. This parameter helps to capture the discrepencies between the model and the data (see King, Ionides, and Wheler 2024).
The overdispersion parameter \(k\) will interact with the rate of success, or transition to democracy, \(\rho \cdot N(t)\). Similar to SEIR model, \(\rho\) represents a reporting efficiency, or in this context, the efficiency of past sovereign states to develop an accessible archive for the future coders of political regimes.
To express this idea formally, the number of democracy given time \(t\) is measured in the following quantity.
\[ \Delta Z(t) \sim \text{NegBin}(\rho \cdot N(t), k) \]
Finally, measuring democracies has been a subject of debate in the literature (see Little and Meng 2023), so it might be a good idea to factorize the measurement error in the model, as in the SEIR model. The measurement error function is included in the \(\texttt{pomp}\) object with coefficient \(\rho\) multiplied by \(N(t)\).
The following \(\texttt{Csnippet}\) implements respectively \(\texttt{dmeasure}\), \(\texttt{rmeasure}\), and measurement error as a \(\texttt{pomp}\) object.
dmeas <- Csnippet("
lik = dnbinom_mu(democracy,k,rho* N, give_log);"
)
rmeas <- Csnippet("
democracy = rnbinom_mu(k,rho*N);"
)
emeas <- Csnippet("
E_democracy = rho*N;"
)
With a consideration of time-constraints, the likelihood of parameters above is estimated through two hundred iteration with two thousands number of particle filter. The loglikelihood standard error is estimated through two thousands particle filter with ten number of replicate. Finally, the cooling fraction with the size \(0.5\) in addition to random walk with the size \(0.02\) to all estimands is included to give a stochastic perturbation in each iterations.
The starting value of each parameter is chosen arbitrarily from a uniformly distributed two hundred sequence of numbers. The following plot represent the design matrix parameter space used in the iteration algorithm.
The computation time took about four hours in the University of Michigan’s Greatlakes computer with \(36\) cores. The result is discussed in the next section.
The estimation of parameters value of the simulation is represented in the following plot.
Examining the plot above, it seems fair to state that although the number of iteration and particle filters are relatively moderate, the parameter estimates are well identified. More specifically, the profile likelihood confidence interval is represented in the following plot.
The plot above show that \(\beta\), \(\mu_{PR}\), \(\mu_{RN}\) have a value below one within \(95\%\) and \(97.5\%\) confidence interval. This means, the transition of population moving from one to another compartment unfold in a less than a year. Substantively, this means, soon when a sovereign state is established, the mass prepares revolutionary threats to negotiate with the elites to establish democracy.
To be more precise, the following pair plot is presented to examine the interaction between these substantive parameters.
The pair plot above indicates an exponential decay relationship between the parameters \(\beta\) with \(\mu_{RN}\) and \(\mu_{PR}\). The relationship between these two parameters are consistent with the interpretation above. This implies, as the expected value of revolutionary threats from the mass increases, the the latency, \(\mu_{PR}\), and the revolutionary periods \(\mu_{RN}\) decreases as well. Substantively, this means, compromising with the mass is an effective mean of elites to appease the revolutionary threats, which confirms the insight from Acemoglu and Robinson’s (2006) model of democratization.
With regards to the measurement error and the data quality on the regime, the result of simulation indicates if the model is correct, the overdispersion value, \(k\), and coding efficiency \(\rho\) should be low. The value \(k\) within the confidence interval indicates that there is a high discrepancy between the model and the data. The value \(\rho\) indicates that if the revolutionary threats is the only route to democratization, there should be more number of democracies reported in the data.
In a more detail, the following plot examines the relationship between \(\rho\) and \(k\). The plot indicates, within \(95\%\) confidence interval, one can expect that the coding efficiency is around \(0.07\) with overdispersion parameter between \(0.56\) and \(0.60\).
Rather than the weakness of the model, the measurement error value in the model suggests that there might additional compartments before \(\mu_{RN}\) which complicates the road to democratization. The model suggests that elites are not that much willing to negotiate with the mass even though the revolutionary threats occurs. It might be the cost of repression decreases over time, which allows them to stay in the position of power, regardless the presence of revolutionary threats. Both of these are plausible insights that the model suggests.
To check whether the POMP model with five parameters are worth to consider, I compare it with simpler models such as a negative binomial regression and Poisson regression with year as its explanatory variable. The choice of these model seem reasonable because the outcome variable is a non-negative natural numbers. Further, independent and identically distributed (IID) model is also chosen as additional benchmark.
Examining the Figure 1 shown earlier, the realization of the measurement model in the dataset does not indicate the sign of stationarity, hence using ARMA model as a benchmark might be not appropriate.
The negative binomial model is fitted by using \(\texttt{glm.nb}\) function from \(\texttt{MASS}\) library while the Poisson regression is fitted by using \(\texttt{glm}\) function in \(\texttt{R}\). IID model is fitted by maximum likelihood value function of negative binomial distribution using King, Iondies, and Wheeler’s (2024) code.
The log likelihood and AIC values of all these model, including the POMP model is presented in the following table. The log likelihood of the POMP model is chosen from the maximum value in the simulation, while its AIC value is computed by considering five parameters estimated during the simulation
Model | Log.Likelihood | AIC |
---|---|---|
POMP | -211.8487 | 433.6974 |
Poisson Regression | -250.7523 | 505.5046 |
Negative Binomial | -210.6217 | 427.2434 |
IID | -237.3759 | 476.7517 |
Examining and comparing these models, it seems the negative binomial regression performs better as opposed to POMP and all other models. The coefficient value of explanatory variable in this model shows a positive sign (see Appendix I), which implies the aggregate number of democracy can simply be explained as a function of time.
According to the mainstream model of political evolution in social science, e. g. modernization theory, society will adopt similar political institutions, e.g. democracy, as they progress economically (Przeworski and Limongi 1997). As the world has become more economically developed in the last two hundred years, the explanatory variable in the negative binomial model probably captured this variable.
However, while negative binomial regression model perform better than the POMP model, this does not capture the nuances of the endogenous mechanism in which a society adopts a democracy as a product of strategic interaction between elites and the mass. Abandoning the POMP model in favour the negative binomial regression implies we have to abandon a rich explanation of this model suggests.
Further, despite having more parameters, POMP model’s log-likelihood is still competitive with the simpler model. As a resolution, rather than abandoning this model whatsover, it might be better to refining its structure, so it can reflect the actual incidence of democracies in the dataset.
The next step of model diagnostic is by probing the simulation and the observational data. The grey lines in the following plot shows the data generated from twenty simulation from the model with the highest log-likelihood value, while the red line is the actual data point
Examining the plot above, the simulation predicts at least one incidence of democratization must occur every year. The simulation data indicates a higher exponential growth of democracies in comparison the actual data. Additionally, it might be worth to consider examining the variability of incidence of democracies from the data and from the simulation.
To examine this relationship more precisely, I fit the exponential growth rates model with the simulation and actual data with a function written by King, Ionides, and Wheler (2024) and the standard deviation of its residual fits.
The result is shown in the following plot.
The plot above indicates there are some moderate evidences in which the growth rate from the model is different from the data, as shown on the two plots on the left panel above. From the plot from the right, additionally, there is a moderate evidence that the simulation on the lower quartile has a high variability from the data.
However, rather than a weakness of the model, this observation ensures the reliability of parameters \(\rho\) and \(k\) in the simulation, so too the reliability of other parameters. This implies, the simulation works well, but the structure of the model needs a further refinement.
This report has examined the latent process of democracy in the last two centuries. More specifically, this report asks, given the realization of the annual incidences of democracies in the dataset, what are the value of parameters that can best capture the latent process in the model? The answer of this question is presented in the following table:
Coefficient | Estimates |
---|---|
Beta | 0.2650704 |
rho | 0.0663423 |
k | 0.5804159 |
mu_RN | 0.0147181 |
mu_PR | 0.0356604 |
loglik | -211.8486945 |
loglik.se | 0.0224633 |
The estimates of coefficient above is chosen from the simulation of the estimated parameters with the maximum log-likelihood.
Given the low value of coefficient estimates, this model optimistically predicts that incident of democracies occur at least once every year. A diganostic of this model in the previous section shows that this result is overestimating, but is still competitive even with a more simpler model.
As discussed and suggested in the previous section, a future study of political regime with a simulation method can improve this estimation by adding more compartments so it gives a precise estimate on the latent process that generated the incidence of democracies. Finally, examining the Figure 1 in the early section, it seems there is a random spike of democratization occuring after the year 1900. Hence, the future simulation study on this topic can consider to include extra-demographic stochasticity to get a more precise estimate.
Estimate | Std. Error | z value | Pr(>|z|) | |
---|---|---|---|---|
(Intercept) | -36.8606386 | 4.9207691 | -7.490829 | 0 |
time | 0.0188072 | 0.0025273 | 7.441666 | 0 |
Estimate | Std. Error | z value | Pr(>|z|) | |
---|---|---|---|---|
(Intercept) | -31.2785116 | 3.2392738 | -9.656026 | 0 |
time | 0.0159366 | 0.0016484 | 9.667830 | 0 |