Storage onboarding structural break testing

Block Science observed that there is a strong structural break in the onboarding time series before and after the China regulatory activity. This structural break appears to have disrupted the entire onboarding process (and the associated dependencies, such as gas usage) from a stationary process to a (likely transitory) non-stationary process. In other words, the current situation since early October has been one of the transients, creating volatility around a downward trend. This phenomenon has rendered the system identification techniques, which use the previous history and a technical assumption of its stationarity to create the DT’s extrapolation engine, less effective when applied to recent data (e.g. from October onwards). There is in-progress work on helping to improve the extrapolation, but we are also exploring running the DT simulations with the dynamic BatchBalancer on data before the China regulatory activity and then extrapolating forward. This not only assures us of the stationarity of the time series observed before the announcement (allowing us to continue using the VARx methodology for the gas dynamics) but also allows for a comparison of the extrapolated simulated system with the actual time series that occurred after the structural break.

A statistical test such as the Chow Test is performed on storage onboarding time-series data to determine, formally, if the Chinese regulation of the cryptocurrency market of September 24, 2021, caused a structural break (a structural break is a change in the parameters of regression models) at that time or thereafter. If a structural break is present, we will investigate to determine if the onboarding time series is stationary after the structural break. To test this hypothesis, we will use the Chow test.

The Chow test was created in 1960 by Gregory Chow, and it is a test that checks whether the actual coefficients in two linear regressions on different time series or different periods of the same time series are equal. It is most often used to check for a structural break that is known before analysis, as in our case with the Chinese regulations.

To illustrate how the Chow Test functions, we will enumerate below:

We have the following regression data: $y_t=a+bx_{1t} + cx_{2t} + \varepsilon.\$

If split into two groups, we have:

$y_t=a_1+b_1x_{1t} + c_1x_{2t} + \varepsilon \,$

and

$y_t=a_2+b_2x_{1t} + c_2x_{2t} + \varepsilon \,$

The null hypothesis of the Chow test asserts that $a_1=a_2$, $b_1=b_2$, and $c_1=c_2$ and that the errors and residuals I.D.D from a normal distribution with an unknown variance.

$S_C$ is the sum of squared errors and residuals from the combined data, with $S_1$ the sum of squared errors and residuals from the first group. $S_2$ is the sum of squared errors and residuals from the second group. $N_1$ and $N_2$ are the numbers of observations in each group while $k$ is the total number of parameters. Thus, the Chow test statistic is:

$$\frac{(S_C -(S_1+S_2))/k}{(S_1+S_2)/(N_1+N_2-2k)}$$

The test statistic follows the F distribution wit$k$ and $N_1+N_2-2k$ the degrees of freedom.

Below, we will test the storage power_rb and the daily sum of gas used to check for structural breaks. We will set our breakpoint for September 24th, and use David Woroniuk's Python implementation of the Chow test.

References:

Pull data

Chow test

Conclusion

From the Chow tests and plots, we can statistically say that a structural break occured in network onboarding following the new Chinese cryptocurrency regulations of September 24th, 2021. Based on the plots above, we can see that the effect is becoming more pronounced the further we are from the event as the impact reverberates through the network.