Datasets

surpyval.datasets.load_bearing_failures()

Data on the failure of bearings, from 1. “Cycles to Failure (millions)” is the number of cycles to failure in millions of cycles.

References

1

Lieblein, J. and Zelen, M. (1956) Statistical Investigation of the Fatigue Life of Deep-Groove Ball Bearings. Journal of Research of the National Bureau of Standards, 57, 273-315.

surpyval.datasets.load_bofors_steel()

Returns a Pandas DataFrame containing the data of the tensile strength of Bofors Steel from 2.

First 5 rows of the dataset:

x

n

0

40.800

10

1

42.075

23

2

43.350

48

3

44.625

80

4

45.900

63

References

2

Weibull, W., A statistical distribution function of wide applicability, Journal of applied mechanics, Vol. 18, No. 3, pp 293-297 (1951).

surpyval.datasets.load_boston_housing()

The Boston house-price data of 3.

This is a well-known data set used in machine learning. It can be analysed with survival analysis methods by considering the fact that the highest prices appear to be right censored.

References

3

Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81-102.

surpyval.datasets.load_g1_kaminskiy_krivtsov()

Data on the survival of a repairable system from 4.

References

4

Kaminskiy, M.P. and Krivtsov, V.V. (2010). G1-renewal process as repairable system model. Reliability Engineering and System Safety, 95(1), 1-9.

surpyval.datasets.load_heart_transplants()

Data on the survival of patients who may or may not have received a heart transplant, from 5.

References

5

Crowley, J. and Hu, M. (1977) Covariance analysis of heart transplant survival data. Journal of the American Statistical Association, 72, 27-36.

surpyval.datasets.load_lung()

Data on the survival of patients with advanced lung cancer from 6.

References

6

Loprinzi CL. Laurie JA. Wieand HS. Krook JE. Novotny PJ. Kugler JW. Bartel J. Law M. Bateman M. Klatt NE. et al. Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group. Journal of Clinical Oncology. 12(3):601-7, 1994.

surpyval.datasets.load_meeker_lfp()

Data on failures of integrated circuits from 12.

Very difficult for LFP calculations since the data is heavily right censored.

References

12

Meeker, W.Q. (1987) Limited Failure Population Life Tests: Application to Integrated Circuit Reliability. Technometrics, 29(1), 51-65.

surpyval.datasets.load_mettas_and_zhao()

Data on the survival of a repairable system from 7.

References

7

Mettas, A. and Zhao, Y.Q. (2005). Modeling and analysis of repairable systems with general repair. IEEE Transactions on Reliability, 54(1), 1-10.

surpyval.datasets.load_rossi_static()

Data on the recidivism of released prisoners from 8. Uses only static covariates.

References

8

Rossi, P.H., R.A. Berk, and K.J. Lenihan (1980). Money, Work, and Crime: Some Experimental Results. New York: Academic Press. John Fox, Marilia Sa Carvalho (2012). The RcmdrPlugin.survival Package: Extending the R Commander Interface to Survival Analysis. Journal of Statistical Software, 49(7), 1-32.

surpyval.datasets.load_rossi_time_varying()

Data on the recidivism of released prisoners from 9. Includes time varying covariates.

References

9

Rossi, P.H., R.A. Berk, and K.J. Lenihan (1980). Money, Work, and Crime: Some Experimental Results. New York: Academic Press. John Fox, Marilia Sa Carvalho (2012). The RcmdrPlugin.survival Package: Extending the R Commander Interface to Survival Analysis. Journal of Statistical Software, 49(7), 1-32.

surpyval.datasets.load_sae()

Data on failures in automotive industry from 11.

Features heavily (right) censored data.

References

11

V.V. Krivtsov and J. W. Case (1999), Peculiarities of Censored Data Analysis in Automotive Industry Applications, SAE Technical Paper Series, # 1999-01-3220

surpyval.datasets.load_tires_data()

Data on the survival of tires from 10.

References

10

Krivtsov, V.V., Tananko, D.E., Davis, T.P. (2002). Regression approach to tire reliability analysis. Reliability Engineering and System Safety, 78(3), 267-273.