Datasets
- surpyval.datasets.load_bearing_failures()
Data on the failure of bearings, from [1]. The “Cycles to Failure (millions)” column gives the number of cycles to failure, in millions.
References
- [1] Lieblein, J. and Zelen, M. (1956) Statistical Investigation of the Fatigue Life of Deep-Groove Ball Bearings. Journal of Research of the National Bureau of Standards, 57, 273-315.
- surpyval.datasets.load_bofors_steel()
Returns a pandas DataFrame containing tensile strength data for Bofors steel, from [2].
First 5 rows of the dataset:

        x   n
0  40.800  10
1  42.075  23
2  43.350  48
3  44.625  80
4  45.900  63

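The five rows listed above can be read as grouped data. The following is a minimal sketch, assuming `n` is the number of observations recorded at each strength value `x` (an assumption, not something stated above). It copies the five rows by hand rather than calling SurPyval, then computes a count-weighted mean and expands the counts into one value per observation.

```python
# First five rows of the Bofors steel data, copied from the listing above.
# Assumption: n is the number of observations recorded at each value of x.
x = [40.800, 42.075, 43.350, 44.625, 45.900]
n = [10, 23, 48, 80, 63]

# Total number of observations represented by these five rows.
total = sum(n)

# Mean strength, weighting each value by its count.
weighted_mean = sum(xi * ni for xi, ni in zip(x, n)) / total

# Expand the grouped rows into one entry per observation.
expanded = [xi for xi, ni in zip(x, n) for _ in range(ni)]

print(total)          # 224
print(len(expanded))  # 224
print(round(weighted_mean, 3))
```

Keeping the data in grouped `x`, `n` form is more compact than the expanded list, and either form carries the same information under the assumption above.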
References
- [2] Weibull, W. (1951) A statistical distribution function of wide applicability. Journal of Applied Mechanics, 18(3), 293-297.
- surpyval.datasets.load_boston_housing()
The Boston house-price data from [3].
This is a well-known dataset in machine learning. It can be analysed with survival analysis methods because the highest prices appear to be right censored.
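To illustrate what treating the highest prices as right censored could look like, here is a minimal sketch in plain Python. The price values are hypothetical and are not taken from the dataset. The flag coding (0 for an exactly observed value, 1 for a right-censored one) is an assumed convention for this example, so check it against the SurPyval documentation before relying on it.

```python
# Hypothetical house prices, NOT taken from the dataset.
prices = [24.0, 21.6, 50.0, 34.7, 50.0, 33.4]

# Treat the largest recorded value as a reporting cap: any price equal to
# the cap is only known to be "at least this much".
cap = max(prices)

# Assumed coding: 0 = exactly observed, 1 = right censored.
c = [1 if p == cap else 0 for p in prices]

for p, flag in zip(prices, c):
    label = "right censored" if flag == 1 else "observed"
    print(f"{p:5.1f}  {label}")
```

A pair of lists like `prices` and `c` is the usual shape of input for survival methods that accept per-observation censoring flags.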
References
- [3] Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. Journal of Environmental Economics and Management, 5, 81-102.
- surpyval.datasets.load_g1_kaminskiy_krivtsov()
Data on the survival of a repairable system, from [4].
References
- [4] Kaminskiy, M.P. and Krivtsov, V.V. (2010) G1-renewal process as repairable system model. Reliability Engineering and System Safety, 95(1), 1-9.
- surpyval.datasets.load_heart_transplants()
Data on the survival of patients who may or may not have received a heart transplant, from [5].
References
- [5] Crowley, J. and Hu, M. (1977) Covariance analysis of heart transplant survival data. Journal of the American Statistical Association, 72, 27-36.
- surpyval.datasets.load_lung()
Data on the survival of patients with advanced lung cancer, from [6].
References
- [6] Loprinzi, C.L., Laurie, J.A., Wieand, H.S., Krook, J.E., Novotny, P.J., Kugler, J.W., Bartel, J., Law, M., Bateman, M., Klatt, N.E., et al. (1994) Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group. Journal of Clinical Oncology, 12(3), 601-607.
- surpyval.datasets.load_meeker_lfp()
Data on failures of integrated circuits, from [12].
This dataset is very difficult for limited failure population (LFP) calculations because the data are heavily right censored.
References
- [12] Meeker, W.Q. (1987) Limited Failure Population Life Tests: Application to Integrated Circuit Reliability. Technometrics, 29(1), 51-65.
- surpyval.datasets.load_mettas_and_zhao()
Data on the survival of a repairable system, from [7].
References
- [7] Mettas, A. and Zhao, Y.Q. (2005) Modeling and analysis of repairable systems with general repair. IEEE Transactions on Reliability, 54(1), 1-10.
- surpyval.datasets.load_rossi_static()
Data on the recidivism of released prisoners, from [8]. Uses only static covariates.
References
- [8] Rossi, P.H., Berk, R.A. and Lenihan, K.J. (1980) Money, Work, and Crime: Some Experimental Results. New York: Academic Press. Fox, J. and Carvalho, M.S. (2012) The RcmdrPlugin.survival Package: Extending the R Commander Interface to Survival Analysis. Journal of Statistical Software, 49(7), 1-32.
- surpyval.datasets.load_rossi_time_varying()
Data on the recidivism of released prisoners, from [9]. Includes time-varying covariates.
References
- [9] Rossi, P.H., Berk, R.A. and Lenihan, K.J. (1980) Money, Work, and Crime: Some Experimental Results. New York: Academic Press. Fox, J. and Carvalho, M.S. (2012) The RcmdrPlugin.survival Package: Extending the R Commander Interface to Survival Analysis. Journal of Statistical Software, 49(7), 1-32.