From “Clothing Standard” to “Chemometrics”

Review of Prof. Kai-Tai Fang’s Contributions in Data Mining
  • Ping HeEmail author
  • Xiaoling Peng
  • Qingsong Xu


This paper reviews Prof. Kai-Tai Fang’s contributions in data mining. Since the 1970s, Prof. Fang has been committed to applying statistical ideas and methods to deal with large amounts of data in practical projects. By analyzing more than 400,000 pieces of data, he found representative clothing indicators and established the first adult clothing standard in China; through cleaning and modeling steel-making data from steel mills all over the country, he revised the national standard for alloy structural steel; by studying various data in chemometrics, he introduced many new advanced statistical methods to improve the identification and classification of chemical components, established more effective models for the relationship between quantitative structure and activity, and promoted the application of the traditional Chinese medicine (TCM) fingerprint in TCM quality control. Professor Fang and his team’s research achievements in data mining have been highly appreciated by relevant experts. This article is written to celebrate the 80th birthday of Prof. Kaitai Fang.



This work was partially supported by Guangdong Natural Science Foundation (No. 2018A0303130231) and Guangdong Innovation and Enhancement Project: Education Research Programme (R5201920).


  1. 1.
    Atkinson, A.C., Bogacka, B., Bogacki, M.B.: \(D\)- and \(T\)-optimum designs for the kinetics of a reversible chemical reaction. Chemom. Lab. Syst. 43, 185–198 (1998)Google Scholar
  2. 2.
    Cox, D.R.: Note on grouping. J. Am. Stat. Assoc. 52, 543–547 (1957)zbMATHGoogle Scholar
  3. 3.
    Fang, K.T.: Using conditional distributions to formulate a national clothing standard. J. Appl. Math. 2, 63–74 (1976). (Chinese)Google Scholar
  4. 4.
    Fang, K.T.: Uniform design—application of number theory method in experimental design. J. Appl. Math. 3, 363–372 (1980). (Chinese)Google Scholar
  5. 5.
    Fang, K.T.: Endless and unbending journey to statistics research: the oral autobiography of Kai-tai Fang. Hunan Education Press (2016) (in Chinese)Google Scholar
  6. 6.
    Fang, K.T., He, S.: The problem of selecting a given number of representative points in a normal population and a generalized mill’s ratio. Technical report, No. 5, Department of Statistics, Stanford University, USA (1982)Google Scholar
  7. 7.
    Fang, K.T., Wang, Y.: Number-Theoretic Methods in Statistics. Chapman and Hall, London (1994)zbMATHGoogle Scholar
  8. 8.
    Fang, K.T., Wu, C.Y.: A probability extremum problem. J. Appl. Math. 2, 132–148 (1979) (in Chinese)Google Scholar
  9. 9.
    Fang, K.T., Zhang Y.T.: Introduction to Multivariate Statistical Analysis. Science Press (1982) (in Chinese)Google Scholar
  10. 10.
    Fang, K.T., Bentler, P.M., Yuan, K.H.: Applications of number-theoretic methods to quantizers of elliptically contoured distributions. In: Multivariate Analysis and Its Applications, IMS Lecture Notes—Monograph Series, pp. 211–225 (1994)Google Scholar
  11. 11.
    Fang, K.T., Liang, Y.Z., Yin, X.L., Chen, K., Lu, G.H.: Critical value determination on similarity of fingerprints. Chemom. Intell. Lab. Syst. 82, 236–240 (2006)Google Scholar
  12. 12.
    Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines. Hong Kong Baptist University (2003)Google Scholar
  13. 13.
    Fang, K.T., Liang, Y.Z., Yu, R.Q. (eds.): Data Mining and Bioinformatics in Chemistry and Chinese Medicines, vol. 2, Hong Kong Baptist University (2004)Google Scholar
  14. 14.
    Fang, K.T., Xiang, K.F., Liu, G.Y.: Precision of Test Method. China Standard Press, Beijing (1988)Google Scholar
  15. 15.
    Fang, K.T., Yin, H., Liang, Y.Z.: New approach by Kriging methods to problems in QSAR. J. Chem. Inform. Model. 44, 2106–2113 (2004)Google Scholar
  16. 16.
    Fang, K.T., Zhou, M., Wang, W.J.: Applications of the representative points in statistical simulations. Sci. China Ser. A 57, 2609–2620 (2014)MathSciNetzbMATHGoogle Scholar
  17. 17.
    He, P., Fang, K.T., Xu, C.J.: The classification tree combined with SIR and its applications to classification of mass spectra. J. Data Sci. 1, 425–445 (2003)Google Scholar
  18. 18.
    He, P., Fang, K.T., Liang, Y.Z., Li, B.Y.: A Generalized boosting algorithm and its application to two-class chemical classification problem. Analytica Chimica Acta 543, 181–191 (2005)Google Scholar
  19. 19.
    He, P., Xu, C.J., Liang, Y.Z., Fang, K.T.: Improving the classification accuracy in chemistry via boosting technique. Chem. Intell. Lab. Syst. 70, 39–46 (2004)Google Scholar
  20. 20.
    Hu, Q.N., Liang, Y.Z., Fang, K.T.: The matrix expression, topological index and atomic attribute of molecular topological structure. J. Data Sci. 1, 361–389 (2003)Google Scholar
  21. 21.
    Hu, Q.N., Liang, Y.Z., Peng, X.L., Yin, H., Fang, K.T.: Structural interpretation of a topological index. 1. External factor variable connectivity index (EFVCI). J. Chem. Inf. Comput. Sci. 44, 437–446 (2004)Google Scholar
  22. 22.
    Hu, Q.N., Liang, Y.Z., Yin, H., Peng, X.L., Fang, K.T.: Structural interpretation of a topological index. 2. The molecular connectivity index, the Kappa index, and the atom-type E-state index. J. Chem. Inf. Comput. Sci. 44, 1193–1201 (2004)Google Scholar
  23. 23.
    Lee, A.W.M., Chan, W.F., Yuen, F.S.Y., Tse, P.K., Liang, Y.Z., Fang, K.T.: An example of a sequential uniform design: application in capillary electrophoresis. Chem. Intell. Lab. Syst. 39, 11–18 (1997)Google Scholar
  24. 24.
    Liang, Y.Z., Fang, K.T.: Robust multivariate calibration algorithm based on least median squares and sequential number theoretic optimization method. Anal. Chem. 121, 1025–1029 (1996)Google Scholar
  25. 25.
    Liang, Y.Z., Fang, K.T., Xu, Q.S.: Uniform design and its applications in chemistry and chemical engineering. Chem. Intell. Lab. Syst. 58, 43–57 (2001)Google Scholar
  26. 26.
    Peng, X.L., Hu, Q.N., Liang, Y.Z.: Variable selection via nonconcave penalty function in structure-boiling points correlations. J. Mol. Struct.: THEOCHEM, 714, 235–242 (2005)Google Scholar
  27. 27.
    Peng, X.L., Yin, H., Li, R., Fang, K.T.: The application of Kriging and empirical Kriging based on the variables selected by SCAD. Analytica Chimica Acta 578, 178–185 (2006)Google Scholar
  28. 28.
    Tang, Y., Liang, Y.Z., Fang, K.T.: Data mining in chemometrics: sub-structures learning via peak combinations searching in mass spectra. J. Data Sci. 1, 481–496 (2003)Google Scholar
  29. 29.
    Varmuza, K., He, P., Fang, K.T.: Boosting applied to classification of mass spectral data. J. Data Sci. 1, 391–404 (2003)Google Scholar
  30. 30.
    Xu, Q.S., Liang, Y.Z., Fang, K.T.: The effects of different experimental designs on parameter estimation in the kinetics of a reversible chemical reaction. Chem. Intell. Lab. Syst. 52, 155–166 (2000)Google Scholar
  31. 31.
    Xu, Q.S., Massart, D.L., Liang, Y.Z., Fang, K.T.: Two-step multivariate adaptive regression spline for modeling a quantitative relationship between gas chromatography retention indices and molecular descriptors. J. Chromatogr. A 998, 155–167 (2003)Google Scholar
  32. 32.
    Xu Q.S., Xu Y.D., Li L., Fang K.T.: Uniform experimental design in Chemometrics. J. Chem. e3020 (2018). Scholar
  33. 33.
    Yin, X.L., Fang, K.T., Liang, Y.Z., Wong, R.N.S., Ha, A.W.Y.: Assessing phylogenetic relationships of Lycium samples using RAPD and entropy theory. Acta Pharmacologica Sinica 26, 1217–1224 (2005)Google Scholar
  34. 34.
    Ying, H., Li, R.Z., Fang, K.T., Liang, Y.Z.: Empirical Kriging models and their applications to QSAR. J. Chem. 20, 1–10 (2007)Google Scholar
  35. 35.
    Zhang, L., Liang, Y.Z., Yu, R.Q., Fang, K.T.: Sequential number-theoretic optimization (SNTO) method applied to chemical quantitative analysis. J. Chem. 11, 267–281 (1997)Google Scholar
  36. 36.
    Zhang, L., Liang, Y.Z., Jiang, J.H., Yu, R.Q., Fang, K.T.: Uniform design applied to nonlinear multivariate calibration by ANN. Analytica Chimica Acta 370, 65–77 (1998)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Beijing Normal University and Hong Kong Baptist University United International CollegeZhuhaiChina
  2. 2.Department of Mathematics and StatisticsCentral South UniversityChangshaChina

Personalised recommendations