Abstract
To provide a reference for fruit quality research and selective fruit breeding, this paper reviews the data mining methods applied to fruit quality evaluation in recent years, including the fuzzy comprehensive evaluation method, the analytic hierarchy process, and gray correlation degree analysis, and compares their advantages and disadvantages. The main evaluation factors of common fruits are then summarized. Finally, research on data mining methods for fruit quality is summarized and its prospects are discussed. The review indicates that data mining methods can evaluate multiple fruit quality indices comprehensively, which provides a reference for the rapid detection of fruit quality and the breeding of excellent varieties. Studying more of the main factors of fruit quality and simplifying the evaluation procedure are expected to become a new direction in fruit quality research in the near future.
Keywords
- Fruit quality
- Data mining
- Fuzzy evaluation
- Analytic hierarchy process
- Principal component cluster analysis
1 Introduction
Fruit quality is one of the most important factors determining the economic value of fruit, and it directly affects market competitiveness. China remains the world's leading country in terms of fruit trees, fruit yield, and planting area [1, 2]. However, its share of fruit exports trails that of other countries [3]. On one hand, because the country's fruit quality evaluation and sorting technology lags behind [4], the grading of fruit quality is disordered and the resulting quality is uneven. Scholars have analyzed single fruit indices, such as the soluble solids of strawberry [5] and apple [6], the titratable acid of sweet orange [7], and the hardness of kiwi fruit [8]. However, a single indicator evaluates only one aspect of quality and cannot evaluate fruit quality comprehensively, so it does not meet the requirements of fruit quality evaluation. On the other hand, fruit quality evaluation involves numerous factors, including internal and external quality factors, and these factors are both closely correlated and relatively independent, which makes evaluation and grading difficult. To address these problems, data mining analysis of fruit quality indicators, together with hierarchical and classification schemes and the relevant evaluation methods, can be used to reduce the number of quality indicators, extract the main evaluation factors, and simplify the evaluation process. The use of data mining for the comprehensive evaluation of fruit quality has become a research hotspot in recent years. Scholars have applied data mining methods to the fruit quality of Nanfeng tangerine [9], apple [10], pineapple [11], pear [12], and other fruits. The results show that data mining methods can evaluate fruit quality effectively.
At present, reviews of fruit quality data mining are rarely reported. This paper reviews and analyzes the data mining methods applied to fruit quality in recent years and then consolidates the main evaluation factors of common fruits, to provide a reference for fruit quality evaluation research.
2 Main Fruit Quality Indicators and Access Methods
Fruit quality includes appearance quality and intrinsic quality. The main evaluation indices include fruit shape index, fruit weight, fruit color, fruit firmness, soluble solids, vitamin C, and others, as shown in Table 1. These indicators represent different aspects of fruit characteristics, and close relationships exist among them. For example, total sugar overlaps with soluble solids, which are represented by sucrose and other carbohydrate components with reducible carbonyl groups; the two indices denote different attributes but are related. At present, quality indices are obtained mainly by chemical and instrumental measurement, but indices of fruit flavor, which are difficult to quantify, can be obtained only through expert scoring.
3 Data Mining Overview
3.1 Simple Mathematical Method
For small data sets with simple features, unknown and potentially useful information can be extracted with simple mathematical processing, such as means, percentages, and classification. Even simple processing of this kind can uncover valuable information hidden in a data set.
3.2 Mathematical Statistics Method
Statistical analysis is mainly used to summarize knowledge and to mine relational knowledge. Some data follow a functional relationship, while other data are related in ways that cannot be expressed as a function. In such cases, the implicit information can be extracted with mathematical statistics. Common methods include regression analysis, correlation analysis, and principal component analysis.
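As an illustration of correlation analysis, the following minimal Python sketch computes the Pearson correlation coefficient between two quality indices. The function name `pearson` and the sample values are hypothetical and serve only to show the calculation; they are not data from the reviewed studies.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical measurements for five fruit samples (illustrative values only).
soluble_solids = [11.2, 12.5, 13.1, 12.0, 14.3]  # percent
total_sugar = [9.0, 10.1, 10.8, 9.7, 11.9]       # percent

r = pearson(soluble_solids, total_sugar)
```

A coefficient close to 1 or -1 suggests that the two indices carry largely overlapping information, which is the situation the multi-index methods in Sect. 4 are meant to handle.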
3.3 Artificial Intelligence Method
For very large and particularly complex data sets, general data mining methods cannot extract the implicit information. In such cases, artificial intelligence methods, which are considerably more complex, can be used. The main methods include fuzzy evaluation, association rules, and clustering analysis.
4 Fruit Quality Data Mining Method
4.1 Single Evaluation Method
(1) Fuzzy evaluation method
The fuzzy evaluation method is an effective multi-factor decision method for comprehensively evaluating objects that are influenced by numerous factors. One characteristic of this method is that the result is expressed as a fuzzy set rather than as an absolutely positive or negative verdict [13]. Its advantages are that it quantifies qualitative indices, overcomes the shortcomings of purely qualitative analysis, and evaluates the strengths and weaknesses of varieties objectively and accurately [14]. Its disadvantage is information duplication, which is caused by unresolved correlation among the evaluation indices. The determination of the membership function and the fuzzy correlation matrix, among other issues, therefore still needs further study [15]. This method is mainly used in fruit quality identification and the breeding of good varieties, and it has been applied to the quality evaluation of longan [16], persimmon [17], and other fruits.
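A minimal Python sketch of one common form of fuzzy comprehensive evaluation follows. It assumes the weighted-average operator, in which the evaluation vector B is the product of a weight vector W and a membership matrix R; the reviewed studies may use other operators. The grades, weights, and membership degrees are hypothetical.

```python
def fuzzy_evaluate(weights, membership):
    """Return the normalized fuzzy evaluation vector B = W . R.

    weights    -- one weight per quality index
    membership -- membership[i][j] is the degree to which index i
                  belongs to grade j
    """
    if len(weights) != len(membership):
        raise ValueError("one membership row is needed per weight")
    n_grades = len(membership[0])
    b = [sum(w * row[j] for w, row in zip(weights, membership))
         for j in range(n_grades)]
    total = sum(b)
    if total == 0:
        raise ValueError("all membership degrees are zero")
    return [v / total for v in b]


# Hypothetical grades, weights, and membership degrees (illustrative only).
GRADES = ["excellent", "good", "fair", "poor"]
WEIGHTS = [0.4, 0.3, 0.3]
MEMBERSHIP = [
    [0.6, 0.3, 0.1, 0.0],  # soluble solids
    [0.2, 0.5, 0.2, 0.1],  # firmness
    [0.1, 0.4, 0.4, 0.1],  # flavor (expert score)
]

result = fuzzy_evaluate(WEIGHTS, MEMBERSHIP)
# Maximum-membership rule: report the grade with the largest degree.
best_grade = GRADES[result.index(max(result))]
```

The result is a vector of membership degrees over all grades; the maximum-membership rule used in the last line is only one way to reduce it to a single grade.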
(2) Analytic hierarchy process method
The analytic hierarchy process is a multi-objective decision analysis method that combines qualitative and quantitative analysis [18]. Its main concept is to decompose the complex problem of fruit quality evaluation into several levels and factors. The indices are compared pairwise to build a judgment matrix. The largest eigenvalue of this matrix and its corresponding eigenvector are then computed; the eigenvector gives the importance weights of the indices and provides a basis for selecting the optimal evaluation indices. One advantage of the analytic hierarchy process is that it not only yields a weight coefficient for each evaluation index but also filters out perception differences caused by accidental factors and brings factors of different dimensions into a unified evaluation system, with high reliability and small error. One disadvantage is that the number of fruit indicators is limited, with the maximum generally being 9. This method has been applied in the cultivation of good varieties of Jinsixiaozao [19], pear [20], and other fruits.
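The eigenvector step can be sketched in Python as follows. The sketch assumes Saaty's standard consistency check (consistency index, random index, and consistency ratio), which the text above does not spell out, and uses power iteration to approximate the principal eigenvector. The judgment matrix is hypothetical.

```python
# Saaty's random consistency index for matrix orders 1 to 9.
RANDOM_INDEX = {1: 0.0, 2: 0.0, 3: 0.58, 4: 0.90, 5: 1.12,
                6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}


def ahp_weights(matrix, iterations=100):
    """Return (weights, lambda_max, consistency_ratio) for a judgment matrix."""
    n = len(matrix)
    if n not in RANDOM_INDEX:
        raise ValueError("this sketch supports 1 to 9 indices")
    # Power iteration: repeatedly multiply and renormalize so weights sum to 1.
    w = [1.0 / n] * n
    for _ in range(iterations):
        v = [sum(matrix[i][j] * w[j] for j in range(n)) for i in range(n)]
        total = sum(v)
        w = [x / total for x in v]
    # Estimate the largest eigenvalue as the mean of (A w)_i / w_i.
    aw = [sum(matrix[i][j] * w[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(aw[i] / w[i] for i in range(n)) / n
    ci = (lambda_max - n) / (n - 1) if n > 1 else 0.0
    ri = RANDOM_INDEX[n]
    cr = ci / ri if ri > 0 else 0.0
    return w, lambda_max, cr


# Hypothetical pairwise comparisons of three indices on the 1-9 scale:
# index 0 is moderately more important than index 1 and strongly more
# important than index 2.
JUDGMENT = [
    [1.0, 3.0, 5.0],
    [1 / 3, 1.0, 3.0],
    [1 / 5, 1 / 3, 1.0],
]

weights, lambda_max, cr = ahp_weights(JUDGMENT)
```

By Saaty's convention, a consistency ratio below 0.1 is usually taken to mean that the pairwise judgments are acceptably consistent.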
(3) Gray correlation degree analysis
Gray correlation degree analysis applies gray system theory to the comprehensive evaluation of a research object, with correlation analysis as its main tool. The correlation coefficients and correlation degree between each comparison sequence and the reference sequence are computed to determine the primary and secondary factors and how strongly they are related [21]. This method offers the advantages of simplicity, ease of operation, and intuitiveness. Its disadvantages include strong subjectivity and difficulty in determining the optimal values of certain indices. This method is mainly used in situations where the correlation among indices is too high. Gray correlation degree analysis has played an important role in the comprehensive evaluation of muskmelon [22], peach [23], amomum [24], and other fruits.
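A minimal Python sketch of the correlation-degree calculation follows. It assumes the widely used form of the gray correlation coefficient with a distinguishing coefficient of 0.5, assumes that all indices have already been normalized to a common scale, and takes the reference sequence to be an ideal variety. The sequences are hypothetical.

```python
def gray_correlation_degrees(reference, candidates, rho=0.5):
    """Gray correlation degree of each candidate sequence to the reference.

    Assumes every index has already been normalized to a comparable scale.
    rho is the distinguishing coefficient (0.5 is a common choice).
    """
    diffs = [[abs(r - c) for r, c in zip(reference, cand)]
             for cand in candidates]
    d_min = min(min(row) for row in diffs)
    d_max = max(max(row) for row in diffs)
    if d_max == 0:
        # Every candidate coincides with the reference sequence.
        return [1.0] * len(candidates)
    degrees = []
    for row in diffs:
        coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        degrees.append(sum(coeffs) / len(coeffs))
    return degrees


# Hypothetical normalized indices for three varieties (illustrative only).
REFERENCE = [1.0, 1.0, 1.0]  # ideal variety: best value on every index
VARIETIES = [
    [0.9, 0.8, 1.0],
    [0.6, 0.9, 0.7],
    [0.4, 0.5, 0.6],
]

degrees = gray_correlation_degrees(REFERENCE, VARIETIES)
```

Varieties can then be ranked by their degrees, with a larger value indicating a sequence closer to the reference. The choice of reference sequence is one source of the subjectivity noted above.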
(4) Principal component analysis
The goal of principal component analysis is to convert many correlated original indicators, with minimal loss of information, into a smaller number of new comprehensive indices that are mutually orthogonal or only slightly correlated, thereby simplifying the evaluation process [25, 26]. Its advantages are that it computes a standard for comparison and can be carried out on a computer with dedicated analysis software. Its disadvantage is that the new comprehensive indices are difficult to interpret, so the method is generally combined with clustering. Principal component analysis is mainly used when there are many quality indicators that are strongly correlated: multiple correlated random variables are reduced to a few variables according to the contribution rates of the principal components, which avoids evaluation errors caused by overlapping, correlated traits [29]. At present, the analysis is used for selecting fruit quality evaluation factors and for the comprehensive evaluation of fruit quality [27, 28].
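The contribution rate of each principal component can be sketched in pure Python as follows. The sketch standardizes the indices, builds their correlation matrix, and approximates the leading eigenpairs by power iteration with deflation; in practice a statistics package would normally be used instead. The function names and sample values are hypothetical.

```python
from math import sqrt


def standardize(data):
    """Z-score each column of a samples x indices table.

    Assumes no column is constant (its standard deviation would be zero).
    """
    n = len(data)
    cols = list(zip(*data))
    means = [sum(c) / n for c in cols]
    stds = [sqrt(sum((v - m) ** 2 for v in c) / (n - 1))
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)]
            for row in data]


def correlation_matrix(z):
    """Correlation matrix of already standardized data."""
    n, p = len(z), len(z[0])
    return [[sum(row[i] * row[j] for row in z) / (n - 1) for j in range(p)]
            for i in range(p)]


def leading_eigenpairs(matrix, k, iterations=500):
    """Approximate the k largest eigenpairs of a symmetric matrix."""
    p = len(matrix)
    a = [row[:] for row in matrix]
    pairs = []
    for _ in range(k):
        # Asymmetric start vector, so it is unlikely to be orthogonal
        # to the eigenvector being sought.
        v = [float(i + 1) for i in range(p)]
        for _ in range(iterations):
            w = [sum(a[i][j] * v[j] for j in range(p)) for i in range(p)]
            norm = sqrt(sum(x * x for x in w))
            if norm == 0:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue for the unit vector v.
        lam = sum(v[i] * sum(a[i][j] * v[j] for j in range(p))
                  for i in range(p))
        pairs.append((lam, v))
        # Deflation: remove the component just found.
        a = [[a[i][j] - lam * v[i] * v[j] for j in range(p)]
             for i in range(p)]
    return pairs


# Hypothetical samples: soluble solids (%), firmness, single fruit weight (g).
SAMPLES = [
    [12.1, 7.5, 180.0],
    [13.4, 6.8, 165.0],
    [11.0, 8.2, 200.0],
    [14.2, 6.1, 150.0],
    [12.8, 7.0, 172.0],
    [10.5, 8.6, 210.0],
]

corr = correlation_matrix(standardize(SAMPLES))
pairs = leading_eigenpairs(corr, k=2)
# The eigenvalues of a correlation matrix sum to the number of indices,
# so each contribution rate is eigenvalue / number of indices.
rates = [lam / len(corr) for lam, _ in pairs]
```

The number of components to retain is then usually chosen so that the cumulative contribution rate passes a chosen threshold.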
4.2 Hybrid Evaluation Method
(1) Principal component cluster analysis method
When fruit quality is ranked on multiple indices, the variance contribution rate of the first principal component F1 is often not sufficiently high. In other words, the first principal component does not capture enough of the information in the original data, so ranking samples by the first principal component score alone is one-sided. In this case, principal component analysis and clustering analysis are combined to form the “principal component cluster analysis method”. Its advantage is that it retains most of the information in multiple indicators simultaneously, avoids the subjectivity of manually selecting evaluation factors, and truly reflects the comprehensive characteristics of varieties, thereby offering an objective basis for selecting breeding materials [29]. One disadvantage is that clustering becomes difficult when the data set is too large. The principal component cluster analysis method can effectively extract the main quality factors, simplify fruit quality evaluation, and provide a theoretical basis for rapid fruit measurement. The method has been used for the rapid quality detection of tomato [30] and plumcot [31].
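The clustering step can be sketched in Python as follows. The sketch assumes that the principal component scores of each variety have already been computed, and it uses agglomerative clustering with average linkage, which is only one of several possible clustering choices. The scores are hypothetical.

```python
from math import dist  # available in Python 3.8 and later


def average_linkage(points, n_clusters):
    """Agglomerative clustering with average linkage.

    points     -- one tuple of principal component scores per variety
    n_clusters -- number of clusters at which merging stops
    Returns a list of clusters, each a sorted list of point indices.
    """
    if not 1 <= n_clusters <= len(points):
        raise ValueError("n_clusters must be between 1 and len(points)")
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Mean pairwise Euclidean distance between the two clusters.
                d = sum(dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
                d /= len(clusters[a]) * len(clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical scores of six varieties on the first two components F1, F2.
SCORES = [
    (2.1, 0.3), (1.9, 0.1),
    (-0.2, 1.0), (0.0, 0.8),
    (-1.9, -1.1), (-2.0, -0.9),
]

groups = average_linkage(SCORES, n_clusters=3)
```

Because every pair of clusters is compared at each merge, the cost grows quickly with the number of samples, which illustrates the difficulty with large data sets noted above.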
(2) Rationalization-satisfaction degree and multiple value method
The so-called “reasonable–satisfaction” degree refers to the extent to which the characteristics of a fruit variety satisfy people's needs. The reasonable degree is 1 if a characteristic of the variety fully complies with the “rule” and 0 if it does not comply [32]. The advantages of the algorithm are simplicity, ease of calculation, and the ability to distinguish good quality from poor quality. Its disadvantage is a relatively large error. The algorithm reflects people's needs for, and degree of satisfaction with, fruit quality. The method can be used not only to identify fruit quality but also as a reference in fruit tree breeding, especially for the breeding of commercial varieties. At present, the method has been applied to pear [33] and other fruits for the cultivation of good varieties.
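The idea can be illustrated with the following Python sketch. Only the end points, a degree of 1 for full compliance and 0 for non-compliance, come from the description above. The linear interpolation between the two bounds and the weighted-average merge of several indices are assumptions made for illustration, and the thresholds, weights, and values are hypothetical.

```python
def satisfaction(value, unacceptable, ideal):
    """Satisfaction degree in [0, 1] for one quality index.

    Returns 0 at or beyond `unacceptable` and 1 at or beyond `ideal`.
    The linear interpolation in between is an assumption of this sketch.
    It also handles indices where lower is better (ideal < unacceptable).
    """
    if ideal == unacceptable:
        raise ValueError("ideal and unacceptable bounds must differ")
    t = (value - unacceptable) / (ideal - unacceptable)
    return max(0.0, min(1.0, t))


def merge(degrees, weights):
    """Weighted average of per-index degrees (an assumed merging rule)."""
    return sum(d * w for d, w in zip(degrees, weights)) / sum(weights)


# Hypothetical thresholds and measurements for one variety.
degrees = [
    satisfaction(12.0, unacceptable=9.0, ideal=14.0),     # soluble solids, %
    satisfaction(250.0, unacceptable=100.0, ideal=250.0),  # fruit weight, g
    satisfaction(1.2, unacceptable=1.0, ideal=0.3),        # titratable acid, %
]
overall = merge(degrees, weights=[0.5, 0.3, 0.2])
```

Other merging rules, such as a product of the per-index degrees, would penalize a variety more heavily for failing on any single index.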
(3) Principal component cluster combined with rationalization-satisfaction multidimensional value analysis theory of merger rules
This composite evaluation method offers a new way of comprehensively evaluating fruit quality by combining principal component analysis, cluster analysis, and the “reasonable–satisfaction” multidimensional value theory. It can be used to extract the main quality factors of common fruits, which simplifies the evaluation process, and it can also be used in fruit breeding. Its disadvantage is that the computation is tedious and complex and requires a large amount of calculation. This combined method has been applied to the selection of fruit quality assessment factors for mango [34], which simplified the work of mango fruit quality evaluation.
4.3 Parts of Comprehensive Fruit Evaluation Factors
Table 2 summarizes, for selected literature, the data mining methods used for fruit quality evaluation and the main evaluation factors they produced. Fruit quality indicators are numerous: evaluating quality with a single index has clear limitations, whereas evaluating all indicators inevitably requires too much work. Fruit quality data mining methods can effectively reduce the number of evaluation indices and simplify the evaluation process. Jiyun Nie et al. [35] used principal component analysis to select five indicators as the main evaluation factors of apple quality, based on the first four principal components, whose cumulative contribution rate exceeded 95.75 %. Haying Zhang et al. [36] reduced 19 peach quality indicators to five according to principal component analysis, clustering analysis, and the indicator requirements of the national standard GB 10653-1989 for fresh peach. Table 2 shows that data mining methods can effectively reduce the number of evaluation indices, provide a good evaluation of fruit quality, and solve both the limitations of single-index evaluation and the heavy workload of multi-index evaluation. The approach provides new ideas and methods for the evaluation of fruit quality.
5 Conclusion and Prospect
As people’s living standards continue to improve, the demand for fruit quality keeps growing. The search for rapid, simple methods of evaluating fruit quality has become a hot topic in the field of fruit quality analysis. Fruit quality evaluation involves many factors, and different quality factors show varying degrees of correlation and relative independence. Evaluating fruit quality with a single index has clear limitations, because a single indicator can explain only one aspect of quality and cannot evaluate the overall quality of the fruit. Data mining methods can combine multiple quality metrics to obtain a comprehensive and objective assessment of fruit quality. In practical fruit quality assessment, using the relevant data mining methods to determine the main quality evaluation factors of common fruits can substantially reduce the workload of fruit quality appraisal.
Data mining also provides a new way of thinking about, and a new approach to, the selection and breeding of fruit. First, data mining can identify the good traits of prominent fruit varieties, which provides a reference for choosing hybrid parents to improve fruit quality. Second, within a specific variety, data mining methods can distinguish differences in fruit quality and provide a basis for directional breeding and the further improvement of fine varieties, thereby yielding more excellent varieties. For varieties whose overall quality is poor or ordinary but which have a highly prominent individual trait, selecting them for a specific purpose not only helps improve the comprehensive utilization of fruit but is also conducive to determining different uses and making the most of fruits according to their quality characteristics.
With the recent development of cloud platforms, computers, massive databases, networking, and other technologies, data mining will play a more important role in the fruit production, distribution, sales, and consumption sectors. Although studies have become more thorough and research on the various fruit quality assessment methods has matured, the existing methods still struggle to meet the needs of actual production and consumption. Therefore, finding a rapid and easy method for evaluating fruit quality remains a hot topic in the field of fruit quality analysis. Applying data mining methods to more fruits in order to extract the major evaluation factors and simplify the evaluation process will become a new direction in fruit quality research.
References
1. Zhang, J., Hu, J., Zhang, Z.: Research on the fruit industry development in China. J. Shandong Agric. Univ. (Soc. Sci.) 4(3), 31–34 (2002)
2. Chen, M.: Non-destructive Detection of Fruit Internal Quality Based on Portable Near Infrared Spectrometer. Zhejiang University, Hangzhou (2010)
3. Li, J.: Research on the Quality Grading of Fruit Based Machine Vision and NIRS. Nanjing University of Aeronautics and Astronautics, Nanjing (2011)
4. Li, G., Wei, X., Li, L., et al.: Reserching actuality and development of fruit grader. J. Agric. Mech. Res. 9, 20–23 (2009)
5. Sánchez, M.-T., José De la Haba, M., Benítez-López, M., et al.: Non-destructive characterization and quality control of intact strawberries based on NIR spectral data. J. Food Eng. 110(1), 102–108 (2012)
6. Aiguo, O., Xiaoqiang, X., Yande, L.: Selection of NIR variables for online detecting soluble solids content of apple. Trans. Chin. Soc. Agric. Mach. 45(4), 220–225 (2014)
7. Sun, Q., Xie, R., Deng, L., et al.: Analysis of fruit quality and acid constituents of three bred cultivars selected from Jiangjin sweet orange. Food Sci. 36(6), 124–129 (2015)
8. Liu, H., Guo, W., Yue, R.: Non-destructive detection of Kiwi fruit firmness based on near-infrared diffused spectroscopy. Trans. Chin. Soc. Agric. Mach. 42(3), 145–149 (2011)
9. Ni, Z., Zhang, S., Gu, Q., et al.: Evaluation of fruit quality of Nanfeng Tangerine based on multivariate statistics. J. Fruit Sci. 28(5), 918–923 (2011)
10. Dong, Y., Zhang, Y., Liang, M., et al.: Selection of main indexes for evaluating apple fruit quality. Acta Agric. Boreali Sin. 26(S1), 74–79 (2011)
11. Lu, X., Sun, D.: Evaluation on fruit quality of 12 pineapple cultivars introduced from Thailand. Chin. J. Trop. Crops 32(12), 2205–2208 (2011)
12. Tian, R., Hu, H., Yang, X., et al.: Selection of factors for evaluating pear fruit quality. J. Yangtze Univ. (Nat. Sci. Edit.) 6(3), 8–11 (2009)
13. Li, L., Shen, L.: An improved multilevel fuzzy comprehensive evaluation algorithm for security performance. J. Chin. Univ. Posts Telecommun. 13(4), 48–53 (2006)
14. Shen, Q., Zhu, J., Peng, H., et al.: Fuzzy comprehensive evaluation of fruit characteristics in early mature seedling Litchi resource of Southwestern Guangxi. Southwest Chin. J. Agric. Sci. 24(4), 508–515 (2011)
15. Meng, L., Chen, Y., Li, W., et al.: Fuzzy comprehensive evaluation model for water resources carrying capacity in Tarim River Basin, Xinjiang, China. Chin. J. Geogr. Sci. 19(1), 89–95 (2009)
16. Zhu, J., Yu, P., Huang, F., et al.: Quantifying analysis of main fruit characters of Longyan germsplasm in Guangxi. Southwest Chin. J. Agric. Sci. 19(2), 283–286 (2006)
17. Deng, L., He, X., Xu, J., et al.: Diversity analysis of fruit traits and fuzzy comprehensive evaluation of persimmon germplasm resources in Guangxi. Guihaia 33(4), 508–515 (2013)
18. Saaty, T.L.: Decision making – the analytic hierarchy and network processes (AHP/ANP). J. Syst. Sci. Syst. Eng. 13(1), 1–35 (2004)
19. Liu, N., Zhang, Y., Zhao, Z., et al.: Quality evaluation of Jinsixiaozao fruits and its relationship with soil body configuration. J. Agric. Mech. Res. (12), 125–129 (2009)
20. Zhao, J., Qiao, J., Li, H., et al.: Comprehensive evaluation on the internal quality of ‘Yali’ Pear fruit from the main production area in Hebei Province. Northern Hortic. (16), 33–35 (2010)
21. Li, X., Wang, W.: Estimation of apple storage quality properties with mechanical property based on grey system theory. Trans. Chin. Soc. Agric. Eng. (2), 80–86 (2005)
22. Lin, B., Gao, S., Lin, F., et al.: Grey relational grade analysis on yield characteristics of inbred melons. Fujian J. Agric. Sci. 23(2), 178–181 (2008)
23. Zhang, X., Sun, Y., Wang, Y., et al.: Determination of freezing point temperature of different yellow peach cultivars and correlation analysis of impact factors. J. Food Sci. Technol. 31(4), 37–41 (2013)
24. Zhang, W., Wei, X., Long, Y., et al.: Gray correlation degree analysis on main agronomic traits of Amomum tsao-ko fruits. J. Honghe Univ. 11(2), 52–53 (2013)
25. Sai, T.Y., Li, J.S., Wang, S.B.: Principal component analysis in construction of 3D human knee joint models using a statistical shape model method. Comput. Methods Biomech. Biomed. Eng. 18(7), 721–729 (2015)
26. Xu, C., Gao, D.: Comprehensive evaluation on fruit quality of peach cultivars in green house based on principal component analysis. Sci. Technol. Food Ind. 23, 84–94 (2014)
27. Qin, H., Xu, P., Ai, J., et al.: Diversity of fruit quality and phenotypic traits of Actinidia arguta Planch Germplasm resources and their principal component analysis. Chin. Agric. Sci. Bull. 31(1), 160–165 (2015)
28. Wang, X., Bi, J., Liu, X., et al.: Different origin Fuji apple quality evaluation factors choice. J. Nucl. Agric. Sci. 27(10), 1501–1510 (2013)
29. Ding, C.: Principal component analysis of water quality monitoring data in XiaSha region. Environ. Transp. Eng., 2321–2324 (2011)
30. Zhao, S., Yuan, D., Zhang, L., et al.: Study of the screening of sand pear (Pyrus pyrifolia) cultivars. J. Cent. South Univ. Forest. Technol. (Nat. Sci.) 27(1), 30–34 (2007)
31. Han, Z., Jiang, B.: A study on comprehensive evaluation of the processing tomato varieties multiple traits. Sci. Agric. Sin. 47(2), 357–365 (2014)
32. Niu, J., Liu, M., Peng, Q.: Analysis of main index components and cluster of Plumcot “Weihou” during different harvest periods. Xinjiang Agric. Sci. 52(1), 33–36 (2015)
33. Ayiguli, T., Yusufu, A., Patiman, A., et al.: Analysis and evaluation on of fruit quality of main Pear varieties in Xinjiang. Xinjiang Agric. Sci. 51(3), 417–422 (2014)
34. Xin, M., Zhang, E., He, Q., et al.: Selection of evaluation factors for mango fruit quality. J. South. Agric. 45(10), 1818–1824 (2014)
35. Nie, J., Li, Z., Li, H., et al.: Evaluation indices for apple physicochemical quality. Sci. Agric. Sin. 45(14), 2895–2903 (2012)
36. Zhang, H., Tao, H., Wang, Y., et al.: Selection of factors for evaluating peach (Prunus persica) fruit quality. Trans. CSAE 22(8), 235–239 (2006)
Acknowledgment
Funding for this research was provided by the Beijing Municipal Science and Technology Commission “The capital of food safety science and technology excellence special cultivation” project – “near infrared fruit quality rapid nondestructive testing equipment research and development” (Z141100002614021123).
© 2016 IFIP International Federation for Information Processing
About this paper
Cite this paper
Hou, J., Wang, D., Jia, W., Pan, L. (2016). Commentary on Application of Data Mining in Fruit Quality Evaluation. In: Li, D., Li, Z. (eds) Computer and Computing Technologies in Agriculture IX. CCTA 2015. IFIP Advances in Information and Communication Technology, vol 479. Springer, Cham. https://doi.org/10.1007/978-3-319-48354-2_51
DOI: https://doi.org/10.1007/978-3-319-48354-2_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48353-5
Online ISBN: 978-3-319-48354-2