OVERSAMPLING METHOD FOR ENVIRONMENTAL MONITORING DATA AUGMENTATION IN CADMIUM-POLLUTED PADDY FIELD

Peitong Hao∗,† Yue Li∗∗,∗∗∗,† Weiman Xu,∗∗ Cong Li,∗∗ Mingzhu Huo,∗ Xiaoyu Zhang,∗ and Yi An∗

References

  1. [1] L.Y. Luo, L.L. Xie, D.C. Jin, B.B. Mi, D.H. Wang, X.F. Li, X.Z. Dai, X.X. Zou, Z. Zhang, and Y.Q. Ma, Bacterial community response to cadmium contamination of agricultural paddy soil, Applied Soil Ecology, 139, 2019, 100–106.
  2. [2] J.H. Liu, H. Hou, L. Zhao, Z.J. Sun, Y.F. Lu, and H. Li, Mitigation of Cd accumulation in rice from Cd-contaminated paddy soil by foliar dressing of S and P, Science of the Total Environment, 690, 2019, 321–328.
  3. [3] F. Ghoreishy, M. Salehi, and J. Fallahzade, Cadmium and lead in rice grains and wheat breads in Isfahan (Iran) and human health risk assessment, Human and Ecological Risk Assessment, 25(4), 2019, 924–934.
  4. [4] K. Liu, J.L. Lv, W.X. He, H. Zhang, Y.F. Cao, and Y.C. Dai, Major factors influencing cadmium uptake from the soil into wheat plants, Ecotoxicology and Environmental Safety, 113, 2015, 207–213.
  5. [5] H.Y. Hu, Z.J Li, Y. Feng, Y.W. Liu, J.M. Xue, M. Davis, and Y.C. Liang, Prediction model for mercury transfer from soil to corn grain and its cross-species extrapolation, Journal of Integrative Agriculture, 15(10), 2016, 2393–2402.
  6. [6] M.A. Halim, R.K. Majumder, and M.N. Zaman, Paddy soil heavy metal contamination and uptake in rice plants from the adjacent area of Barapukuria coalmine, northwest Bangladesh, Arabian Journal of Geosciences, 8, 2015, 3391–3401.
  7. [7] B. Wu, S.H. Guo, L.Y. Zhang, and F.M. Li, Risk forewarning model for rice grain Cd pollution based on Bayes theory, Science of the Total Environment, 618, 2018, 1343–1349.
  8. [8] X.M. Wang, X.J. Li, R.Y. Ma, Y. Li, W. Wang, H.Y. Huang, C.Z. Xu, and Y. An, Quadratic discriminant analysis model for assessing the risk of cadmium pollution for paddy fields in a county in China, Environmental Pollution, 236, 2018, 366–372.
  9. [9] J. Yang, C. Zhao, J.X. Yang, J.Y. Wang, Z.T. Li, X.M. Wan, G.H. Guo, M. Lei, and T.B. Chen, Discriminative algorithm approach to forecast Cd threshold exceedance probability for rice grain based on soil characteristics, Environmental Pollution, 261, 2020, 114211.
  10. [10] Y.M. Wang, S.H. Wu, D.H. Yan, F.F. Li, C.C. Wang, C. Min, and W.Y. Sun, Determining and mapping the spatial 6 mismatch between soil and rice cadmium (Cd) pollution based on a decision tree model, Environmental Pollution, 265, 2020, 115029.
  11. [11] S.W. Deng, J. Yu, Y.T. Wang, S.Q. Xie, Z.X. Ran, and W. Wei, Distribution, transfer, and time-dependent variation of Cd in soil-rice system: A case study in the Chengdu plain, Southwest China, Soil and Tillage Research, 195, 2019, 104367.
  12. [12] R. Malhotra and S. Kamal, An empirical study to investigate oversampling methods for improving software defect prediction using imbalanced data, Neurocomputing, 343, 2019, 120–140.
  13. [13] W.H. Han, Z.Z. Huang, S.D. Li, and Y. Jia, Distributionsensitive unbalanced data oversampling method for medical diagnosis, Journal of Medical Systems, 43(2), 2019, 39.
  14. [14] N.V. Chawla, K.W. Bowyer, and L.O. Hall, SMOTE: Synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 16, 2002, 321–357.
  15. [15] T.L Zhang, Y. Li, X.Z. Wang, Gaussian prior based adaptive synthetic sampling with non-linear sample space for imbalanced learning, Knowledge-Based Systems, 191, 2020, 105231.
  16. [16] R. Zhu, Y.W. Guo, and J.H. Xue, Adjusting the imbalance ratio by the dimensionality of imbalanced data, Pattern Recognition Letters, 133, 2020, 217–223.
  17. [17] V. Matamoros, C. Nuno, R. Victoria, O. Hernandez, and C. Ibanez, The role of rice fields and constructed wetlands as a source and a sink of pesticides and contaminants of emerging concern: Full-scale evaluation, Ecological Engineering, 156, 2020, 105971.
  18. [18] M.U. Khan, R.N. Malik, S. Muhammad, F. Ullah, and A. Qadir, Health risk assessment of consumption of heavy metals in market food crops from Sialkot and Gujranwala districts, Pakistan, Human and Ecological Risk Assessment, 2, 2015, 21.
  19. [19] S. Saha, M. Saha, K. Mukherjee, A. Arabameri, and P.T.T.G.C. Paul, Predicting the deforestation probability using the binary logistic regression, random forest, ensemble rotational forest, REPTree: A case study at the Gumani River Basin, India, Science of the Total Environment, 730, 2020, 139197.
  20. [20] C.T. Zhang and Y. Yang, Modeling the spatial variations in anthropogenic factors of soil heavy metal accumulation by geographically weighted logistic regression, Science of the Total Environment, 717, 2020, 137096.
  21. [21] Y. Lu, Y.M. Cheung, and Y.Y. Tang, Bayes imbalance impact index: A measure of class imbalanced dataset for classification problem, IEEE Transactions on Neural Networks and Learning Systems, 99, 2019, 1–15.
  22. [22] S. Ebenuwa, M.S. Sharif, M. Alazab, and A. Al-Nemrat, Variance ranking attributes selection techniques for binary classification problem in imbalance data, IEEE Access, 2019, 24649–24666.
  23. [23] K. Maxwell, M. Rajabi, and J. Esterle, Automated classification of metamorphosed coal from geophysical log data using supervised machine learning techniques, International Journal of Coal Geology, 214, 2019, 103284.
  24. [24] C. Shang and F.Q. You, Data analytics and machine learning for smart process manufacturing: recent advances and perspectives in the big data era, Engineering, 5(6), 2019, 1010–1016.
  25. [25] E. Hüllermeier, Fuzzy methods in machine learning and data mining: Status and prospects, Fuzzy Sets and Systems, 156(3), 2005, 387–406.
  26. [26] W.L. Feng, Z.H. Guo, X. Xiao, C. Peng, L. Shi, H.Z. Ran, and W.X. Xu, Atmospheric deposition as a source of cadmium and lead to soil-rice system and associated risk assessment, Ecotoxicology and Environmental Safety, 180, 2019, 160–167.
  27. [27] Y.F. Wang, Y. Su, and S.G. Lu, Predicting accumulation of Cd in rice (Oryza sativa L.) and soil threshold concentration of Cd for rice safe production, Science of the Total Environment, 738, 2020, 139805.
  28. [28] Y.T. Bi and D.R. Jeske, The efficiency of logistic regression compared to normal discriminant analysis under class-conditional classification noise, Journal of Multivariate Analysis, 101(7), 2010, 1622–1637.
  29. [29] M.L. Zhang, H. Qu, and J.P. Li, Feedforward computational model for pattern recognition with spiking neurons, International Journal of Robotics and Automation, 33(1), 2018, 81–88.

Important Links:

Go Back