Digital soil class mapping using legacy soil profile data: a comparison of a genetic algorithm and classification tree approach
M. A. Nelson A B and I. O. A. Odeh AA Faculty of Agriculture, Food and Natural Resources, The University of Sydney, NSW, Australia.
B Corresponding author. Email: michael.n@usyd.edu.au
Australian Journal of Soil Research 47(6) 632-649 https://doi.org/10.1071/SR08224
Submitted: 1 October 2008 Accepted: 21 May 2009 Published: 30 September 2009
Abstract
Digital soil class mapping (DSCM) provides a means of meeting the growing global demand for soil information. The search for optimal models for digital soil class mapping to take advantage of increasing availability of ancillary information, such as gamma radiometric data, is ongoing. One of the novel approaches to DSCM is based on genetic algorithms, which provide predictive function for DSCM. This paper aims: to develop a scheme for implementing genetic algorithms for rule-set production (GARP) in digital soil class mapping; to compare the performance of GARP and classification tree model (CT); and to evaluate the usefulness of gamma radiometrics as a predictor for DSCM of legacy soil data. We first collated the legacy soil class data from databases of soil profiles and the associated ancillary data from disparate sources. We then created a 200-m resolution DSCM based on the Australian Soil Classification, for the Namoi catchment in north-western New South Wales, using GARP based on the general scorpan-sspfe model and compared the GARP performance with the widely used CT model. Elevation, terrain attributes, magnetic survey, land use, NDVI, and, where available, radiometric data were used as the ancillary variables. In this implementation, inclusion of radiometric data in either of the prediction models significantly improved the classification accuracy and the resulting DSCM. Based on various classification and prediction performance measures, GARP was shown to be outperformed by the CT. We conclude that GARP needs further improvement for its full potential to be realised for digitally mapping soil classes.
Additional keywords: digital soil mapping, scorpan, genetic algorithm, GARP, classification tree.
Acknowledgements
The authors acknowledge the support of the Cotton Catchment Communities Cooperative Research Centre for their financial support through their Summer Scholarship program for the first author. We also thank Dr Budiman Minasny of the University of Sydney for his suggestion on some of the techniques used in this paper.
Anderson RP,
Gomez-Laverde M, Peterson AT
(2002a) Geographical distributions of spiny pocket mice in South America: insights from predictive models. Global Ecology and Biogeography 11, 131–141.
| Crossref | GoogleScholarGoogle Scholar |
Anderson RP,
Lew D, Peterson AT
(2003) Evaluating predictive models of species’ distributions: criteria for selecting optimal models. Ecological Modelling 162, 211–232.
| Crossref | GoogleScholarGoogle Scholar |
Anderson RP,
Peterson AT, Gomez-Laverde M
(2002b) Using niche-based GIS modeling to test geographic predictions of competitive exclusion and competitive release in South American pocket mice. Oikos 98, 3–16.
| Crossref | GoogleScholarGoogle Scholar |
Bailey N,
Clements T,
Lee JT, Thompson S
(2003) Modelling soil series data to facilitate targeted habitat restoration: a polytomous logistic regression approach. Journal of Environmental Management 67, 395–407.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
Behrens T,
Forster H,
Scholten T,
Steinrucken U,
Spies ED, Goldschmitt M
(2005) Digital soil mapping using artificial neural networks. Journal of Plant Nutrition and Soil Science-Zeitschrift Fur Pflanzenernahrung Und Bodenkunde 168, 21–33.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Bishop TFA,
McBratney AB, Whelan BM
(2001) Measuring the quality of digital soil maps using information criteria. Geoderma 103, 95–111.
| Crossref | GoogleScholarGoogle Scholar |
Bishop TFA,
Minasny B, McBratney AB
(2006) Uncertainty analysis for soil-terrain models. International Journal of Geographical Information Science 20, 117–134.
| Crossref | GoogleScholarGoogle Scholar |
Bui EN,
Loughhead A, Corner R
(1999) Extracting soil-landscape rules from previous soil surveys. Australian Journal of Soil Research 37, 495–508.
| Crossref | GoogleScholarGoogle Scholar |
Bui EN, Moran CJ
(2003) A strategy to fill gaps in soil survey over large spatial extents: an example from the Murray-Darling basin of Australia. Geoderma 111, 21–44.
| Crossref | GoogleScholarGoogle Scholar |
Carre F,
McBratney AB,
Mayr T, Montanarella L
(2007) Digital soil assessments: Beyond DSM. Geoderma 142, 69–79.
| Crossref | GoogleScholarGoogle Scholar |
Chen GJ, Peterson AT
(2002) Prioritization of areas in China for the conservation of endangered birds using modelled geographic distributions. Bird Conservation International 12, 197–209.
| Crossref | GoogleScholarGoogle Scholar |
Cohen J
(1960) A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37–46.
| Crossref | GoogleScholarGoogle Scholar |
Cook SE,
Corner RJ,
Groves PR, Grealish GJ
(1996) Use of airborne gamma radiometric data for soil mapping. Australian Journal of Soil Research 34, 183–194.
| Crossref | GoogleScholarGoogle Scholar |
Gallant JC, Dowling TI
(2003) A multiresolution index of valley bottom flatness for mapping depositional areas. Water Resources Research 39(12), 1347.
| Crossref | GoogleScholarGoogle Scholar |
Giasson E,
Clarke RT,
Inda AV,
Merten GH, Tornquist CG
(2006) Digital soil mapping using multiple logistic regression on terrain parameters in Southern Brazil. Scientia Agricola 63, 262–268.
| Crossref | GoogleScholarGoogle Scholar |
Hammond EH
(1964) Analysis of properties in land form geography – an application to broad-scale land form mapping. Annals of the Association of American Geographers 54, 11–19.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Hengl T,
Toomanian N,
Reuter HI, Malakouti MJ
(2007) Methods to interpolate soil categorical variables from profile observations: lessons from Iran. Geoderma 140, 417–427.
| Crossref | GoogleScholarGoogle Scholar |
Jaques AL,
Wellman P,
Whitaker A, Wyborn D
(1997) High-resolution geophysics in modern geological mapping. AGSO Journal of Australian Geology & Geophysics 17, 159–173.
Lagacherie P, Holmes S
(1997) Addressing geographical data errors in a classification tree for soil unit prediction. International Journal of Geographical Information Science 11, 183–198.
| Crossref | GoogleScholarGoogle Scholar |
Landis JR, Koch GG
(1977) The measurement of observer agreement for categorical data. Biometrics 33, 159–174.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
PubMed |
McBratney AB,
Odeh IOA,
Bishop TFA,
Dunbar MS, Shatar TM
(2000) An overview of pedometric techniques for use in soil survey. Geoderma 97, 293–327.
| Crossref | GoogleScholarGoogle Scholar |
McBratney AB,
Santos MLM, Minasny B
(2003) On digital soil mapping. Geoderma 117, 3–52.
| Crossref | GoogleScholarGoogle Scholar |
Minasny B, McBratney AB
(2007) Incorporating taxonomic distance into spatial prediction and digital mapping of soil classes. Geoderma 142, 285–293.
| Crossref | GoogleScholarGoogle Scholar |
Moran CJ, Bui EN
(2002) Spatial data mining for enhanced soil map modelling. International Journal of Geographical Information Science 16, 533–549.
| Crossref | GoogleScholarGoogle Scholar |
Oberhauser K, Peterson AT
(2003) Modeling current and future potential wintering distributions of eastern North American monarch butterflies. Proceedings of the National Academy of Sciences of the United States of America 100, 14 063–14 068.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
PubMed |
Odeh IOA,
Todd AJ, Triantafilis J
(2003) Spatial prediction of soil particle-size fractions as compositional data. Soil Science 168, 501–515.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Peterson AT, Cohoon KP
(1999) Sensitivity of distributional prediction algorithms to geographic data completeness. Ecological Modelling 117, 159–164.
| Crossref | GoogleScholarGoogle Scholar |
Peterson AT,
Papeş M, Kluza DA
(2003) Predicting the potential invasive distributions of four alien plant species in North America. Weed Science 51, 863–868.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Peterson AT,
Papeş M,
Reynolds MG,
Perry ND,
Hanson B,
Regnery RL,
Hutson CL,
Muizniek B,
Damon IK, Carroll DS
(2006) Native-range ecology and invasive potential of Cricetomys in North America. Journal of Mammalogy 87, 427–432.
| Crossref | GoogleScholarGoogle Scholar |
Peterson AT,
Scachetti-Pereira R, Hargrove WW
(2004) Potential geographic distribution of Anoplophora glabripennis (Coleoptera: Cerambycidae) in North America. American Midland Naturalist 151, 170–178.
| Crossref | GoogleScholarGoogle Scholar |
Pickup G, Marks A
(2000) Identifying large-scale erosion and deposition processes from airborne gamma radiometrics and digital elevation models in a weathered landscape. Earth Surface Processes and Landforms 25, 535–557.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Raimundo RLG,
Fonseca RL,
Scachetti-Pereira R, Townsend Peterson A
(2007) Native and exotic distributions of siamweed modeled using the genetic algorithm for rule-set production. Weed Science 55, 41–48.
| Crossref | GoogleScholarGoogle Scholar |
CAS |
Roura-Pascual N,
Suarez AV,
Gomez C,
Pons P,
Touyama Y,
Wild AL, Peterson AT
(2004) Geographical potential of Argentine ants (Linepithema humile Mayr) in the face of global climate change. Proceedings of the Royal Society of London. Series B. Biological Sciences 271, 2527–2534.
| Crossref | GoogleScholarGoogle Scholar |
Stockwell D,
Beach JH,
Stewart A,
Vorontsov G,
Vieglais D, Pereira RS
(2006) The use of the GARP genetic algorithm and internet grid computing in the Lifemapper world atlas of species biodiversity. Ecological Modelling 195, 139–145.
| Crossref | GoogleScholarGoogle Scholar |
Stockwell D, Peters D
(1999) The GARP modelling system: problems and solutions to automated spatial prediction. International Journal of Geographical Information Science 13, 143–158.
| Crossref | GoogleScholarGoogle Scholar |
Stockwell DRB, Noble IR
(1992) Induction of sets of rules from animal distribution data: a robust and informative method of data analysis. Mathematics and Computers in Simulation 33, 385–390.
| Crossref | GoogleScholarGoogle Scholar |
Stockwell DRB, Peterson AT
(2002) Effects of sample size on accuracy of species distribution models. Ecological Modelling 148, 1–13.
| Crossref | GoogleScholarGoogle Scholar |
Wang R, Wang YZ
(2006) Invasion dynamics and potential spread of the invasive alien plant species Ageratina adenophora (Asteraceae) in China. Diversity & Distributions 12, 397–408.
| Crossref | GoogleScholarGoogle Scholar |
Wilford JR,
Bierwith PN, Craig MA
(1997) Application of airborne gamma-ray spectrometry in soil/regolith mapping and applied geomorphology. AGSO Journal of Australian Geology & Geophysics 17, 201–216.
Young RW,
Young ARM,
Price DM, Wray RAL
(2002) Geomorphology of the Namoi alluvial plain, northwestern New South Wales. Australian Journal of Earth Sciences 49, 509–523.
| Crossref | GoogleScholarGoogle Scholar |
Zhang L,
Beavis SG, Gray SD
(1999) Development of a spatial database for large-scale catchment management: geology, soils and landuse in the Namo Basin, Australia. Environment International 25, 853–860.
| Crossref | GoogleScholarGoogle Scholar |