Applicability of Data Mining Techniques for Predicting Electrical Resistivity of Soils Based on Thermal Resistivity

doi:10.1061/(ASCE)GM.1943-5622.0000253

Home
/
Papers
/
Applicability of Data Mining Techniques for Predicting Electrical Resistivity of Soils Based on Thermal Resistivity

Journal Article•DOI•

Applicability of Data Mining Techniques for Predicting Electrical Resistivity of Soils Based on Thermal Resistivity

Pijush Samui¹•Institutions (1)

VIT University¹

01 Oct 2013-International Journal of Geomechanics (American Society of Civil Engineers)-Vol. 13, Iss: 5, pp 692-697

TL;DR: In this article, two data mining techniques, support vector machine (SVM) and least squares SVM (LSSVM), were used for prediction of soil electrical resistivity based on soil pr...

read less

Abstract: This article adopts two data mining techniques, support vector machine (SVM) and least-squares support vector machine (LSSVM), for prediction of soil electrical resistivity based on soil pr...

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

A cross-disciplinary comparison of multimodal data fusion approaches and applications: Accelerating learning through trans-disciplinary information sharing

[...]

Rohit Bokade¹, Alfred Navato¹, Ruilin Ouyang¹, Xiaoning Jin¹, Chun-An Chou¹, Sarah Ostadabbas¹, Amy V. Mueller¹ - Show less +3 more•Institutions (1)

Northeastern University¹

01 Mar 2021-Expert Systems With Applications

TL;DR: An apples-to-apples comparison of the literature of nine stakeholder-centric engineering domains reveals that many problem types are shared across different domains and are approached differently in those domains, e.g., transportation problems have similar characteristics to critical care, food science, robotics, and civil engineering.

...read moreread less

Abstract: Multimodal data fusion (MMDF) is the process of combining disparate data streams (of different dimensionality, resolution, type, etc.) to generate information in a form that is more understandable or usable. Despite the explosion of data availability in recent decades, as yet there is no well-developed theoretical basis for multimodal data fusion, i.e., no way to determine a priori which approach is best suited to combine an arbitrary set of available data to achieve a stated goal for a given application. This has resulted in exploration of a wide variety of approaches across numerous domains but as yet very little integration of conclusions at a meta (cross-disciplinary) level. In response, this manuscript poses the following questions: (1) How convergent (or divergent) are approaches within single disciplines? (2) How similar are the challenges posed across different disciplines, i.e., might there be opportunity for successes in MMDF achieved in one field to inform progress in other areas as well? and (3) Where are the outstanding gaps in MMDF research, and what does this imply as targets for high impact research in the coming years? To begin to answer these questions, an apples-to-apples comparison of the literature of nine stakeholder-centric engineering domains (civil engineering, transportation, energy, environmental engineering, food engineering, critical care (healthcare), neuroscience, manufacturing/automation, and robotics) was created by quantifying the numbers and dimensionalities of modalities and sensors in each published project and classifying the algorithms used and purposes for which they are used. Within disciplines, it is shown there is often a tendency for use of similar methodologies, both in choice of level of fusion and data algorithm class. Yet this analysis also reveals that many problem types (defined by data dimensionality, modality number and type, and fusion purpose) are shared across different domains and are approached differently in those domains, e.g., transportation problems have similar characteristics to critical care, food science, robotics, and civil engineering. Of the disciplines studied, most ( > 75 %) share problem characteristics with 3–5 others; to support leveraging these resources, lookup tables indexed by data dimensions, number of modalities, etc. are provided as a starting point for cross-disciplinary MMDF literature searches for new applications. Critical gaps identified are (1) a drop off of the number of published studies with increasing number of distinct modalities and (2) a dearth of publications tackling challenges with high dimensionality inputs, especially time-series 2D and 3D data. These gaps may point to topics where algorithm development will be fruitful to enable future solutions as video and other high-dimensionality sensors decrease in price. Finally, the lack of a shared vocabulary across disciplines makes analyses like the one conducted here challenging, as does the often implicit incorporation of expert knowledge into design; therefore progress toward a better leveraging of the current state of knowledge and toward a theoretical MMDF framework depends critically on improved cross-disciplinary communication and coordination on this topic.

...read moreread less

20 citations

Journal Article•DOI•

Enhanced semi‐supervised ensemble machine learning approach for earthwork construction simulation activity sequence automatically updating driven by weather data

[...]

Jun Zhang, Jia Yu, Peng Yu, Xiaoling Wang, Dawei Tong, Jianjun Wang - Show less +2 more

01 Jun 2023-Geological Journal

TL;DR: In this paper , the authors developed a labour-saving activity sequence classification model for earthwork construction simulation which realizes flexibly modifying simulation activities according to different weather conditions, and three heterogeneous semi-supervised classifiers with complementary characteristics were ensembled to reduce the workload of manual labelling and enhance the generalization ability of activity sequences classification based on weather data.

...read moreread less

Abstract: Construction activity sequence changes are among the most crucial considerations in establishing construction simulation models. However, conventional simulation models are designed with a fixed sequence of simulation activities, which is unable to update automatically. Existing machine learning methods require abundant time for manual labelling and are unsuitable for construction data with high‐dimensional and heterogeneous characteristics. The motivation for this work is to develop a labour‐saving activity sequence classification model for earthwork construction simulation which realizes flexibly modifying simulation activities according to different weather conditions. Three heterogeneous semi‐supervised classifiers with complementary characteristics were ensembled to reduce the workload of manual labelling and enhance the generalization ability of activity sequence classification based on weather data. Furthermore, Dempster–Shafer‐based evidence reasoning improved by a security filtering mechanism was adopted to enhance the accuracy of semi‐supervised classification. The proposed enhanced ensemble semi‐supervised activity sequence classification model was embedded in an earthwork construction simulation model, which was evaluated in a case study of rockfill dam construction. The proposed classifier outperformed four common semi‐supervised methods and two common supervised methods in terms of accuracy and generalization ability. Additionally, the proposed simulation method outperformed conventional simulation methods in terms of the construction schedule, construction intensity and consistency of the simulated activity sequence with the true values by 65.55%, 28.47% and 88.15%, respectively.

...read moreread less

1 citations

Journal Article•DOI•

Prediction of soil thermal conductivity using artificial intelligence approaches

[...]

Xiaojie Yuan, Xinhua Xue

01 Sep 2023-Geothermics

TL;DR: In this paper , three artificial intelligence models, namely group method of data handing (GMDH), multi expression programming (MEP), and random forest (RF), are proposed to predict soil thermal conductivity.

...read moreread less

References

PDF

Open Access

More filters

Journal Article•DOI•

Support-Vector Networks

[...]

Corinna Cortes¹, Vladimir Vapnik¹•Institutions (1)

Bell Labs¹

15 Sep 1995-Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

Abstract: The support-vector network is a new learning machine for two-group classification problems. The machine conceptually implements the following idea: input vectors are non-linearly mapped to a very high-dimension feature space. In this feature space a linear decision surface is constructed. Special properties of the decision surface ensures high generalization ability of the learning machine. The idea behind the support-vector network was previously implemented for the restricted case where the training data can be separated without errors. We here extend this result to non-separable training data. High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated. We also compare the performance of the support-vector network to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

37,861 citations

"Applicability of Data Mining Techni..." refers background or methods in this paper

...More details are found in many publications (Boser et al. 1992; Cortes and Vapnik 1995; Gualtieri et al. 1999; Vapnik 1998)....
[...]
...The SVM algorithm developed by Vapnik (Cortes and Vapnik 1995) is based on statistical learning theory....
[...]

Statistical learning theory

[...]

Vladimir Vapnik

01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

Abstract: A comprehensive look at learning and generalization theory. The statistical theory of learning and generalization concerns the problem of choosing desired functions on the basis of empirical data. Highly applicable to a variety of computer science and robotics fields, this book offers lucid coverage of the theory as a whole. Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

26,531 citations

Proceedings Article•DOI•

A training algorithm for optimal margin classifiers

[...]

Bernhard E. Boser¹, Isabelle Guyon², Vladimir Vapnik²•Institutions (2)

University of California, Berkeley¹, Bell Labs²

01 Jul 1992

TL;DR: A training algorithm that maximizes the margin between the training patterns and the decision boundary is presented, applicable to a wide variety of the classification functions, including Perceptrons, polynomials, and Radial Basis Functions.

...read moreread less

Abstract: A training algorithm that maximizes the margin between the training patterns and the decision boundary is presented. The technique is applicable to a wide variety of the classification functions, including Perceptrons, polynomials, and Radial Basis Functions. The effective number of parameters is adjusted automatically to match the complexity of the problem. The solution is expressed as a linear combination of supporting patterns. These are the subset of training patterns that are closest to the decision boundary. Bounds on the generalization performance based on the leave-one-out method and the VC-dimension are given. Experimental results on optical character recognition problems demonstrate the good generalization obtained when compared with other learning algorithms.

...read moreread less

11,211 citations

Journal Article•DOI•

Least Squares Support Vector Machine Classifiers

[...]

Johan A. K. Suykens¹, Joos Vandewalle¹•Institutions (1)

Katholieke Universiteit Leuven¹

01 Jun 1999-Neural Processing Letters

TL;DR: A least squares version for support vector machine (SVM) classifiers that follows from solving a set of linear equations, instead of quadratic programming for classical SVM's.

...read moreread less

Abstract: In this letter we discuss a least squares version for support vector machine (SVM) classifiers. Due to equality type constraints in the formulation, the solution follows from solving a set of linear equations, instead of quadratic programming for classical SVM‘s. The approach is illustrated on a two-spiral benchmark classification problem.

...read moreread less

8,811 citations

Proceedings Article•

Support Vector Regression Machines

[...]

Harris Drucker¹, Christopher John Burges, Linda Kaufman², Alexander J. Smola², Vladimir Vapnik³ - Show less +1 more•Institutions (3)

Monmouth University¹, Bell Labs², AT&T Labs³

03 Dec 1996

TL;DR: This work compares support vector regression (SVR) with a committee regression technique (bagging) based on regression trees and ridge regression done in feature space and expects that SVR will have advantages in high dimensionality space because SVR optimization does not depend on the dimensionality of the input space.

...read moreread less

Abstract: A new regression technique based on Vapnik's concept of support vectors is introduced. We compare support vector regression (SVR) with a committee regression technique (bagging) based on regression trees and ridge regression done in feature space. On the basis of these experiments, it is expected that SVR will have advantages in high dimensionality space because SVR optimization does not depend on the dimensionality of the input space.

...read moreread less

4,009 citations