Keyword search (4,163 papers available)

"dataset" Keyword-tagged Publications:

Title Authors PubMed ID
1 ADPv2: A hierarchical histological tissue type-annotated dataset for potential biomarker discovery of colorectal disease Yang Z; Li K; Ramandi SG; Brassard P; Khellaf A; Trinh VQ; Zhang J; Chen L; Rowsell C; Varma S; Plataniotis K; Hosseini MS; 41658283
ENCS
2 CACTUS: An open dataset and framework for automated Cardiac Assessment and Classification of Ultrasound images using deep transfer learning Elmekki H; Alagha A; Sami H; Spilkin A; Zanuttini AM; Zakeri E; Bentahar J; Kadem L; Xie WF; Pibarot P; Mizouni R; Otrok H; Singh S; Mourad A; 40107020
ENCS
3 MuscleMap: An Open-Source, Community-Supported Consortium for Whole-Body Quantitative MRI of Muscle McKay MJ; Weber KA; Wesselink EO; Smith ZA; Abbott R; Anderson DB; Ashton-James CE; Atyeo J; Beach AJ; Burns J; Clarke S; Collins NJ; Coppieters MW; Cornwall J; Crawford RJ; De Martino E; Dunn AG; Eyles JP; Feng HJ; Fortin M; Franettovich Smith MM; Galloway G; Gandomkar Z; Glastras S; Henderson LA; Hides JA; Hiller CE; Hilmer SN; Hoggarth MA; Kim B; Lal N; LaPorta L; Magnussen JS; Maloney S; March L; Nackley AG; O' Leary SP; Peolsson A; Perraton Z; Pool-Goudzwaard AL; Schnitzler M; Seitz AL; Semciw AI; Sheard PW; Smith AC; Snodgrass SJ; Sullivan J; Tran V; Valentin S; Walton DM; Wishart LR; Elliott JM; 39590726
HKAP
4 CosSIF: Cosine similarity-based image filtering to overcome low inter-class variation in synthetic medical image datasets Islam M; Zunair H; Mohammed N; 38492455
ENCS
5 Firefly (Coleoptera, Lampyridae) species from the Atlantic Forest hotspot, Brazil Vaz S; Mendes M; Khattar G; Macedo M; Ronquillo C; Zarzo-Arias A; Hortal J; Silveira L; 38327309
CONCORDIA
6 Analysis of input set characteristics and variances on k-fold cross validation for a Recurrent Neural Network model on waste disposal rate estimation Vu HL; Ng KTW; Richter A; An C; 35287077
ENCS
7 Multimodal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms Masoumi N; Belasso CJ; Ahmad MO; Benali H; Xiao Y; Rivaz H; 33683544
PERFORM
8 Augmented reality mastectomy surgical planning prototype using the HoloLens template for healthcare technology letters. Amini S, Kersten-Oertel M 32038868
PERFORM

 

Title:Analysis of input set characteristics and variances on k-fold cross validation for a Recurrent Neural Network model on waste disposal rate estimation
Authors:Vu HLNg KTWRichter AAn C
Link:https://pubmed.ncbi.nlm.nih.gov/35287077/
DOI:10.1016/j.jenvman.2022.114869
Publication:Journal of environmental management
Keywords:Dataset partitionDataset skewness and varianceK-fold cross validationLandfill disposal ratesMunicipal solid waste managementRecurrent neural network
PMID:35287077 Category: Date Added:2022-03-15
Dept Affiliation: ENCS
1 Environmental Systems Engineering, Faculty of Engineering and Applied Science, University of Regina, Regina, Saskatchewan, S4S 0A2, Canada.
2 Environmental Systems Engineering, Faculty of Engineering and Applied Science, University of Regina, Regina, Saskatchewan, S4S 0A2, Canada. Electronic address: kelvin.ng@uregina.ca.
3 Department of Building, Civil, and Environmental Engineering, Concordia University, 1455 Boulevard de Maisonneuve O, Montréal, Quebec, H3G 1M8, Canada.

Description:

The use of machine learning techniques in waste management studies is increasingly popular. Recent literature suggests k-fold cross validation may reduce input dataset partition uncertainties and minimize overfitting issues. The objectives are to quantify the benefits of k-fold cross validation for municipal waste disposal prediction and to identify the relationship of testing dataset variance on predictive neural network model performance. It is hypothesized that the dataset characteristics and variances may dictate the necessity of k-fold cross validation on neural network waste model construction. Seven RNN-LSTM predictive models were developed using historical landfill waste records and climatic and socio-economic data. The performance of all trials was acceptable in the training and validation stages, with MAPE all less than 10%. In this study, the 7-fold cross validation reduced the bias in selection of testing sets as it helps to reduce MAPE by up to 44.57%, MSE by up to 54.15%, and increased R value by up to 8.33%. Correlation analysis suggests that fewer outliers and less variance of the testing dataset correlated well with lower modeling error. The length of the continuous high waste season and length of total high waste period appear not important to the model performance. The result suggests that k-fold cross validation should be applied to testing datasets with higher variances. The use of MSE as an evaluation index is recommended.





BookR developed by Sriram Narayanan
for the Concordia University School of Health
Copyright © 2011-2026
Cookie settings
Concordia University