About • How to Use • Choosing a Model • Wiki
|
RHESSysOutputExplorer provides a template and worked example for quickly exploring and identifying interesting variable relationships in RHESSys output data. This workflow is intended for users after the RHESSys model has been run and calibrated. In addition to the templates needed to use this workflow for a new dataset, there are two completed workflow implementations included for reference:
This repository contains the following directories:
|
-
Fork and clone this repository.
-
Place your RHESSys data in the
data/input/folder.
If this is your first time using this workflow, we suggest viewing the files within notebooks for steps 3-4 for more explanation and an example of a completed analysis.
-
In
notebook_templates, use "data_preparation.Rmd" to prepare data. We suggest aggregating by water year for the best results. -
In
notebook_templates, run "rf_variable_importance.Rmd" or "gb_variable_importance.Rmd". For most use cases, "rf_variable_importance.Rmd" is preferred. -
Refer to
shiny_sagehen/orshiny_chapdepending on your dataset. Open any of the server.R, global.R, or ui.R files in the shiny application’s directory and clicking “Run App” within R Studio. The app can also be run in a chunk at the bottom of the Rmd from step 4.
For most use cases, the random forest method is the recommended machine learning technique of the two provided in this repository. Random forest has proven effective in assessing variable importance in numerous fields including ecology. Additionally, it requires less hyper-parameter tuning than other techniques.
Gradient boosting is included as an alternative because it often attains greater predictive accuracy than random forests. This comes with the cost of a more difficult tuning process and a more computationally intensive training process.
| Random Forest | Gradient Boosting | |
|---|---|---|
| Faster run time | ✔️ | ❌ |
| Less tuning | ✔️ | ❌ |
| Accurate predictive power | ✔️ | ✔️ |
| Better maximum accuracy | ❌️ | ✔️ |
Do you need some help? For help specific to this workflow, check the documentation and guidance within notebooks. For help with RHESSys, check the articles from the wiki.