Releases: krshedd/GCL-R-Scripts
End of an Era
Notable Changes
This is the final release for version 1 of GCL-R-Scripts! After this release we are going to start developing version 2 based on a new format for .gcl objects to integrate with R 4.0.0 and the tidyverse. A static version of this release will be parked on a separate branch v1.12.0-static, for posterity.
Below are all notable changes to the body of GCL-R-Scripts on the master branch since A Cat in Your Lap v1.11.0 was released on April 1, 2020.
See the README for this repository if you have questions. Note the code for Functions.GCL has been updated to pull all functions in this repository, not just the ones with a .GCL ending.
New Scripts
No new scripts!
Function Updates & Bug Fixes
FindAlternateSpecies- Bugfix, added
drop=FALSEto prevent.gclarrays from collapsing to matrices if only 1 fish in the silly
- Bugfix, added
read_project_genotypes- Bugfix, filter for
project_nameif necessary, previously this function would pull all genotypes for fish involved in a project, including genotypes from older projects
- Bugfix, filter for
save_sillys- Bugfix, report missing sillys correctly
treeColor- Bugfix, colors would not plot, now they will if colors are a character vector and found in
colors()
- Bugfix, colors would not plot, now they will if colors are a character vector and found in
HoFisFstTable- Update, added argument for path to
fstatfileif already exists, rather than create a new one
- Update, added argument for path to
QC- Update, replaced
lattice::levelplotwithplotly::ggplotlyfor an interactive view of genotyping success rate
- Update, replaced
FailureRate- Update, added
plotlyfor interactive view of genotyping success rate
- Update, added
Tips
To avoid conflicts among packages that have the same function name, you can be more explicit in your function calls by adding package_name:: before the function name (i.e. dplyr::rename vs. rename, to avoid conflicts with reshape::rename). This makes your code more stable and less susceptible to package conflicts depending what the user has loaded in their workspace.
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
A Cat in Your Lap
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since Midsummer's Eve v1.10.0 was released on June 20, 2019.
See the README for this repository if you have questions. Note the code for Functions.GCL has been updated to pull all functions in this repository, not just the ones with a .GCL ending.
New Scripts
QC_off_chip- Built off of normal QC script to deal with very rare case of off-chip QC re-runs
Function Updates & Bug Fixes
stratified_estimator_rubias- Added
cvargument to integrate harvest estimate error in to MSA estimates - Bugfix
ifstatements, only stratify overmixvec
- Added
CustomCombineBAYESOutput- Bugfix to catch auto-install for packages
QC- Added
project_typeargument to specify explicitly whether it is a "Biomark", "uSat", or "GT-seq" project, rather than trying to infer based on the number of alleles
- Added
load_objects- Added
patternargument to load only specific objects inpath
- Added
StratifiedEstimator- Added section to equalize mixtures run for different numbers of iterations by subsampling to the minimum number of iterations in
mixvec
- Added section to equalize mixtures run for different numbers of iterations by subsampling to the minimum number of iterations in
ReadUSatQC- Bugfix to remove leading "0"s on
fish_id, make character, not numeric
- Bugfix to remove leading "0"s on
GenotypeReport- Bugfix to read data from LOKI, filter for
project_name, then spread - Fix spelling error in file name
- Bugfix to read data from LOKI, filter for
Tips
To avoid conflicts among packages that have the same function name, you can be more explicit in your function calls by adding package_name:: before the function name (i.e. dplyr::rename vs. rename, to avoid conflicts with reshape::rename). This makes your code more stable and less susceptible to package conflicts depending what the user has loaded in their workspace.
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
Midsummer's Eve
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since Java yo mama v1.9.0 was released on April 15, 2019.
See the README for this repository if you have questions. Note the code for Functions.GCL has been updated to pull all functions in this repository, not just the ones with a .GCL ending.
New Scripts
write_genepop.GCL- This tidy function effectively replaces
gcl2Genepop.GCL - Works for SNPs and uSATs
- This tidy function effectively replaces
Function Updates & Bug Fixes
ReadGenepopDis.GCL- Update include optional
lociargument, must be same aslociused to create Genepop input file, default isNULL - Update to include optional
sillyvecargument, must be same assillyvecused to create Genepop input file, default isNULL, this prevents population headers being labeled asSillySourceof last fish in population - Update to tidyverse, output is now a tibble
- Update include optional
ReadGTseqQC.GCL- Treat "0/0" genotype call as
NA
- Treat "0/0" genotype call as
StratifiedEstimator.GCL- Allows
mixvecargument to contain the same mixture more than once, this is useful if you have to have the same mixture represent multiple strata (i.e. represent unsampled harvest with another strata)
- Allows
GenotypeReport.GCL- Bugfix to allow pulling genotypes by
sillyvec(i.e.project_name = NULL)
- Bugfix to allow pulling genotypes by
run_rubias_mixture- Update to pass through new arguments in
rubiasv0.3.0
- Update to pass through new arguments in
Tips
To avoid conflicts among packages that have the same function name, you can be more explicit in your function calls by adding package_name:: before the function name (i.e. dplyr::rename vs. rename, to avoid conflicts with reshape::rename). This makes your code more stable and less susceptible to package conflicts depending what the user has loaded in their workspace.
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
Java yo mama
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since New year new updates v1.8.0 was released on January 31, 2019. In order to get anything out of LOKI, you need to do a fresh pull from the master branch.
- Crucial update to move to our R functions that interface with
LOKIto Java8- There is a new java driver
ojdbc8.jarlocated here: "V:/Analysis/R files/OJDBC_Jar/ojdbc8.jar" - You need to copy that file to your machine at either one of these locations
- "C:/Program Files/R/RequiredLibraries/ojdbc8.jar", or
- "~/R/ojdbc8.jar"
- After that, you will need to restart RStudio (might as well just restart your computer) or you will get an error when trying to interface with LOKI
- There is a new java driver
- Added a stratified estimator for
rubias- Very similar to
StratifiedEstimator.GCLfor BAYES
- Very similar to
See the README for this repository if you have questions.
New Scripts
stratified_estimator_rubias- New script designed to calculate stratified estimates from
rubiasoutput using a vector of harvest/abundance (similar toStratifiedEstimator.GCLfor BAYES). - Works both with bias correction and without (i.e.
method = "PB") - Works for the same reporting groups or can "roll-up" groups
- See function script for details and a reproducible example
- New script designed to calculate stratified estimates from
LOKI_URL.GCL- This function builds the URL to connect to LOKI and is called by all GCL functions that connect to LOKI via
RJDBC::dbConnect - Having this one function allows us to update the URL for all dependent functions
- This function builds the URL to connect to LOKI and is called by all GCL functions that connect to LOKI via
Function Updates & Bug Fixes
- The following functions were all updated to set the driver to
ojdbc8.jarGentoypeReportCreateLocusControlLOKI2RLOKI2R_GAPSReadProjectLOKI2Rread_project_genotypes
LOKI2R_GAPS- Can now pull genotypes from a collection with only 1 fish (drop = FALSE)
gcl2Genepop- Can now pull genotypes from a collection with only 1 fish (drop = FALSE)
custom_combine_rubias_output- Cosmetic changes only to update reproducible example
GenotypeReport.GCL- Added a filter argument to make sure that this only pulls fish from
project_name
- Added a filter argument to make sure that this only pulls fish from
Tips
To avoid conflicts among packages that have the same function name, you can be more explicit in your function calls by adding package_name:: before the function name (i.e. dplyr::rename vs. rename, to avoid conflicts with reshape::rename). This makes your code more stable and less susceptible to package conflicts depending what the user has loaded in their workspace.
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
New year new updates
Notable Changes
This release is long overdue. Below are all notable changes to the body of GCL-R-Scripts on the master branch since Ready for rubias v1.7.0 was released on August 2, 2018. I highly recommend that everyone do a fresh pull from the master branch.
- Many of the updates move functions to the tidyverse.
- There is critically important bugfix to
create_rubias_baselineandcreate_rubias_mixture. - The QC script and its dependent scripts have all been updated. Please use these updated versions for all future QC analyses.
See the README for this repository if you have questions.
New Scripts
GenotypeReport.GCL- New script designed for Heather so she can pull genotypes from LOKI. Previously this was done via OceanAK, however, it times out on large GTseq projects. This function pulls just the genotypes and creates a .csv file so that Heather can visualize failure rates.
read_project_genotypes.GCL- New script designed for QC. This pulls all project genotypes based solely on
project_name(i.e. "P014"). No longer needprojectIDormarkersuite. Does not requireCreateLocusControlto be run first. This function was written based off ofGenotypeReport.GCLbecause for unknown reasonsReadProjectLOKI2Rstarted becoming unreasonably slow. This script hits a less complicated view of LOKI so it is much faster. Note: this function is slower thanGenotypeReport.GCLand can still take up to 20 minutes to run for GTseq projects, because the view in LOKI has to pair up fish genotyped in the project with the correctplateIDthat was run in the project (tissue table).
- New script designed for QC. This pulls all project genotypes based solely on
ReadUSatQC.GCL- Removed code from QC script and made as a stand alone function.
ReadGTseqQC.GCL- Reads in genotype data from GTseq .csv files and creates
.gclobjects.
- Reads in genotype data from GTseq .csv files and creates
gcl2tibble.GCL- This function converts .gcl objects into a very simple tall, tidy tibble. Only four columns: sillysource, silly, locus, genotype (there is an argument for how you want alleles separated, defaults to "/"). This function is handy if you need to share or export genotypes.
Function Updates & Bug Fixes
QC- Update to tidyverse.
- Now uses
read_project_genotypesinstead ofReadProjectLOKI2R. - New QC genotype import functions
ReadUSatQC.GCLandReadGTseqQC.GCL. - Conflicts now come in as tibbles.
- Alternate species object updates (not applicable in QC).
- Duplicate check code re-written and simplified for tidyverse.
- QC summary tables condensced and simplified for tidyverse.
- Switch from
xlsxpackage towritexlpackage for writing output. This package is much cleaner, however, it can not append the QC summary tables to the existing QC excel file, it creates a separate excel file with "R Script" in the name.
FailureRate- Update to tidyverse.
- Automatically output heatmaps of 1) failure rate by plate and locus, and 2) failure rate by silly and locus.
create_rubias_baseline- Can now handle populations with only 1 fish.
- Major bugfix, replace "0" genotypes from
scoresobject withNA. Otherwise, the "0" was recognized as a unique allele.
create_rubias_mixture- Major bugfix, replace "0" genotypes from
scoresobject withNA. Otherwise, the "0" was recognized as a unique allele.
- Major bugfix, replace "0" genotypes from
CombineConflictsWithPlateID- Update to tidyverse, catches mitochondrial conflicts appropriately
CheckDupWithinSilly- Duplicate summary output is now a tibble (previously it was a dataframe). The nice thing about it now being a tibble is that you still get an object with nrow = 0, rather than getting a NULL when no duplicates are found. Much easier for summarizing across multiple sillys (bind_rows).
- Bugfix "NO" = "No"
LOKI2R_GAPS- Bugfix, was looking for "ojdbc6.jar" in the wrong location.
BayesOutputCopyPaste- Bugfix to deal with
sillyvecas a named character vector.
- Bugfix to deal with
custom_combine_rubias_output- Can now roll up reporting groups if using
method = "PB". This is good incentive to always runrubiaswith your finest scale reporting groups and then roll up if necessary. - Bugfix, will not allow negative values with
method = "PB".
- Can now roll up reporting groups if using
RemoveIndMissLoci- Update to only calculate proportion missing based on how many loci were genotyped per collection. This can be an issue if you have more loci in LocusControl than were actually genotyped for all collections.
gcl2Nexus.GCL- Added the ability to create file with actual allele calls (letters) rather than numbers for allele 1 and 2. Update also catches "0" and "?" genotypes to indicate as missing data.
Tips
To avoid conflicts among packages that have the same function name, you can be more explicit in your function calls by adding package_name:: before the function name (i.e. dplyr::rename vs. rename, to avoid conflicts with reshape::rename). This makes your code more stable and less susceptible to package conflicts depending what the user has loaded in their workspace.
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
Ready for rubias
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since Pest Control v1.6.1 was released on February 6, 2018. Many of these scripts are to streamline use of Eric Anderson's rubias with our standard GCL objects and workflow.
Note that the README for this repository has been updated to explain:
- Initial cloning of the repository
- Normal usage
- Releases
- Branches
- Issues
- Remotely sourcing individual scripts (i.e. from Remote Desktop)
- QC
New Scripts
create_rubias_baseline- This function creates the baseline (aka "reference") dataframe needed for
rubiasfrom.gclobjects. - It also saves the baseline as a .csv file with an assumed file structure.
- This function creates the baseline (aka "reference") dataframe needed for
create_rubias_mixture- This function creates the mixture dataframe needed for
rubiasfrom.gclobjects. - It also saves each mixture in
sillyvecas its own .csv file with an assumed file structure.
- This function creates the mixture dataframe needed for
run_rubias_mixture- This function is a wrapper for
rubias::infer_mixture. - It automatically sets the seed for the MCMC chain and comes with several GCL default settings (i.e. number of reps, burn_in, etc.).
- It also saves the different types of output into wide .csv files.
- This function is a wrapper for
custom_combine_rubias_output- This function computes summary statistics from
rubiasoutput into a tibble. It can take either the output tibble or can re-create the output tibble from the output files that are saved byrun_rubias_mixture. - It also creates a traceplot for all mixtures and reporting groups.
- This function computes summary statistics from
save_objects- This function is a wrapper for
dput. It takes a character vector of objects and saves them asobject_name.txtinpath.
- This function is a wrapper for
load_objects- This function is a wrapper for
dget. It will load objects previouslydputgiving them the same name.
- This function is a wrapper for
save_sillys- This function is a wrapper for
dput, but can take a sillyvec (i.e. doesn't need the.gcl).
- This function is a wrapper for
load_sillys- This function is a wrapper for
dget, but can take a sillyvec (i.e. doesn't need the.gcl).
- This function is a wrapper for
Function Updates & Bug Fixes
QC- Update to filter the Concordance Report by only "Conflicts", just in case lab staff accidently dump out the full Concordance Report on accident.
PoolCollections- Bugfix to give the correct
attributesby matching withFK_FISH_ID.
- Bugfix to give the correct
Tips
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
Pest Control
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since January Blues v1.6.0 was released on January 10, 2018.
Note that the README for this repository has been updated to explain:
- Initial cloning of the repository
- Normal usage
- Releases
- Branches
- Issues
- Remotely sourcing individual scripts (i.e. from Remote Desktop)
- QC
New Scripts
No new scripts in this release.
Function Updates & Bug Fixes
gcl2Genepop- Bugfix so that this works as intended for uSats (replace no-calls with
000).
- Bugfix so that this works as intended for uSats (replace no-calls with
ReadGenepopHWE- Bugfix to
strsplitfor "Pop : " instead of " Pop : ", due to updates inGenepop 4.7. - Updated to properly convert summary results from Exact test from character to numeric, due to updates in
Genepop 4.7. - Added an optional logical argument
summaryAsNumeric = TRUEthat defaults to converting Exact test summary results that show up as character (i.e. "High. sign." or " < 0.155") to numeric (0.000 and 0.155, respectively). If you set this flag toFALSE, then the summary matrix will becharacterrather thannumeric.
- Bugfix to
Anderson_etal- Updated to provide a flat by population confusion matrix along with by group confusion matrix (which is flat with respect to group).
- Updated to provide a population to group confusion matrix (default is flat by group).
- Note this function does not work with combined markers at the moment.
Tips
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
January Blues
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since Vernal Equinox v1.5.0 was released on April 4, 2017.
Note that the README for this repository has been updated to explain:
- Initial cloning of the repository
- Normal usage
- Releases
- Branches
- Issues
- Remotely sourcing individual scripts (i.e. from Remote Desktop)
- QC
New Scripts
ConfusionMatricesLikeProfile- This function creates confusion matrices from the output of
LeaveOneOutLikeProfile. - Output matches that of the confusion matrices built from
LeaveOneOutDist.
- This function creates confusion matrices from the output of
Anderson_etal- This function very quickly creates a group confusion matrix based on leave one out statistics from Anderson et al.
- Overall mean group relative likelihood matrix.
- Dataframe of individual likelihoods to each population, with population and group identifiers.
- This function very quickly creates a group confusion matrix based on leave one out statistics from Anderson et al.
require- This function is a wrapper for requiring and installing (if necessary) a vector of packages.
Function Updates & Bug Fixes
StratifiedEstimator- Updated to install required packages if necessary.
- Added a new logical argument
PosteriorOutputin order to control whether the whole posterior is included, default isFALSE.
LOKI2R_GAPS- Updated with code from
LOKI2RandReadLOKI_GAPSto quickly and correctly read in GAPS uSat data. - Updated to give a message when a silly has no fish.
- Updated with code from
CombineConflictsWithPlateID- Removed package
xlsx, output is now written as a .csv in order to remove dependencies (xlsxhas Java issues occasionally).
- Removed package
genetic_msa- Set minimum value for Dirichlet to avoid hard zero's and
NaNs.
- Set minimum value for Dirichlet to avoid hard zero's and
ReadGenepopHWE- Bugfix to
grepfor "Pop : " instead of " Pop : ", due to updates inGenepop 4.7. - Added an optional arguement
sillyvecthat will name populations automatically rather than grabbing names from the genepop .P.txt file. This solves the issue of having pooled silly names cut off due to character limits.
- Bugfix to
BayesOutputCopyPaste- Inserted a
stopstatement to make sure files aren't accidentally deleted if a typo insillyvecargument causes an issue with moving files.
- Inserted a
LeaveOneOutDist- Updated to remove loci that are fixed for one allele.
LOKI2R- Bugfix to properly catch sillys with no fish genotyped.
CreateLocusControl- Updated with new
locusnamesargument so that LocusControl can be built with eithermarkersuiteorlocusnames. This is really useful if you don't remember whatmarkersuiteyou are using or an appropriatemarkersuitehasn't been built in LOKI. However, all of the loci inlocusnamesdo have to be built in LOKI.locusnamesis a character vector of marker names.
- Updated with new
Tips
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
Vernal Equinox
Important User Change Required
.gitignore was added and updated in Fall Update v1.3.0 so that Git no longer tracks .RProj. This will prevent headaches in the long run, but if you have not done it already you will need to delete .gitignore from your local machine, otherwise you will not be able to do a fresh pull.
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since QC For Everyone v1.4.0 was released on September 8, 2016.
Note that the README for this repository has been updated to explain:
- Initial cloning of the repository
- Normal usage
- Releases
- Branches
- Issues
- Remotely sourcing individual scripts (i.e. from Remote Desktop)
- QC
New Scripts
genetic_msa- The intent of this new function is to run the full Pella Masuda MSA model in R (i.e.
BAYES, but inR). - This function requires mixture and baseline files written by
write_data4msa. - This function will multi-core over the number of chains.
- The main output that is returned is a dataframe with summary statistics for each reporting group.
- There are two types of output file for each chain:
- The
ifiles indicate which population (number) each individual was assigned to at each iteration. - The
Pfiles indicate what proportion of the mixture is assigned to each population for each iteration.
- The
- The intent of this new function is to run the full Pella Masuda MSA model in R (i.e.
write_data4msa- This function creates and writes the baseline and mixture files (.txt) to be used with
genetic_msa. - This function requires
.gclobjects for all baseline and mixture sillys to be loaded in your workspace.
- This function creates and writes the baseline and mixture files (.txt) to be used with
Function Updates & Bug Fixes
QC- Updated to note the number of Alternate and Duplicate fish within silly without actually removing them. If removed, then might miss important QC conflicts.
- Updated Conflict Report section to accomodate new conflict.csv that outputs " " for mitochondrial conflicts as opposed to "0". Also added some extra code to help catch sequential IDs in within silly duplicate check that may be indicative of row duplication.
- Added
simplfy = FALSEto dupcheck summary output in order to force list object. - Bugfix to prevent issues when
nAltBySillyis not present.
gcl2Genepop.GCL- Updated to add a logical switch
usatto print out the scores rather than counts when set toTRUE. Default setting isFALSE.
- Updated to add a logical switch
Functions Removed
LynnCanal_agmamagma2012and13
Tips
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again
QC For Everyone
Important User Change Required
.gitignore was added and updated in Fall Update v1.3.0 so that Git no longer tracks .RProj. This will prevent headaches in the long run, but if you have not done it already you will need to delete .gitignore from your local machine, otherwise you will not be able to do a fresh pull.
Notable Changes
Below are all notable changes to the body of GCL-R-Scripts on the master branch since Fall Update v1.3.0 was released on August 19, 2016.
Note that the README for this repository has been updated to explain:
- Initial cloning of the repository
- Normal usage
- Releases
- Branches
- Issues
- Remotely sourcing individual scripts (i.e. from Remote Desktop)
- QC
New Scripts
QC- This new script is designed handle both microsatellite and SNP QC.
- This script is intended to replace any existing QC R scripts and perform the following operations:
- Read in Project genotypes from
LOKI - Read in QC genotypes from Biomark.csv files (SNPs) or GeneMapper.csv files (uSATs)
- Compute Project failure rates
- Report conflict rates
- Check for missing loci, duplicate, or alternate species individuals in Project fish
- Check for missing loci individuals in QC fish
- Perform duplicate check between QC and Project fish for individuals with a specified conflict rate
- Create summary tables
- Write summary tables to the Project Summary
.xslxfile
- Read in Project genotypes from
- This is a new script, not a new function, it is a separate script. The idea is that when you need to perform a QC, you will do a fresh
pullfrommasterin order to have the latest version ofQC.Rin your remote repository (i.e.C:\Users\krshedd\Documents\R\GCL-R-Scripts). You will then copy/paste this file to the QC project folder (i.e.V:\Lab\Genotyping\SNP Projects\Sockeye\Project S161 Chignik Inseason 2016\QC), modify the Arguments, and run the script from there. You should not have to modify anything past GO!, but if the function bombs on you, you will at least know where you are. - Written and tested by Jim Jasper and Kyle Shedd
Function Updates & Bug Fixes
CombineConflictsWithPlateID.GCL- Updated documentation for
filesargument to specify a vector of .csv ConcordanceReports.
- Updated documentation for
SampSizeByLocus.GCL- Updated to add
drop = FALSEtocounts.
- Updated to add
Tips
If you run into the following error when running LOKI2R.GCL
Error in .jcall(rp, "I", "fetch", stride, block) :
java.lang.OutOfMemoryError: GC overhead limit exceeded
- Save your work
- Re-start RStudio
- Before doing anything else, run
options(java.parameters = "-Xmx100g")to increase your heap space - Try to read data again