Subsampling guidance #4

@Althalis

Dear Sylvain,
first of all, thank you again for your time.
I managed to run run_AMOPLS, but I had to use undersampling because my dataset is not balanced.
It only worked with subsampling = 1 (and I don't know whether that is a 'correct' thing to do).

I was wondering where I can find guidance on the nb_perm, subsampling and parallel parameters.

Below are the details of a run that failed (using the values from your examples). My dataset is particularly big, so maybe that is the reason.

[One more note that may be useful for other users: I had to convert my data.frames into data.tables for things to run. Just FYI.]

Thank you,
best
Julie

dim(dm_jc)
[1] 44 6402
dim(smd_jc)
[1] 44 3
result_unbalanced <- run_AMOPLS(datamatrix = dm_jc,
                                samplemetadata = smd_jc,
                                factor_names = c("Group", "Isolation_method"),
                                nb_perm = 100,
                                subsampling = 10,
                                parallel = 3)
Data are unbalanced in:
Group
Isolation_method
Group x Isolation_method
Data are unbalanced, running stratified subsampling.
Run sub-sampling: 1
Error in svd(data) : infinite or missing values in 'x'
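
For anyone hitting the same svd error, a check along these lines might help narrow it down. It is only a sketch in base R. It assumes the error comes from NA, NaN or Inf values in the data matrix, or from columns that are constant and turn into NaN once scaled. That second cause is a guess and is not confirmed from the package source. check_matrix is a hypothetical helper, not an AMOPLS function, and the toy data are made up.

```r
# Hypothetical helper (not part of the AMOPLS package): summarise values that
# would make svd() fail, plus columns with no usable variance.
check_matrix <- function(x) {
  m <- as.matrix(x)
  stopifnot(is.numeric(m))
  # Standard deviation of the finite values in each column; NA when a column
  # has fewer than two finite values.
  col_sd <- apply(m, 2, function(v) sd(v[is.finite(v)]))
  list(
    n_missing    = sum(is.na(m)),        # counts NA and NaN
    n_infinite   = sum(is.infinite(m)),  # counts Inf and -Inf
    constant_col = which(is.na(col_sd) | col_sd == 0)
  )
}

# Toy data standing in for dm_jc (illustrative values only)
toy <- data.frame(a = c(1, 2, NA, 4),
                  b = c(5, 5, 5, 5),
                  c = c(1, Inf, 3, 4))
res <- check_matrix(toy)
str(res)
```

The same call could be made on dm_jc as a whole and then on the rows belonging to each Group x Isolation_method combination, since a column that varies over all 44 samples may still be constant within a small subsample.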