Skip to content

suggestion for numeric fields containing commas #4

@jayoung

Description

@jayoung

Dear Eric and colleagues,

Thanks again for your really handy program - the gene name thing used to drive me crazy and now it doesn't: so nice! Here's a suggestion: this would be useful for me if it's easy to do and makes sense for others too.

I'm using R to make a table of CNV regions with one row per region. For each region I have various columns that describe the genes each CNV overlaps. If a CNV overlaps >1 gene, I concatenate the multiple gene information in each column together using commas (rather than having duplicate rows one per gene). I then use write.table to export tab-delimited data and escape_excel.pl on the output, usually including quotes.

Escape_excel does very nicely on the gene names (from yeast in this case) but I'm not seeing the numeric columns as I'd like to in the ideal world.

Here's an example - some of my fields (start coords for two genes)
look like this in the R output: "37303,38690"
look like this in the text file after escape_excel.pl: 37303,38690 (it removed the quotes)
but after Excel import it shows 3730338690 (it removed the comma).

I should probably just use another character for the concatenation, but would it be reasonable to try to handle situations like this in escape_excel?

thanks for thinking about it,

Janet Young

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions