Skip to content

Information Content/goSim could be calculated for more GO terms? #33

@dorrenasun

Description

@dorrenasun

Dear authors,

I am trying to calculate semantic similarity between some GO terms based on the information content methods.

# GO.db retrieved by 2020-09-10, Bioconductor version 3.12
atGO<-godata('org.At.tair.db', keytype="TAIR",ont="BP")
goSim("GO:0120254", "GO:0120255", semData=atGO, measure="Jiang") # NA

The value returns NA as these GO terms are not directly annotated for Arabidopsis. However, their descendant terms were actually annotated to some genes in the database (GO.db retrieved by 2020-09-10, Bioconductor version 3.12):

length(GOBPOFFSPRING[["GO:0120254"]] %in% keys(org.At.tair.db,"GO"))  # 196
length(GOBPOFFSPRING[["GO:0120255"]] %in% keys(org.At.tair.db,"GO"))  # 97

and thus their IC values were actually feasible for calculation. Do you think it is possible to include such GO terms in the current IC & goSim() calculation?

Thank you very much.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions