Recurrent mutation in cancers
- The core idea is to study recurrent SNP mutations in cancers:
- their distribution
- their effects
- suppress
- enhance
- change function?
- why they are recurrent,possibly:
- usage: ??
The goal: to answer these questions:
What’s the distribution of recurrent mutations in cancer / non-cancer genes?
Recurrent mutations are enriched in both oncogenes and tumor suppressors.
NON_CG |
261137 |
2999 |
0.011 |
1.000 |
POG |
5817 |
205 |
0.034 |
2.998 |
TSG |
6891 |
180 |
0.025 |
2.242 |
- What’s the distribution of SNPs in POG/TSG
For recurrent nonsense mutations, where did they cut, what are their effects?
Case study of p53
In short: recurrent nonsense mutations might due to DNA methylation sites.
require(ggplot2)
require(dplyr)
require(ggrepel)
source("../R/seq_potentialns.R")
source("../R/plot_pfamdomain.R")
source("../R/pfam_readpfam.R")
load("../test2/tp53.cdna.rda")
dist<-seq_potentialns(tp53.cdna)
load("../data/tp53test2.rda")
ungroup(minimaf.tp53.sum)->minimaf.tp53.sum
nshotspot<-minimaf.tp53.sum %>%
filter(Variant_Classification=="Nonsense_Mutation") %>%
group_by(Protein_Change) %>%
summarise(recur_allcoh=sum(recur_pos)) %>%
mutate(Prot_pos=as.numeric(gsub("[^0-9]|_.*", "", Protein_Change))) %>%
filter(recur_allcoh>1)
gg <-
ggplot(data = dist %>% filter(dist == 1)) +
## potential site
geom_point(aes(x = x, y = dist),
size = 3,
alpha = 0.2) +
## lollipop
geom_segment(
inherit.aes = FALSE,
data = nshotspot %>% filter(recur_allcoh > 5),
aes(
x = Prot_pos,
y = recur_allcoh,
xend = Prot_pos,
yend = 0
)
) +
geom_point(
inherit.aes = FALSE,
data = nshotspot %>% filter(recur_allcoh > 5),
aes(x = Prot_pos, y = recur_allcoh),
colour = "red",
size = 3
) +
ggrepel::geom_text_repel(
inherit.aes = FALSE,
data = nshotspot %>% filter(recur_allcoh > 19),
aes(x = Prot_pos, y = recur_allcoh, label = Protein_Change),
colour = "blue"
) +
plot_pfamdomain("P04637") +
labs(fill = "Domain", y = "Recurrence", x = "AA position")
plot(gg)
Nonsense mutations reduce protein expression level at mRNA level:
In Breast cancer (BRCA), gene :TP53(tsg), CDH1(tsg), MAP3K1(unclear)
For recurrent silent mutations, will they affect expression?
Will do this with 5
What are the effects of recurrent mutations to neighboring genes
Working on
- plan:
retrieve recurrent mutated gene and its neighbor in network
retrieve expression profile in a cancer cohort for this gene set
Comparison & statistical test
Mutation relations: co-mutated / mutation exclusive
- Other studies: gene level.
LS0tCnRpdGxlOiAiTXV0YXRpb24gaW4gY2FuY2VycyIKb3V0cHV0OgogIGh0bWxfbm90ZWJvb2s6IGRlZmF1bHQKICBodG1sX2RvY3VtZW50OiBkZWZhdWx0Ci0tLQoKYGBge3Igc2V0dXAsIGluY2x1ZGU9RkFMU0V9CmtuaXRyOjpvcHRzX2NodW5rJHNldChjYWNoZT1UUlVFLCBlY2hvID0gRkFMU0UpCgpkb3dubG9hZF9hbmRfb3JfbG9hZCA8LSBmdW5jdGlvbih4KSB7CiAgeSA8LSB4WyEoeCAlaW4lIGluc3RhbGxlZC5wYWNrYWdlcygpWywgIlBhY2thZ2UiXSldCiAgaWYgKGxlbmd0aCh5KSl7CiAgaW5zdGFsbC5wYWNrYWdlcyh5LCBkZXBlbmRlbmNpZXMgPSBUUlVFKQogIH0KICBzYXBwbHkoeCwgcmVxdWlyZSwgY2hhcmFjdGVyLm9ubHkgPSBUUlVFKQp9CgpteV9saWJyYXJpZXMgPC0gYygiZ2dwbG90MiIsICJkcGx5ciIsICJwdXJyciIsImNpdHIiKQpkb3dubG9hZF9hbmRfb3JfbG9hZChteV9saWJyYXJpZXMpCmBgYAoKCgoKIyMgUmVjdXJyZW50IG11dGF0aW9uIGluIGNhbmNlcnMgIAoKLSBUaGUgY29yZSBpZGVhIGlzIHRvIHN0dWR5IHJlY3VycmVudCBTTlAgbXV0YXRpb25zIGluIGNhbmNlcnM6CiAgICAtIHRoZWlyIGRpc3RyaWJ1dGlvbiAgCiAgICAtIHRoZWlyIGVmZmVjdHMgIAogICAgICAgIC0gc3VwcHJlc3MKICAgICAgICAtIGVuaGFuY2UKICAgICAgICAtIGNoYW5nZSBmdW5jdGlvbj8KICAgIC0gd2h5IHRoZXkgYXJlIHJlY3VycmVudCxwb3NzaWJseTogIAogICAgICAgIC0gRE5BIGxldmVsICAKICAgICAgICAtIFByb3RlaW4gbGV2ZWwgIAogICAgLSB1c2FnZTogPz8KCgpUaGUgZ29hbDogdG8gYW5zd2VyIHRoZXNlIHF1ZXN0aW9uczogIAoKMS4gKipXaGF0J3MgdGhlIGRpc3RyaWJ1dGlvbiBvZiByZWN1cnJlbnQgbXV0YXRpb25zIGluIGNhbmNlciAvIG5vbi1jYW5jZXIgZ2VuZXM/KioKCiAgICAgIFJlY3VycmVudCBtdXRhdGlvbnMgYXJlIGVucmljaGVkIGluIGJvdGggb25jb2dlbmVzIGFuZCB0dW1vciBzdXBwcmVzc29ycy4KCmBgYHtyIGVjaG89RkFMU0UsbWVzc2FnZT1GQUxTRX0KcmVxdWlyZSgia25pdHIiKQpsb2FkKCIuLi90ZXN0L211dGF0aW9uX2VucmljaF90YmwucmRhIikKI2FzLmRhdGEuZnJhbWUubWF0cml4KHQodG1wLnRibDIpKS0+dG1wLnRibDIKa2FibGUodG1wLnRibDIsY2FwdGlvbiA9ICJEaXN0cmlidXRpb24gb2YgZGlmZmVyZW50IGdlbmUgdHlwZXMiLGRpZ2l0cyA9IDMpCmBgYCAgCgoyLiAqKldoYXQncyB0aGUgZGlzdHJpYnV0aW9uIG9mIFNOUHMgaW4gUE9HL1RTRyAqKgoKYGBge3Igd2FybmluZz1GQUxTRSxtZXNzYWdlPUZBTFNFLGVjaG89RkFMU0UsfQp7CiAgbG9hZCgiLi4vdGVzdC9leHByLnF1YW50aWxlLnJkYSIpCiAgc291cmNlKCIuLi9SL3Bsb3RtdXRfZXhwci5SIikgI25hbWUgY2hhbmdlZAogIHBsb3RtdXRleHByKCJzbnAiKQoKfQpgYGAgIAoKMy4gKipGb3IgcmVjdXJyZW50IG5vbnNlbnNlIG11dGF0aW9ucywgd2hlcmUgZGlkIHRoZXkgY3V0LCB3aGF0IGFyZSB0aGVpciBlZmZlY3RzPyAqKiAgCgogICAgW0Nhc2Ugc3R1ZHkgb2YgcDUzXShodHRwczovL3RjZ3JpZmZpdGguZ2l0aHViLmlvL3RjZ2FNdXQvMjAxNzAzMDZfcDUzLmh0bWwpCiAgICAKICAgIEluIHNob3J0OiByZWN1cnJlbnQgbm9uc2Vuc2UgbXV0YXRpb25zIG1pZ2h0IGR1ZSB0byBETkEgbWV0aHlsYXRpb24gc2l0ZXMuCiAgICAKCmBgYHtyIGZpZy5oZWlnaHQ9MixmaWcud2lkdGg9NyxjYWNoZT1UUlVFLGZpZy5jYXA9Im5vbnNlbnNlIGhvdHNwb3RzIGluIFRDR0EifQpyZXF1aXJlKGdncGxvdDIpCnJlcXVpcmUoZHBseXIpCnJlcXVpcmUoZ2dyZXBlbCkKCnNvdXJjZSgiLi4vUi9zZXFfcG90ZW50aWFsbnMuUiIpCnNvdXJjZSgiLi4vUi9wbG90X3BmYW1kb21haW4uUiIpCnNvdXJjZSgiLi4vUi9wZmFtX3JlYWRwZmFtLlIiKQpsb2FkKCIuLi90ZXN0Mi90cDUzLmNkbmEucmRhIikKZGlzdDwtc2VxX3BvdGVudGlhbG5zKHRwNTMuY2RuYSkKbG9hZCgiLi4vZGF0YS90cDUzdGVzdDIucmRhIikKCnVuZ3JvdXAobWluaW1hZi50cDUzLnN1bSktPm1pbmltYWYudHA1My5zdW0KCm5zaG90c3BvdDwtbWluaW1hZi50cDUzLnN1bSAlPiUKICBmaWx0ZXIoVmFyaWFudF9DbGFzc2lmaWNhdGlvbj09Ik5vbnNlbnNlX011dGF0aW9uIikgJT4lIAogIGdyb3VwX2J5KFByb3RlaW5fQ2hhbmdlKSAlPiUgCiAgc3VtbWFyaXNlKHJlY3VyX2FsbGNvaD1zdW0ocmVjdXJfcG9zKSkgJT4lIAogIG11dGF0ZShQcm90X3Bvcz1hcy5udW1lcmljKGdzdWIoIlteMC05XXxfLioiLCAiIiwgUHJvdGVpbl9DaGFuZ2UpKSkgJT4lCiAgZmlsdGVyKHJlY3VyX2FsbGNvaD4xKQoKZ2cgPC0KICBnZ3Bsb3QoZGF0YSA9IGRpc3QgJT4lIGZpbHRlcihkaXN0ID09IDEpKSArCiAgIyMgcG90ZW50aWFsIHNpdGUKICBnZW9tX3BvaW50KGFlcyh4ID0geCwgeSA9IGRpc3QpLAogIHNpemUgPSAzLAogIGFscGhhID0gMC4yKSArCiAgIyMgbG9sbGlwb3AgCiAgZ2VvbV9zZWdtZW50KAogIGluaGVyaXQuYWVzID0gRkFMU0UsCiAgZGF0YSA9IG5zaG90c3BvdCAlPiUgZmlsdGVyKHJlY3VyX2FsbGNvaCA+IDUpLAogIGFlcygKICB4ID0gUHJvdF9wb3MsCiAgeSA9IHJlY3VyX2FsbGNvaCwKICB4ZW5kID0gUHJvdF9wb3MsCiAgeWVuZCA9IDAKICApCiAgKSArCgogIGdlb21fcG9pbnQoCiAgaW5oZXJpdC5hZXMgPSBGQUxTRSwKICBkYXRhID0gbnNob3RzcG90ICU+JSBmaWx0ZXIocmVjdXJfYWxsY29oID4gNSksCiAgYWVzKHggPSBQcm90X3BvcywgeSA9IHJlY3VyX2FsbGNvaCksCiAgY29sb3VyID0gInJlZCIsCiAgc2l6ZSA9IDMKICApICsKICAKICBnZ3JlcGVsOjpnZW9tX3RleHRfcmVwZWwoCiAgaW5oZXJpdC5hZXMgPSBGQUxTRSwKICBkYXRhID0gbnNob3RzcG90ICU+JSBmaWx0ZXIocmVjdXJfYWxsY29oID4gMTkpLAogIGFlcyh4ID0gUHJvdF9wb3MsIHkgPSByZWN1cl9hbGxjb2gsIGxhYmVsID0gUHJvdGVpbl9DaGFuZ2UpLAogIGNvbG91ciA9ICJibHVlIgogIAogICkgKwogIHBsb3RfcGZhbWRvbWFpbigiUDA0NjM3IikgKwogIGxhYnMoZmlsbCA9ICJEb21haW4iLCB5ID0gIlJlY3VycmVuY2UiLCB4ID0gIkFBIHBvc2l0aW9uIikKICAKICBwbG90KGdnKQoKYGBgICAKCgpOb25zZW5zZSBtdXRhdGlvbnMgcmVkdWNlIHByb3RlaW4gZXhwcmVzc2lvbiBsZXZlbCBhdCBtUk5BIGxldmVsOgoKSW4gQnJlYXN0IGNhbmNlciAoQlJDQSksIGdlbmUgOlRQNTModHNnKSwgIENESDEodHNnKSwgTUFQM0sxKHVuY2xlYXIpCgohW10oL2hvbWUvdGMvRG93bmxvYWRzL3A1M19tcm5hX2NuYV9leHByLnBuZykKICAgICFbXSgvaG9tZS90Yy9Eb3dubG9hZHMvY2RoMV9tcm5hX2NuYS5wbmcpCiAgICAhW10oL2hvbWUvdGMvRG93bmxvYWRzL21hcDNrMV9tcm5hX2NuYS5wbmcpCiAgICAKICAgIAo0LiAqKkZvciByZWN1cnJlbnQgc2lsZW50IG11dGF0aW9ucywgd2lsbCB0aGV5IGFmZmVjdCBleHByZXNzaW9uPyAqKiAKCiAgICAqV2lsbCBkbyB0aGlzIHdpdGggNSoKCgo1LiAqKldoYXQgYXJlIHRoZSBlZmZlY3RzIG9mIHJlY3VycmVudCBtdXRhdGlvbnMgdG8gbmVpZ2hib3JpbmcgZ2VuZXMgKioKCiAgICAqV29ya2luZyBvbiogIAogICAgLSBwbGFuOiAKICAgICAgICBhLiByZXRyaWV2ZSByZWN1cnJlbnQgbXV0YXRlZCBnZW5lIGFuZCBpdHMgbmVpZ2hib3IgaW4gbmV0d29yawogICAgICAgIAogICAgICAgIGIuIHJldHJpZXZlIGV4cHJlc3Npb24gcHJvZmlsZSBpbiBhIGNhbmNlciBjb2hvcnQgZm9yIHRoaXMgZ2VuZSBzZXQKICAgICAgICAgICAgCiAgICAgICAgICAgIC0gcGF0aWVudCBsZXZlbAogICAgICAgIAogICAgICAgIGMuIENvbXBhcmlzb24gJiBzdGF0aXN0aWNhbCB0ZXN0CiAgICAgICAgCiAgCiAgICAKICAgICFbXSgvaG9tZS90Yy9Eb3dubG9hZHMvbmV0d29yayAoMikucG5nKSAgCiAgICAKICAgIAo2LiBNdXRhdGlvbiByZWxhdGlvbnM6IGNvLW11dGF0ZWQgLyBtdXRhdGlvbiBleGNsdXNpdmUKCiAgICAtIE90aGVyIHN0dWRpZXM6IGdlbmUgbGV2ZWwuIAoKIyMgQXZhaWxhYmxlIGRhdGEKCi0gVENHQSBEYXRhOiBDTkEsIE11dGF0aW9uLCBFeHByZXNzaW9uKHByb3RlaW4vbVJOQSksIGNsaW5pY2FsIG91dHB1dCAgCi0gQ09TTUlDIGRhdGEgOiBNYWlubHkgbXV0YXRpb25zLiAgCgotIEFkZGl0aW9uYWwKICAgIC0gW3BhdGh3YXkgY29tbW9uc10oaHR0cDovL3BhdGh3YXljb21tb25zLm9yZy8pIDogdG8gaWRlbnRpZnkgbmVpZ2hib3JzICAKICAgIC0gW3BmYW1dKHBmYW0ueGZhbS5vcmcpIDogcHJvdGVpbiBkb21haW4gaW5mbyAgCiAgICAtIGNhbmNlciBnZW5lOiBbQ2FuY2VyIGdlbmUgY2Vuc3VzXShodHRwOi8vY2FuY2VyLnNhbmdlci5hYy51ay9jZW5zdXMpIGFuZCB1bmlwcm90IFBPRy9UU0cga2V5d29yZAogICAgCg==