WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00001630 Gene Name  gly-5
Sequence Name  ? Y39E4B.12 Brief Description  gly-5 encodes a UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase; GLY-5 isoforms exhibit UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase activity in vitro.
Organism  Caenorhabditis elegans Automated Description  Enables polypeptide N-acetylgalactosaminyltransferase activity. Involved in protein O-linked glycosylation via threonine. Predicted to be located in Golgi apparatus. Human ortholog(s) of this gene implicated in colorectal cancer. Is an ortholog of human GALNT12 (polypeptide N-acetylgalactosaminyltransferase 12); GALNT4 (polypeptide N-acetylgalactosaminyltransferase 4); and POC1B-GALNT4 (POC1B-GALNT4 readthrough).
Biotype  SO:0001217 Genetic Position  III :21.0711 ±0.032355
Length (nt)  ? 17514
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00001630

Genomics

6 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y39E4B.12c.1 Y39E4B.12c.1 2406   III: 13187390-13204897
Transcript:Y39E4B.12a.1 Y39E4B.12a.1 2408   III: 13187392-13204895
Transcript:Y39E4B.12b.1 Y39E4B.12b.1 2401   III: 13187392-13204897
Transcript:Y39E4B.12a.2 Y39E4B.12a.2 2362   III: 13188102-13204900
Transcript:Y39E4B.12c.2 Y39E4B.12c.2 2358   III: 13188103-13204903
Transcript:Y39E4B.12b.2 Y39E4B.12b.2 2348   III: 13188108-13204901
 

Other

3 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y39E4B.12a Y39E4B.12a 1881   III: 13188136-13188243
CDS:Y39E4B.12c Y39E4B.12c 1875   III: 13188136-13188243
CDS:Y39E4B.12b Y39E4B.12b 1872   III: 13188136-13188243

4 RNAi Result

WormBase ID
WBRNAi00056236
WBRNAi00056237
WBRNAi00027752
WBRNAi00002699

279 Allele

Public Name
otn9919
gk963887
gk963904
gk963552
otn11209
gk189982
gk189983
gk189980
gk189981
gk190006
gk190007
gk190008
gk190009
gk190010
gk189986
gk189984
gk189985
gk189989
gk189990
gk189987
gk189988
gk189993
gk189994
gk189991
gk189992
gk189995
gk189996
gk189997
gk190000
gk190001

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00001630 13187390 13204903 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_13204904..13204981   78 III: 13204904-13204981 Caenorhabditis elegans

127 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L1-larva_expressed
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Transcripts that showed significantly changed expression in 6-day post-L4 adult hermaphrodite comparing to in 1-day post L4 adult hermaphrodite animals. Sleuth WBPaper00051558:aging_regulated
  Transcripts that showed significantly increased expression in sftb-1(cer6) deletion homozygous comparing to to in N2 animals at L4 larva stage. DESeq2, fold change > 2 WBPaper00058725:sftb-1(cer6)_downregulated
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
  Transcripts that showed significantly increased expression in nuo-6(qm200) comparing to in N2. Differential gene expression analysis was performed using the quasi-likeli-hood framework in edgeR package v. 3.20.1 in R v. 3.4.1. WBPaper00053810:nuo-6(qm200)_upregulated
  Transcripts that showed significantly altered expression at URX, AQR, and PQR neurons in camt-1(ok515) animals comparing to in wild type AX1888-1 strain. RNA-seq data were mapped using PRAGUI - a Python 3-based pipeline for RNA-seq data analysis. WBPaper00061902:camt-1(ok515)_regulated_URX-AQR-PQR
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L2-larva_expressed
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_downregulated
  Transcripts that showed significantly decreased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_downregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed

5 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1030977 Tiling arrays expression graphs  
    Expr2012145 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1017972 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1159726 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2030381 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

14 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables

1 Homologues

Type
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00001630 13187390 13204903 1

14 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
17514

1 Sequence Ontology Term

Identifier Name Description
gene  

4 Strains

WormBase ID
WBStrain00036231
WBStrain00036184
WBStrain00037734
WBStrain00037771

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIII_13173107..13187389   14283 III: 13173107-13187389 Caenorhabditis elegans