WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00021311 Gene Name  thoc-5
Sequence Name  ? Y32H12A.2 Brief Description  Y32H12A.2 encodes the C. elegans ortholog of mammalian THOC5, a subunit of the THO complex involved in mRNP biogenesis as part of transcription elongation, mRNA maturation, and export.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable mRNA binding activity. Predicted to be involved in mRNA export from nucleus. Predicted to be located in nucleus. Predicted to be part of THO complex part of transcription export complex. Human ortholog(s) of this gene implicated in breast carcinoma. Is an ortholog of human THOC5 (THO complex subunit 5).
Biotype  SO:0001217 Genetic Position  III :-1.80654±
Length (nt)  ? 3471
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00021311

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:Y32H12A.2b.1 Y32H12A.2b.1 2213   III: 5345931-5349401
Transcript:Y32H12A.2a.1 Y32H12A.2a.1 1878   III: 5346226-5349367
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y32H12A.2a Y32H12A.2a 1794   III: 5346303-5346507
CDS:Y32H12A.2b Y32H12A.2b 1800   III: 5346303-5346507

13 RNAi Result

WormBase ID
WBRNAi00067297
WBRNAi00055805
WBRNAi00083740
WBRNAi00020187
WBRNAi00021831
WBRNAi00006989
WBRNAi00036758
WBRNAi00083633
WBRNAi00083855
WBRNAi00061920
WBRNAi00103646
WBRNAi00115570
WBRNAi00115509

41 Allele

Public Name
WBVar01827662
WBVar02124405
tm2921
ttTi5566
ttTi5831
WBVar01445717
WBVar01837582
WBVar01837583
WBVar01408593
WBVar02089658
WBVar00099448
WBVar01628512
gk517341
gk606313
gk926407
gk911182
gk790159
gk855424
gk677330
gk693633
gk875608
gk774528
gk598377
gk484325
gk660912
gk391114
gk416598
gk498537
WBVar00056371
WBVar01644667

1 Chromosome

WormBase ID Organism Length (nt)
III Caenorhabditis elegans 13783801  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00021311 5345931 5349401 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

0 Downstream Intergenic Region

115 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts of coding genes that showed significantly decreased expression in muscle. DESeq2 (version 1.24.0). Transcripts with a false-discovery rate adjusted p-value less than 0.05 were considered significantly differentially expressed. WBPaper00062325:muscle_depleted_coding-RNA
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in N2 after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_N2
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression in day 3 adult hermaphrodite comparing to in L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_upregulated_fem-3(q20)
Bacteria diet: Sphingomonas aquatilis Yellow. Fed for 30 generations. Transcripts that showed significantly decreased expression after fed by bacteria Sphingomonas aquatilis (Yellow) for 30 generations comparing to animals fed by E. coli OP50. DESeq2 fold change > 2, p-value < 0.01. WBPaper00061007:S.aquatilis_downregulated
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
  Transcripts that showed significantly increased expression in hrde-1(tm1200) animals, comparing to in N2, after growing at 25C for five generations (late generation). CuffDiff2 WBPaper00051265:F4_hrde-1(tm1200)_upregulated
  Transcripts that showed significantly increased expression in animals exposed to 400uM tamoxifen from L1 to L4 larva stage. DEseq2, fold change > 2 WBPaper00064505:tamoxifen_upregulated
  Genes down regulated by mir-243(n4759). RNAs that changed at least 2-fold with a probability of p > 0.05 in three biological replicates were considered differentially regulated between wild-type and mir-243. WBPaper00036130:mir-243_down_regulated
  Transcripts that showed significantly increased expression in 10-days post L4 adult hermaphrodite N2 grown at 20C, comparing to in 1-day post L4 adult hermaphrodite N2 animals grown at 20C. CuffDiff, fold change > 2. WBPaper00065096:Day10_vs_Day1_upregulated
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
  Transcripts that showed significantly decreased expression in tetraploid N2 comparing to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L2-larva_expressed
  Transcripts that showed significantly increased expression in ilc-17.1(syb5296) comparing to in N2 animals at L4 larva stage. DESeq2, fold change > 2, FDR < 0.05. WBPaper00066594:ilc-17.1(syb5296)_upregulated
Bacteria: B.thuringiensis Transcripts in elt-2(RNAi) animals that were significantly differentially expressed at least for one time point and one pathogenic strain Bt247 and Bt679 compared to the non pathogenic strain Bt407. Cuffdiff WBPaper00060358:B.thuringiensis_pathogen_regulated_elt-2(RNAi)
Bacteria: B.thuringiensis Transcripts in N2 animals that were significantly differentially expressed at least for one time point and one pathogenic strain Bt247 and Bt679 compared to the non pathogenic strain Bt407. Cuffdiff WBPaper00060358:B.thuringiensis_pathogen_regulated_N2
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed decreased expression in hlh-11(ko1) knockout strain comparing to in wild type background. DESeq2, FDR < 0.05 WBPaper00060683:hlh-11(ko1)_downregulated
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Students t-test for one population. Author chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with two-fold difference and a P<0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic
  Transcripts that showed significantly increased expression in set-2(tm1630) animals at embryo stage, comparing to in N2 animals. DESeq2 (v2.1.8.3) was used to determine DE genes and to generate principal component and scatter plots. DE genes with FDR < 0.05 were analysed using g:Profiler with Bonferroni correction. WBPaper00060014:set-2(tm1630)_upregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2035531 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1019175 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1159336 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr13875   THOC-5 was strictly nuclear excluded in dopaminergic neurons.
    Expr1039290 Tiling arrays expression graphs  
    Expr2017392 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

6 GO Annotation

Annotation Extension Qualifier
  enables
  involved_in
  located_in
  located_in
  part_of
  part_of

5 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00021311 5345931 5349401 -1

6 Ontology Annotations

Annotation Extension Qualifier
  enables
  involved_in
  located_in
  located_in
  part_of
  part_of

2 Regulates Expr Cluster

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly decreased expression in FACS sorted neuron cells (labelled by pan-neuronal GFP) from edIs6[unc-119::GFP + rol-6(su1006)]; thoc-5(wy822) comparing to in edIs6. DESeq2, log2 fold change > 2, adjusted p-value < 0.005. WBPaper00055103:thoc-5(wy822)_downregulated
  Transcripts that showed significantly increased expression in FACS sorted neuron cells (labelled by pan-neuronal GFP) from edIs6[unc-119::GFP + rol-6(su1006)]; thoc-5(wy822) comparing to in edIs6. DESeq2, log2 fold change > 2, adjusted p-value < 0.005. WBPaper00055103:thoc-5(wy822)_upregulated

1 Sequence

Length
3471

1 Sequence Ontology Term