WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  WBGene00019821
Gene Name  gtf-2H1
Sequence Name  R02D3.3
Organism  Caenorhabditis elegans
Automated Description  Predicted to be involved in DNA repair; transcription by RNA polymerase I; and transcription by RNA polymerase II. Predicted to be located in nucleus. Predicted to be part of transcription factor TFIIH core complex and transcription factor TFIIH holo complex. Expressed widely. Is an ortholog of human GTF2H1 (general transcription factor IIH subunit 1).
Biotype  SO:0001217
Genetic Position  IV :-26.8017±
Length (nt)  6335
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00019821

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:R02D3.3a.1 R02D3.3a.1 1641   IV: 225439-231773
Transcript:R02D3.3b.1 R02D3.3b.1 426   IV: 225481-225906
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:R02D3.3b R02D3.3b 426   IV: 225481-225906
CDS:R02D3.3a R02D3.3a 1599   IV: 225481-226038

8 RNAi Result

WormBase ID
WBRNAi00026011
WBRNAi00051124
WBRNAi00082704
WBRNAi00034528
WBRNAi00071391
WBRNAi00111674
WBRNAi00082440
WBRNAi00009025

81 Allele

Public Name
gk963025
gk963690
otn12226
gk191588
gk191589
gk191587
gk191592
gk191593
gk191590
gk191591
gk191596
gk191594
gk191595
gk191597
gk191598
WBVar01607417
WBVar01607416
WBVar01607418
WBVar01607415
gk945288
otn542
WBVar01825524
WBVar01828005
h4962
gk953307
WBVar01984253
WBVar01984255
WBVar01984254
gk954372
WBVar01944847

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00019821 225439 231773 -1
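
The lengths reported on this page are consistent with 1-based, inclusive coordinates. The following is a minimal sketch of that arithmetic, assuming that convention; the helper name `span_length` is hypothetical and is not part of WormMine or InterMine.

```python
def span_length(start: int, end: int) -> int:
    """Length in nt of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from this record (chromosome IV).
gene = (225439, 231773)        # WBGene00019821
downstream = (220856, 225438)  # intergenic_region_chrIV_220856..225438
upstream = (231774, 232472)    # intergenic_region_chrIV_231774..232472

for name, (start, end) in [("gene", gene),
                           ("downstream", downstream),
                           ("upstream", upstream)]:
    print(name, span_length(start, end))
```

Under this assumption the gene span evaluates to 6335 nt, and the two intergenic regions to 4583 nt and 699 nt, matching the Length (nt) values listed for them. The intergenic regions also abut the gene exactly: the downstream region ends at 225438 and the upstream region starts at 231774.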

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_220856..225438   4583 IV: 220856-225438 Caenorhabditis elegans

87 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells compared to adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using Benjamini-Hochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin and 250uM Allantoin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Transcripts expressed in seam cells, according to PAT-Seq analysis using Pgrd-10-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:seam_expressed
  Transcripts that showed significantly increased expression in day 1 adult hermaphrodite compared to L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-1-adult_vs_L4_upregulated_fem-3(q20)
  Transcripts that showed significantly increased expression in day 3 adult hermaphrodite compared to L4 larva fem-3(q20) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_upregulated_fem-3(q20)
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts that showed significantly increased expression in animals exposed to 400uM tamoxifen from L1 to L4 larva stage. DESeq2, fold change > 2 WBPaper00064505:tamoxifen_upregulated
  Transcripts that showed significantly changed expression in 6-day post-L4 adult hermaphrodites compared to 1-day post-L4 adult hermaphrodite animals. Sleuth WBPaper00051558:aging_regulated
  Transcripts that showed significantly increased expression in 10-day post-L4 adult hermaphrodite N2 animals grown at 20C, compared to 1-day post-L4 adult hermaphrodite N2 animals grown at 20C. CuffDiff, fold change > 2. WBPaper00065096:Day10_vs_Day1_upregulated
  Proteins that showed significantly decreased expression in 1-day-old sek-1(km4) adults compared to wild type animals, both with 6 hours of cisplatin treatment. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:sek-1(km4)_downregulated_cisplatin
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L2-larva_expressed
  Transcripts that showed significantly increased expression in hda-1(RNAi) embryos compared to control animals. DESeq2, fold change > 2, FDR < 0.05. WBPaper00067044:hda-1(RNAi)_upregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Genes that were enriched in neither spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed altered expression from P0 to F2 generation animals after the N2 parental generation was treated with antimycin, but not in damt-1(gk961032) P0 to F2 animals after the parental generation was treated with antimycin. N.A. WBPaper00055862:antimycin_damt-1(gk961032)_regulated
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, compared to animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) compared to N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  TGF-beta Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Embryonic class (E): genes that significantly increase in abundance at some point during embryogenesis. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_E
  Transcripts with significantly increased expression in isp-1(qm150) vs. N2, and in isp-1(qm150) ced-4(n1162) vs. ced-4(n1162). Each genotype was compared to the wild type using the Empirical Bayes (Wright & Simon) algorithm, and fold changes were represented on a log2 scale. A threshold of p < 0.05 and a fold change of 1.3 (log2) was set to determine differentially expressed targets. WBPaper00045263:isp-1(qm150)_upregulated
  Transcripts that showed differential expression in dauer mir-34(gk437) vs dauer mir-34(OverExpression) animals at 20C. N.A. WBPaper00050488:mir-34(gk437)_vs_mir-34(OverExpression)_regulated_dauer_20C
  Genes depleted in muscle cells (24hr muscle dataset). Dissociated myo-3::GFP embryos were cultured for 24 hours before FACS sorting. A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:24hr_muscle_depleted
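
Several cluster descriptions above combine a fold-change cutoff with a Benjamini-Hochberg adjusted p-value (FDR) threshold. The following is a minimal sketch of that adjustment and filter in plain Python. It illustrates the general procedure only; it is not the code used by DESeq2, edgeR, or any of the cited papers, and the function names and default thresholds are assumptions chosen for the example.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def is_upregulated(fold_change, padj, fc_min=2.0, alpha=0.05):
    """Hypothetical filter: fold change > fc_min and adjusted p < alpha."""
    return fold_change > fc_min and padj < alpha


pvals = [0.01, 0.04, 0.03, 0.005]
padj = benjamini_hochberg(pvals)
print(padj)
```

The adjustment multiplies each p-value by the number of tests divided by its rank, then takes a running minimum from the largest p-value downward so that adjusted values never decrease as the raw p-values decrease.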

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2023534 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1021734 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr16019 We generated GFP knock-ins of gtf-2h1 and gtf-2h5, which show ubiquitous expression in the nuclei of essentially all tissues.  
    Expr1038588 Tiling arrays expression graphs  
    Expr1154833 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr2005314 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  

11 GO Annotation

Annotation Extension Qualifier
  involved_in
  part_of
  part_of
  part_of
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in

6 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00019821 225439 231773 -1

11 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  part_of
  part_of
  part_of
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
6335

1 Sequence Ontology Term

Identifier Name Description
gene  

0 Strains

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_231774..232472   699 IV: 231774-232472 Caenorhabditis elegans