WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene

WormBase Gene ID: WBGene00006386
Gene Name: taf-5
Sequence Name: F30F8.8
Organism: Caenorhabditis elegans
Automated Description: Predicted to contribute to RNA polymerase II general transcription initiation factor activity. Involved in embryo development and transcription by RNA polymerase II. Located in nucleus. Is an ortholog of human TAF5 (TATA-box binding protein associated factor 5).
Biotype: SO:0001217
Genetic Position: I: 2.37598 ± 0.00155
Length (nt): 4777

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00006386

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:F30F8.8.2 F30F8.8.2 2368   I: 7848864-7853640
Transcript:F30F8.8.1 F30F8.8.1 2097   I: 7849813-7853636
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:F30F8.8 F30F8.8 1947   I: 7849825-7850057

40 RNAi Result

WormBase ID
WBRNAi00095598
WBRNAi00113244
WBRNAi00045970
WBRNAi00045972
WBRNAi00003543
WBRNAi00081373
WBRNAi00031602
WBRNAi00071023
WBRNAi00071024
WBRNAi00113014
WBRNAi00076362
WBRNAi00060915
WBRNAi00060916
WBRNAi00000039
WBRNAi00001779
WBRNAi00073065
WBRNAi00073064
WBRNAi00022953
WBRNAi00008050
WBRNAi00025333
WBRNAi00025334
WBRNAi00060913
WBRNAi00060914
WBRNAi00066471
WBRNAi00066466
WBRNAi00066465
WBRNAi00066468
WBRNAi00066467
WBRNAi00066470
WBRNAi00066469

60 Allele

Public Name
gk962858
gk962706
gk963902
gk963849
gk964316
WBVar01432189
WBVar01432188
gk962720
gk962721
WBVar02122894
WBVar02028476
cxTi10053
WBVar01909888
gk427182
WBVar01909889
gk895904
WBVar01909890
gk800520
gk367920
gk940004
gk594187
gk940005
gk319407
gk462604
gk328318
WBVar01909887
gk816712
gk690245
gk813768
gk837386

1 Chromosome

WormBase ID Organism Length (nt)
I Caenorhabditis elegans 15072434  

1 Chromosome Location


Feature Primary Identifier Start End Strand
WBGene00006386 7848864 7853640 1
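
The Length (nt) values on this page agree with the listed coordinates if Start and End are read as 1-based and inclusive. That convention is an assumption here, since the page does not state it. A minimal Python sketch of the arithmetic follows; span_length is a hypothetical helper for illustration, not part of any WormMine or InterMine API.

```python
def span_length(start: int, end: int) -> int:
    """Return the length in nt of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Gene WBGene00006386 on chromosome I (coordinates from the Chromosome Location table)
gene_length = span_length(7848864, 7853640)

# Downstream intergenic region intergenic_region_chrI_7853641..7853749
intergenic_length = span_length(7853641, 7853749)

print(gene_length)        # 4777, matching the gene's Length (nt)
print(intergenic_length)  # 109, matching the intergenic region's Length (nt)
```

The same check does not apply to the transcript and CDS rows, whose Length (nt) values differ from their genomic spans.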

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrI_7853641..7853749   109 I: 7853641-7853749 Caenorhabditis elegans

96 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer larva in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Transcripts that showed significantly higher expression in somatic gonad precursor cells (SGP) vs. head mesodermal cells (hmc). DESeq2, fold change >= 2, FDR <= 0.01. WBPaper00056826:SGP_biased
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodites compared to L4 larvae in daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_daf-16(mu86);glp-1(e2141)
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Top 300 transcripts enriched in ABalppppppa, ABpraaapppa according to single cell RNAseq. Top 300 enriched transcripts were determined by log2.ratio of the tpm in the cell type vs the tpm in the other cells * the log2 of the cell.type tpm. WBPaper00061340:ASE_parent
  Transcripts that showed significantly changed expression in 6-day post-L4 adult hermaphrodites compared to 1-day post-L4 adult hermaphrodite animals. Sleuth WBPaper00051558:aging_regulated
  Significantly differentially expressed genes as determined by microarray analysis of wild-type and cde-1 mutant germlines. RNAs that changed at least 2-fold with a probability of p < 0.05 were considered differentially regulated between wildtype and cde-1. WBPaper00035269:cde-1_regulated
  Transcripts that showed significantly increased expression in xrep-4(lax137). DESeq2. Genes were selected if their p value < 0.01. WBPaper00066062:xrep-4(lax137)_upregulated
  Transcripts that showed significantly decreased expression in tetraploid N2 compared to diploid N2 animals at L4 larva stage. DESeq2 R package (1.20.0), fold change > 2, and FDR < 0.05. WBPaper00066110:tetraploid_vs_diploid_downregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Genes that were enriched in neither spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed decreased expression in the hlh-11(ko1) knockout strain compared to the wild-type background. DESeq2, FDR < 0.05 WBPaper00060683:hlh-11(ko1)_downregulated
Starvation 48 hours at L1 arrest Transcripts that showed significantly increased expression in starved N2 animals (48 hours at L1 arrest) Fold change > 2. WBPaper00064005:starvation_upregulated_N2_mRNA
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  Germline-intrinsic transcripts. Comparisons were made between genotypes by subtracting the mean log value of one ratio from another, and the significance of the difference was evaluated using Student's t-test for two populations. For the fem-3(gf) versus fem-1(lf) direct comparison, authors performed the same analysis, except they used a Student's t-test for one population. Authors chose a combination of a twofold difference with a t value exceeding 99% confidence (P < 0.01), because these criteria allowed the inclusion of essentially all genes that had previously been identified as germline-enriched in a wt/glp-4 hermaphrodite comparison. Additionally, requiring a twofold difference reduced false positives, as the number of genes with a twofold difference and P < 0.01 only included ~100 genes more than with P < 0.001, and almost all genes showed germline expression by in situ hybridization. [cgc6390]:intrinsic
  TGF-beta Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Transcripts uniquely expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_enriched
  Transcripts that showed significantly increased expression in jmjd-3.1p::jmjd-3.1 compared to N2. DESeq2 Benjamini-Hochberg adjusted p-value < 0.05. WBPaper00049545:jmjd-3.1(+)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L1-larva_expressed
  Transcripts with significantly increased expression in nuo-6(qm200) vs. N2, and in nuo-6(qm200);ced-4(n1162) vs. ced-4(n1162). Each genotype was compared to the wild type using the Empirical Bayes (Wright & Simon) algorithm, and fold changes were represented on a log2 scale. A threshold of p < 0.05 and a fold change of 1.3 (log2) was set to determine differentially expressed targets. WBPaper00045263:nuo-6(qm200)_upregulated
20C vs 25C Transcripts that showed differential expression in 20C vs 25C in mir-34(gk437) animals at adult stage. N.A. WBPaper00050488:20C_vs_25C_regulated_mir-34(gk437)_adult
20C vs 25C Transcripts that showed differential expression in 20C vs 25C in N2 animals at adult stage. N.A. WBPaper00050488:20C_vs_25C_regulated_N2_adult
  Total muscle depleted genes (complete list of non-overlapping genes from the 0hr and 24hr muscle depleted datasets). A two-class unpaired analysis was performed to identify genes that are elevated 1.7-fold or greater when compared with the reference for each dataset, at a false discovery rate of 1.8% or less for M0 and 1.2% or less for the M24 datasets. WBPaper00031003:total_muscle_depleted

7 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2035351 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
Strain: BC14309 [taf-5::gfp] transcriptional fusion. PCR products were amplified using primer A: 5' [CTGTACCTTGCGGCTTTCA] 3' and primer B 5' [GAATTCTCGCTATCGATCAAG] 3'. Expr5919 Adult Expression: Nervous System; nerve ring; ventral nerve cord; head neurons; tail neurons; Larval Expression: Nervous System; nerve ring; ventral nerve cord; head neurons; tail neurons;  
    Expr1032583 Tiling arrays expression graphs  
    Expr1012108 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2017215 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the number of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1149870 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr3569   Localized to only punctate structures in the DA motor neurons.

9 GO Annotation

Annotation Extension Qualifier
  part_of
  part_of
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  contributes_to

14 Homologues

Type
least diverged orthologue
least diverged orthologue
orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature Primary Identifier Start End Strand
WBGene00006386 7848864 7853640 1

9 Ontology Annotations

Annotation Extension Qualifier
  part_of
  part_of
  located_in
  located_in
  located_in
  involved_in
  involved_in
  involved_in
  contributes_to

0 Regulates Expr Cluster

1 Sequence

Length
4777

1 Sequence Ontology Term

Identifier Name Description
gene  

1 Strains

WormBase ID
WBStrain00002973

0 Upstream Intergenic Region