WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  WBGene00013808
Gene Name  sfa-1
Sequence Name  Y116A8C.32
Brief Description  sfa-1 encodes an ortholog of splicing factor SF1, which enables 3' splice site recognition by binding U2AF65 and the intron branch site during splicing complex formation; SFA-1 shares several domains with mammalian SF1 proteins (a U2AF65 binding domain, a hnRNP K homology domain, two RNA-binding zinc knuckles, and a proline-rich C-terminal domain), while also sharing a hydrophilic N-terminal domain (enriched for serine, arginine, lysine, and aspartate) with Drosophila but not mammalian SF1; SFA-1's N-terminal domain resembles RS domains in other splicing proteins; sfa-1 is required for embryonic viability, normally rapid growth, and proper body morphology.
Organism  Caenorhabditis elegans
Automated Description  Predicted to enable mRNA binding activity. Involved in alternative mRNA splicing, via spliceosome. Predicted to be located in nucleus. Predicted to be part of spliceosomal complex. Is an ortholog of human SF1 (splicing factor 1).
Biotype  SO:0001217
Genetic Position  IV: 16.1569 ±0.005785
Length (nt)  7158

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00013808

Genomics

2 Transcripts

Class WormMine ID Sequence Name Length (nt) Chromosome Location
MRNA Transcript:Y116A8C.32a.1 Y116A8C.32a.1 2820   IV: 17103825-17110982
NcPrimaryTranscript Transcript:Y116A8C.32b Y116A8C.32b 2027   IV: 17104545-17110982
 

Other

1 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y116A8C.32a Y116A8C.32a 2100   IV: 17104545-17104879

18 RNAi Result

WormBase ID
WBRNAi00009288
WBRNAi00055449
WBRNAi00075322
WBRNAi00083605
WBRNAi00083710
WBRNAi00080035
WBRNAi00083799
WBRNAi00071500
WBRNAi00075323
WBRNAi00026596
WBRNAi00078087
WBRNAi00111876
WBRNAi00108657
WBRNAi00108582
WBRNAi00080123
WBRNAi00083544
WBRNAi00111052
WBRNAi00111156

177 Allele

Public Name
gk964078
gk964500
gk962765
gk963590
gk962529
gk963948
gk963888
otn8834
WBVar01970269
WBVar01970268
WBVar01970267
WBVar01970266
WBVar01970272
WBVar01970271
WBVar01970270
WBVar01970275
WBVar01970274
WBVar01970273
WBVar02067947
WBVar01543068
WBVar00200140
WBVar00200141
WBVar00200142
WBVar00200143
WBVar00200144
WBVar00200145
WBVar00200146
WBVar00200147
WBVar00200148
WBVar00200149

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier Start End Strand
WBGene00013808 17103825 17110982 -1
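The lengths reported in this record can be checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the numbers shown here but is not stated in the record itself; the helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Return the length in nt of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from this record (chromosome IV).
gene_length = span_length(17103825, 17110982)        # WBGene00013808
downstream_length = span_length(17103716, 17103824)  # downstream intergenic region
upstream_length = span_length(17110983, 17111080)    # upstream intergenic region

print(gene_length, downstream_length, upstream_length)
```

Under that assumption the gene span matches the reported Length (nt) of 7158, and the two intergenic regions match their reported lengths of 109 and 98. Because the gene is on strand -1, the upstream region sits at the higher coordinates.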

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_17103716..17103824   109 IV: 17103716-17103824 Caenorhabditis elegans

79 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells compared with adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using Benjamini-Hochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzing fluorescence-activated cell sorted (FACS) neurons. DESeq. False discovery rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
  mRNAs that showed decreased expression in 1-cell embryo compared with oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
  Transcripts that showed significantly increased expression after animals were treated with 100uM Psora and 250uM Allantoin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Psora-Allantoin_upregulated
  Transcripts that showed significantly increased expression after animals were treated with 100uM Rapamycin and 50mM Metformin from day 1 to day 3 adult hermaphrodite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rapamycin-Metformin_upregulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in the epithelial tissues surrounding the pharynx that includes the arcade and intestinal valve (AIV) cells, according to PAT-Seq analysis using Pbath-15-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:arcade_intestinal-valve_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at old adults stage (214 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a -log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_aging
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after infection with pathogenic Bacillus thuringiensis compared with non-pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, adjusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
  Significantly differentially expressed genes as determined by microarray analysis of wild-type and cde-1 mutant germlines. RNAs that changed at least 2-fold with a probability of p < 0.05 were considered differentially regulated between wildtype and cde-1. WBPaper00035269:cde-1_regulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 larva stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Expression Pattern Group C, enriched for genes involved in metabolic processes. The significance (P < 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 - t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were then randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_C
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
Gamma irradiation 100 mGy per hour for 72 hours since L1 larva. Transcripts that showed significantly increased expression after exposure to 100mGy per hour gamma irradiation from L1 to day 1 adult hermaphrodite stage. DESeq2, FDR <= 0.05, log2 fold change >= 0.3 or <= -0.3. WBPaper00058958:100mGy-irradiation-72h_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:coelomocytes_L2-larva_expressed
  Genes down-regulated following nhr-25(RNAi). Pair-wise significance testing (mutant/RNAi vs. wild-type/vector) was performed using the Bioconductor package limma and p-values were initially corrected for multiple testing using the false discovery rate (FDR) method of Benjamini and Hochberg. Authors defined differential expression as log2(ratio) >= 0.848 with the FDR set to 5%, and p-value <= 0.001. WBPaper00045015:nhr-25(RNAi)_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:GABAergic-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samples or sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:intestine_L2-larva_expressed
  Transcripts that showed altered expression in cat-1(RNAi) animals compared with control animals injected with empty vector. p-value <= 0.05 WBPaper00066902:cat-1(RNAi)_regulated
  Transcripts that showed significantly altered expression in rnp-6(dh1127) animals compared with N2 when fed with heat killed E. coli OP50. Differentially expressed genes (DEGs) (q-value <0.05) between different samples were identified using the stringtie version 1.3.0, followed by Cufflinks version 2.2. WBPaper00059824:rnp-6(dh1127)_regulated_OP50
  Transcripts that showed significantly altered expression in rnp-6(dh1127) animals compared with N2 when fed with live S. aureus. Differentially expressed genes (DEGs) (q-value <0.05) between different samples were identified using the stringtie version 1.3.0, followed by Cufflinks version 2.2. WBPaper00059824:rnp-6(dh1127)_regulated_S.aureus
Fungi infection: Haptoglossa zoospora. Transcripts that showed significantly altered expression after L4 N2 animals were exposed to oomycete Haptoglossa zoospora for 6 hours. Kallisto abundance files were converted and analysed using Sleuth in an R pipeline. Standard Sleuth protocols were used to calculate differential expression. P value < 0.01 and FDR < 0.01. WBPaper00062354:H.zoospora_6h_regulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Genes that were enriched in neither spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
  Transcripts that showed significantly decreased expression in the neurons of bcat-1(RNAi) animals at 5-days post L4 adult hermaphrodite stage, compared with animals injected with empty vector. DESeq2. FDR < 0.05. WBPaper00060459:bcat-1(RNAi)_downregulated
  Transcripts that showed significantly decreased expression in nhl-2(ok818) compared with N2 at 25C. EdgeR, FDR < 0.05, fold change < 0.5. WBPaper00055971:nhl-2(ok818)_25C_upregulated
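
Several of the cluster descriptions above combine a fold-change cutoff with a Benjamini-Hochberg adjusted p-value (for example, fold change > 2 and P-adj < 0.05). The following is a minimal sketch of that kind of filter, not the pipeline any of the cited papers actually ran (they used DESeq2, edgeR, Cuffdiff and similar tools). The function names, the record layout and the toy gene labels are all hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    n = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def upregulated(records, min_fold_change=2.0, max_padj=0.05):
    """Select genes with fold change > min_fold_change and adjusted p < max_padj."""
    padj = benjamini_hochberg([r["pvalue"] for r in records])
    return [
        r["gene"]
        for r, q in zip(records, padj)
        if r["fold_change"] > min_fold_change and q < max_padj
    ]


# Toy, made-up input purely for illustration (not data from this record).
toy_records = [
    {"gene": "a", "fold_change": 3.0, "pvalue": 0.005},
    {"gene": "b", "fold_change": 1.5, "pvalue": 0.01},
    {"gene": "c", "fold_change": 4.0, "pvalue": 0.03},
    {"gene": "d", "fold_change": 2.5, "pvalue": 0.2},
]
print(upregulated(toy_records))
```

The default thresholds mirror the cutoffs quoted in the first cluster description; other clusters in the table use different cutoffs, so both are left as parameters.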

5 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034019 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1019252 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2015786 Single cell embryonic expression. Only cell types with an expression fraction greater than 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1036183 Tiling arrays expression graphs  
    Expr1158961 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

20 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  part_of
  located_in
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables
  enables
  enables

5 Homologues

Type
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier Start End Strand
WBGene00013808 17103825 17110982 -1

20 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables
  enables
  enables
  part_of
  located_in
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables
  enables
  enables

0 Regulates Expr Cluster

1 Sequence

Length
7158

1 Sequence Ontology Term

Identifier Name Description
gene  

2 Strains

WormBase ID
WBStrain00027479
WBStrain00027594

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_17110983..17111080   98 IV: 17110983-17111080 Caenorhabditis elegans