WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00012891 Gene Name  sorb-1
Sequence Name  ? Y45F10D.13 Organism  Caenorhabditis elegans
Automated Description  Involved in maintenance of mitochondrion location and sarcomere organization. Located in dense body; focal adhesion; and plasma membrane. Expressed in body wall musculature; non-striated muscle; and uterus. Human ortholog(s) of this gene implicated in obesity and type 2 diabetes mellitus. Is an ortholog of human SORBS1 (sorbin and SH3 domain containing 1) and SORBS2 (sorbin and SH3 domain containing 2). Biotype  SO:0001217
Genetic Position  IV :10.712 ±0.001593 Length (nt)  ? 20165
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00012891

Genomics

5 Transcripts

Class WormMine ID Sequence Name Length (nt) Chromosome Location
MRNA Transcript:Y45F10D.13d.1 Y45F10D.13d.1 3173   IV: 13750333-13770497
MRNA Transcript:Y45F10D.13a.1 Y45F10D.13a.1 3164   IV: 13750334-13770495
NcPrimaryTranscript Transcript:Y45F10D.13b Y45F10D.13b 1210   IV: 13761915-13770353
MRNA Transcript:Y45F10D.13c.1 Y45F10D.13c.1 1422   IV: 13764610-13770351
MRNA Transcript:Y45F10D.13e.1 Y45F10D.13e.1 1428   IV: 13764610-13770351
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:Y45F10D.13a Y45F10D.13a 3018   IV: 13750336-13750374
CDS:Y45F10D.13d Y45F10D.13d 3024   IV: 13750336-13750374
CDS:Y45F10D.13c Y45F10D.13c 1422   IV: 13764610-13764715
CDS:Y45F10D.13e Y45F10D.13e 1428   IV: 13764610-13764715

21 RNAi Result

WormBase ID
WBRNAi00048742
WBRNAi00056600
WBRNAi00109550
WBRNAi00109606
WBRNAi00032974
WBRNAi00037097
WBRNAi00077332
WBRNAi00109161
WBRNAi00109218
WBRNAi00114504
WBRNAi00114500
WBRNAi00114499
WBRNAi00114501
WBRNAi00109315
WBRNAi00109259
WBRNAi00114498
WBRNAi00114497
WBRNAi00109453
WBRNAi00109509
WBRNAi00109412
WBRNAi00109356

301 Allele

Public Name
otn10072
otn10073
otn10074
otn10075
otn10076
otn10077
gk964078
gk963546
gk963547
gk964500
gk962765
gk964111
gk964110
gk963691
WBVar02068750
gk618503
gk361933
gk882999
gk452297
gk385840
gk792500
gk915203
gk400755
gk374851
gk635756
gk511861
gk596085
gk532240
gk773671
gk883000

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00012891 13750333 13770497 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_13770498..13770672   175 IV: 13770498-13770672 Caenorhabditis elegans

189 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Genes with expression altered >= 3-fold in dpy-10(e128) mutants. Data across the wild type series was analyzed using the Significance analysis of Microarrays (SAM) algorithm (to calculate the False Discovery Rate (FDR)). WBPaper00035873:dpy-10_regulated
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:bodywall-muscle_L1-larva_expressed
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Transcripts that showed significantly increased expression after animals were treated with 50uM Rifampicin from day 1 to day 3 adult hermaphradite. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:Rifampicin_upregulated
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts that showed significantly decreased expression at 5-days-post L4 adult N2 hermaphrodites comparing to 1-day-post L4 adult N2 hermaphrodites. DESeq2, fold change > 2, FDR < 0.05 WBPaper00065835:Day5_vs_Day1_downregulated
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts expressed in pharynx, according to PAT-Seq analysis using Pmyo-2-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:pharynx_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite comparing to in L4 larva daf-16(mu86);glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_daf-16(mu86);glp-1(e2141)
  Transcripts that showed significantly decreased expression in day 3 adult hermaphrodite comparing to in L4 larva glp-1(e2141) animals. Fold change > 2, FDR < 0.05 WBPaper00064088:Day-3-adult_vs_L4_downregulated_glp-1(e2141)
  Transcripts that showed significantly increased expression after four-day-old young adult worms were placed on NGM plates seeded with OP50 in the presence 5% Agaro-oligosaccharides(AGO) for 24 h, comparing to animals grown in the absence of AGO. Fold change > 2. WBPaper00064306:Agaro-oligosaccharides_upregulated
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
Bacteria infection: Staphylococcus aureus MW2. 4 hours of exposure. Transcripts that showed significantly increased expression after N2 animals had 4 hours of infection by Staphylococcus aureus (MW2). DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:S.aureus-4h_upregulated_N2
  Transcripts that showed significantly decreased expression in N2 animals exposed to 0.1mM Paraquat from hatching to reaching adult stage. DESeq2 version 1.22.2, p < 0.05 WBPaper00064716:paraquat_downregulated
  Transcripts that showed significantly altered expression after 24 hour exposure to stavudine (d4T) starting at L1 lava stage. DESeq WBPaper00053302:stavudine_24h_regulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Expression Pattern Group C, enriched for genes involved in metabolic processes. The significance (P 0.0001) of the relative age (time) was used to determine if a gene was differentially expressed between the three age (time) groups. The effect of this factor explaining gene expression differences was used to determine if the expression went up or down during the two age/time periods (t1 - t2 and t2 -t3). Authors used a permutation approach to determine the thresholds for the different mapping strategies. For each of the used models for eQTL mapping, authors used 23,000 permutations. For each permutation, authors randomly picked a spot; each spot could only be picked once. The gene expression and relative lifespan values were than randomly distributed over the RILs (and time points) and used for mapping. In this way, authors obtained a threshold for each of the explaining factors. For the single time points, authors used a FDR of 0.01 to adjust for multiple testing. The genome-wide threshold for this FDR is -log10 P = 3.8 for each of the three time points. For the combined models (t1 to t2 and t2 to t3), authors used a genome-wide threshold of -log10 P = 4, which resembles an FDR of 0.006, 0.001, and 0.006 for marker, age, and the interaction between marker and age, respectively. To determine the threshold for the single gene examples, authors used 1000 permutations as in the genome-wide threshold. The difference is that they use the gene under study in all of the permutations. The P-values for the gene specific thresholds were determined at FDR = 0.05. WBPaper00036286:Pattern_C
  Transcripts depleted in purified oocyte P bodies comparing to in the whole animal. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_WholeAnimal_depleted
  Proteins that showed significantly decreased expression in 1-day-old sek-1(km4) adults comparing to in wild type animals, both with 6 hours of cisplatin treatment. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:sek-1(km4)_downregulated_cisplatin
  Transcripts that showed significantly increased expression in hpk-1(pk1393) comparing to in N2 at adult day 2. DESeq 2, fold change > 2, FDR < 0.05. WBPaper00065581:hpk-1(pk1393)_upregulated
  Transcripts that showed significantly increased expression in hda-1(ne4752[3xFLAG-Degron-HDA-1]) in gonads dissected from 1-day old adult animals. Salmon was used to map the mRNA-seq reads with the worm database WS268, and its output files were imported to DESeq2 in R. The differentially expressed genes were filtered by fold change more than 2 and adjusted p-value < 0.05. The scatter plots were generated by the plot function in R. WBPaper00061479:hda-1(ne4752)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 0hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:germline-precursors_blastula-embryo_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:PVD-OLL-neurons_L3-L4-larva_expressed
  Transcripts that showed significantly increased expression in hda-2(ok1479) comparing to in N2 animals. DESeq2 (version 1.28.1), FDR < 0.01, fold change > 2. WBPaper00062159:hda-2(ok1479)_upregulated
  Transcripts that showed significantly increased expression in npr-15(tm12539) comparing to in N2 at L4 larva stage. Fold change > 2, FDR < 0.05. WBPaper00066608:npr-15(tm12539)_upregulated

6 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr2034244 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr14483   sorb-1::gfp was strongly expressed in all body wall muscles, where the protein tagged with green fluorescent protein (GFP) localized to adhesion plaques between myocytes and to dense bodies. SORB-1::GFP was also observed in muscle arm attachment sites at the nerve ring, where protrusions from head muscles form contacts with the basement membrane (White et al., 1986). SORB-1::GFP was also observed in all nonstriated muscles of C. elegans with the exception of the pharynx, localizing exclusively to integrin adhesion sites. Fluorescent signal localized strongly to the origins and insertions of the vulval and anal depressor muscles as well as to the spicule-associated and diagonal muscles of the male tail. In myocytes more closely resembling vertebrate smooth muscle, such as the uterus, stomatointestinal muscle, and proximal gonad sheath, GFP signal was present in small puncta present throughout the tissue).
    Expr1035702 Tiling arrays expression graphs  
    Expr1012125 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2016009 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1160065 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

80 GO Annotation

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  involved_in
  involved_in
  enables

11 Homologues

Type
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00012891 13750333 13770497 1

80 Ontology Annotations

Annotation Extension Qualifier
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  enables
  located_in
  located_in
  located_in
  involved_in
  involved_in
  enables

0 Regulates Expr Cluster

1 Sequence

Length
20165

1 Sequence Ontology Term

Identifier Name Description
gene  

2 Strains

WormBase ID
WBStrain00032411
WBStrain00035969

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_13749887..13750332   446 IV: 13749887-13750332 Caenorhabditis elegans