WormMine

WS294

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00004929 Gene Name  soc-2
Sequence Name  ? AC7.2 Brief Description  soc-2 encodes a leucine-rich repeat protein; soc-2 functions downstream in the let-60/Ras and egl-15/FGF receptor signaling pathways to positively and negatively regulate signaling through these pathways, respectively, and thus affect such processes as vulval development, osmoregulation, and muscle membrane extension; consistent with its role in regulating Ras-mediated signal transduction, SOC-2 interacts with LET-60/Ras in yeast two-hybrid assays; soc-2 is reported to be widely expressed in larval and adult tissues.
Organism  Caenorhabditis elegans Automated Description  Enables small GTPase binding activity. Involved in several processes, including fibroblast growth factor receptor signaling pathway; positive regulation of Ras protein signal transduction; and vulval development. Expressed in tail. Human ortholog(s) of this gene implicated in Noonan syndrome-like disorder with loose anagen hair 1 and atopic dermatitis. Is an ortholog of human SHOC2 (SHOC2 leucine rich repeat scaffold protein).
Biotype  SO:0001217 Genetic Position  IV :1.51793±
Length (nt)  ? 25451
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00004929

Genomics

4 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:AC7.2a.1 AC7.2a.1 2128   IV: 5127041-5133857
Transcript:AC7.2b.1 AC7.2b.1 2046   IV: 5127041-5145623
Transcript:AC7.2c.1 AC7.2c.1 2281   IV: 5127041-5152491
Transcript:AC7.2d.1 AC7.2d.1 1650   IV: 5127402-5149537
 

Other

4 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:AC7.2a AC7.2a 1680   IV: 5127402-5127622
CDS:AC7.2c AC7.2c 1896   IV: 5127402-5127622
CDS:AC7.2d AC7.2d 1650   IV: 5127402-5127622
CDS:AC7.2b AC7.2b 1677   IV: 5127402-5127622

9 RNAi Result

WormBase ID
WBRNAi00067894
WBRNAi00068109
WBRNAi00038602
WBRNAi00007349
WBRNAi00101967
WBRNAi00009515
WBRNAi00027803
WBRNAi00022782
WBRNAi00086092

289 Allele

Public Name
otn9992
gk964500
gk963722
gk963417
gk963867
gk963150
gk963416
WBVar02122597
WBVar02123809
WBVar02123810
WBVar01727684
WBVar02020606
WBVar02020605
WBVar02020604
WBVar02020603
WBVar02020602
WBVar02020601
WBVar02020609
WBVar02020608
WBVar02020607
WBVar00188940
WBVar00188936
WBVar00188937
WBVar00188939
WBVar01825777
gk871520
gk621422
gk958957
gk833938
gk841758

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00004929 5127041 5152491 -1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_5126805..5127040   236 IV: 5126805-5127040 Caenorhabditis elegans

182 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  Transcripts that showed significantly increased expression in L1 neural cells comparing to in adult neural cells. DESeq2 (v1.18.1) fold change > 2, P-adj<0.05, using BenjaminiHochberg correction. WBPaper00060811:L1_vs_adult_upregulated_neural
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
  mRNAs that showed decreased expression in 1 cell mebryo comparing to in oocyte, according to RNAseq analysis. Gaussian error propagation. As cutoff for the up-regulated genes authors used log2 fold change > 1 and P < 0.05 and as cutoff for the down-regulated genes authors used log2 fold change < -1 and P < 0.05. WBPaper00045420:fertilization_downregulated_transcript
  Genes that showed expression levels higher than the corresponding reference sample (embryonic 24hr reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:AVE-neuron_L1-larva_expressed
Osmotic stress Transcripts that showed significantly altered expression with 500 mM salt (NaCl) vs 100 mM salt when food was present DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:OsmoticStress_regulated_Food
Bacteria: E.faecalis strain OG1RF Transcripts that showed significantly increased expression after infection by E. faecalis OG1RF. Ballgown was used to calculate differential expression of genes using FPKM data and to generate tables with fold change and P values. Genes were shortlisted with a cutoff of 2-fold change and P values of less than 0.05. WBPaper00059754:E.faecalis_OG1RF_upregulated
Bacteria infection: Enterococcus faecalis Genes with increased expression after 24 hours of infection by E.faecalis Fold changes shown are pathogen vs OP50. For RNA-seq and tiling arrays, log2 fold changes between gene expression values of infected versus uninfected nematodes were calculated. For log2 fold changes > 0.00001 the values > 81.25th percentile were defined as up-regulated and for log2 fold changes < -0.00001 the values < 18.75th percentile were defined as down-regulated. WBPaper00038438:E.faecalis_24hr_upregulated_TilingArray
  Coexpression clique No. 60, 176662_at-Y53F4B.16, on the genome-wide coexpression clique map for the nematode GPL200 platform. All available microarray datasets for the GPL200 platform (Affymetrix C. elegans Genome Array) were obtained from the GEO repository. This included 2243 individual microarray experiments. These were normalized against each other with the software RMAexpress (Bolstad, 2014). Based on these normalized values, Pearsons correlation coefficients were obtained for each probe-probe pair of the 22,620 probes represented on this array type. The resulting list of correlation coefficients was then ranked to generate the ranked coexpression database with information on each probe represented on the GPL200 platform. WBPaper00061527:176662_at-Y53F4B.16
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in body muscle, according to PAT-Seq analysis using Pmyo-3-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:body-muscle_expressed
  Transcripts expressed in GABAergic neuron, according to PAT-Seq analysis using Punc-47-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:GABAergic-neuron_expressed
  Transcripts expressed in hypodermis, according to PAT-Seq analysis using Pdpy-7-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:hypodermis_expressed
  Transcripts that showed significantly decreased expression in atfs-1(cmh15) (null allele) animals comparing to in N2 animals at L4 larva stage. edgeR, fold change > 2, FDR < 0.05 WBPaper00060909:atfs-1(cmh15)_downregulated
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Transcripts expressed in NMDA neuron, according to PAT-Seq analysis using Pnmr-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:NMDA-neuron_expressed
  Genes with expression level regulated by genotype (N2 vs CB4856) and age at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_age_regulated_developing
  Genes with expression level regulated by genotype (N2 vs CB4856) at L3 larva and Late reproduction stage (96 hours at 24 centigrade). For model 2, authors used 100 permutations to estimate the FDR threshold. Per permutation, genotypes and ages were independently randomly distributed, keeping the among-gene structure intact. Then for each spot (23,232) on the array, model 2 was tested. The obtained P-values were used to estimate a threshold for each of the explanatory factors. Authors also used a genome-wide threshold of -log10 P-value = 2, which resembles an FDR of 0.072 and 0.060 for marker and the interaction age-marker for the developing worms and FDR of 0.050 and 0.065 for marker and age-marker for the aging worms. For the physiological age effect, authors used a log10 P-value = 8 in developing worms (0.012 FDR) and -log10 P-value = 6 (0.032 FDR). WBPaper00040858:eQTL_regulated_developing
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Transcripts that showed significantly increased expression in sin-3(tm1276) comparing to in N2. DESeq2, fold change > 2, p-value < 0.01. WBPaper00061203:sin-3(tm1276)_upregulated
  Transcripts that showed significantly increased expression in mrg-1(qa6200) comparing to in control animals in primordial germ cells (PGCs) at L1 larva stage. DESeq2(v1.32.0), FDR < 0.05. WBPaper00064315:mrg-1(qa6200)_upregulated_PGCs
  Transcripts that showed significantly increased expression in aak-1(tm1944);aak-2(ok524) animals comparing to in N2. DEseq 1.18.0, adjusted p-value < 0.05. WBPaper00056471:aak-1(tm1944);aak-2(ok524)_upregulated
  Transcripts that showed significantly changed expression in 6-day post-L4 adult hermaphrodite comparing to in 1-day post L4 adult hermaphrodite animals. Sleuth WBPaper00051558:aging_regulated
  Transcripts that showed significantly decreased expression in sin-3(tm1276) comparing to in N2 at early embryo when there were only 3 -5 eggs in the adult. DESeq2, fold change > 2, adjusted p-value < 0.01 WBPaper00058598:sin-3(tm1276)_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:A-class-motor-neurons_L2-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L2 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:all-neurons_L2-larva_expressed
  Proteins that showed significantly decreased expression after 1-day-old wild type adults were exposed to cisplatin (300ug per mL) for 6 hours. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:Cisplatin_downregulated_WT
  Proteins that showed significantly decreased expression in 1-day-old sek-1(km4) adults comparing to in wild type animals, both with 6 hours of cisplatin treatment. The differential expression analysis was performed in R. Differentially expressed proteins were identified by using a two-sided t-test on log-transformed data. WBPaper00065373:sek-1(km4)_downregulated_cisplatin
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed

11 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
Strain: BC11357 [soc-2::gfp] transcriptional fusion. PCR products were amplified using primer A: 5' [TGACGTCTCCGTGATCCAT] 3' and primer B 5' [CGCAGCCTTCAGACACAGT] 3'. Expr5002 Adult Expression: intestine; rectal epithelium; Reproductive System; uterus; spermatheca; body wall muscle; hypodermis; seam cells; Nervous System; nerve ring; ventral nerve cord; head neurons; Larval Expression: intestine; rectal epithelium; Reproductive System; developing vulva; developing spermatheca; body wall muscle; hypodermis; seam cells; Nervous System; nerve ring; ventral nerve cord; head neurons;  
Strain: BC15418 [soc-2::gfp] transcriptional fusion. PCR products were amplified using primer A: 5' [TGACGTCTCCGTGATCCAT] 3' and primer B 5' [CGCAGCCTTCAGACACAGT] 3'. Expr5003 Adult Expression: intestine; rectal epithelium; Reproductive System; vulval muscle; vulva other; body wall muscle; hypodermis; Nervous System; nerve ring; ventral nerve cord; head neurons; unidentified cells in tail ; Larval Expression: intestine; rectal epithelium; Reproductive System; developing vulva; body wall muscle; hypodermis; Nervous System; nerve ring; ventral nerve cord; head neurons; unidentified cells in tail ;  
    Expr2034228 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1019375 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr1032457 Tiling arrays expression graphs  
Original chronogram file: chronogram.1546.xml [AC7.2:gfp] transcriptional fusion. Chronogram529    
    Expr2015993 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1142810 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
    Expr1145127 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  
Original chronogram file: chronogram.236.xml [AC7.2:gfp] transcriptional fusion. Chronogram1237    
    Expr1017803 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  

7 GO Annotation

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables

31 Homologues

Type
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00004929 5127041 5152491 -1

7 Ontology Annotations

Annotation Extension Qualifier
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  enables

0 Regulates Expr Cluster

1 Sequence

Length
25451

1 Sequence Ontology Term