WormMine

WS295

Intermine data mining platform for C. elegans and related nematodes

Gene :

WormBase Gene ID  ? WBGene00002001 Gene Name  hars-1
Sequence Name  ? T11G6.1 Brief Description  hars-1 encodes the sole histidyl-tRNA synthetase (HisRS), class II aminoacyl-tRNA synthetases that catalyze the attachment of histidine to its cognate tRNAs and are thus required for protein biosynthesis; in C. elegans, hars-1 is required for larval and germline development, and hence normal fertility; hars-1(RNAi) animals exhibit a reduced numbers of functional germ cells which is partially suppressed by ced-4 mutations, and a hars-1 loss-of-function mutation exhibits arrest at the L2 larval stage of development; mutations in human HARS2 cause Perrault syndrome, which is characterized by female ovarian dysgenesis and sensorineural hearing loss in both females and males.
Organism  Caenorhabditis elegans Automated Description  Predicted to enable RNA binding activity; histidine-tRNA ligase activity; and identical protein binding activity. Involved in germ cell development; nematode larval development; and translation. Predicted to be located in cytosol and mitochondrion. Used to study Perrault syndrome and sensory peripheral neuropathy. Human ortholog(s) of this gene implicated in Charcot-Marie-Tooth disease, axonal type 2W; Perrault syndrome; and Usher syndrome type 3B. Is an ortholog of human HARS1 (histidyl-tRNA synthetase 1) and HARS2 (histidyl-tRNA synthetase 2, mitochondrial).
Biotype  SO:0001217 Genetic Position  IV :4.72237 ±0.002487
Length (nt)  ? 2328
Quick Links:
 
Quick Links:
 

1 Organism

Name Taxon Id
Caenorhabditis elegans 6239

1 Synonyms

Value
WBGene00002001

Genomics

2 Transcripts

WormMine ID Sequence Name Length (nt) Chromosome Location
Transcript:T11G6.1a.1 T11G6.1a.1 1795   IV: 10858957-10861284
Transcript:T11G6.1b.1 T11G6.1b.1 1777   IV: 10859317-10861282
 

Other

2 CDSs

WormMine ID Sequence Name Length (nt) Chromosome Location
CDS:T11G6.1a T11G6.1a 1575   IV: 10858968-10859075
CDS:T11G6.1b T11G6.1b 1566   IV: 10859321-10859419

19 RNAi Result

WormBase ID
WBRNAi00095726
WBRNAi00107944
WBRNAi00091058
WBRNAi00053133
WBRNAi00093130
WBRNAi00071458
WBRNAi00097766
WBRNAi00027694
WBRNAi00009158
WBRNAi00026275
WBRNAi00110433
WBRNAi00111851
WBRNAi00096898
WBRNAi00112019
WBRNAi00110431
WBRNAi00117478
WBRNAi00110434
WBRNAi00096959
WBRNAi00110896

31 Allele

Public Name
gk964278
gk964078
gk964500
gk962765
gk963382
gk567320
gk651897
gk334651
gk869797
gk404399
WBVar02123821
gk790862
WBVar02123246
gk511206
gk371869
gk631174
gk543219
gk337678
gk838277
gk653289
tm4074
WBVar02121214
WBVar02058824
gk542524
gk636289
gk665723
gk211584
WBVar01857502
WBVar02122145
WBVar01857503

1 Chromosome

WormBase ID Organism Length (nt)
IV Caenorhabditis elegans 17493829  

1 Chromosome Location


Feature . Primary Identifier
Start End Strand
WBGene00002001 10858957 10861284 1

4 Data Sets

Name URL
WormBaseAcedbConverter  
GO Annotation data set  
C. elegans genomic annotations (GFF3 Gene)  
Panther orthologue and paralogue predictions  

1 Downstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_10861285..10861401   117 IV: 10861285-10861401 Caenorhabditis elegans

110 Expression Clusters

Regulated By Treatment Description Algorithm Primary Identifier
  oocyte proteins identified by two or more unique peptides during proteomics study. In the pooled data set, 1453 C. elegans proteins were identified with a probability >= 0.9 according to ProteinProphet, of which 1165 proteins were identified by more than one unique peptide. WBPaper00038289:oocyte_protein
  Transcripts expressed in neuronal cells, by analyzingfluorescence-activated cell sorted (FACS) neurons. DESeq. False discovry rate (FDR) < 0.1. WBPaper00048988:neuron_expressed
adult vs dauer larva Transcripts that showed differential expression in adult vs dauer lava in N2 animals at 20C. N.A. WBPaper00050488:adult_vs_dauer_regulated_N2_20C
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in N2 after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_N2
  Proteins interacting with NHR-49-GFP according to co-IP and LC-MS. N.A. WBPaper00064071:NHR-49_interacting
  Transcripts expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_expressed
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 6h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_6h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 10 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.1mix_downregulated_12h
Bacteria infection: Bacillus thuringiensis mRNAs that showed significantly decreased expression after pathogenic bacteria Bacillus thuringiensis infections comparing to non pathogenic BT (BT247(1 to 2 mix) vs BT407 12h), according to RNAseq. Cuffdiff, ajusted p-value < 0.01. WBPaper00046497:B.thuringiensis_0.5mix_downregulated_12h
  Maternal class (M): genes that are called present in at least one of the three PC6 replicates. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_M
  Strictly maternal class (SM): genes that are the subset of maternal genes that are not also classified as embryonic. A modified Welch F statistic was used for ANOVA. For each gene, regressed error estimates were substituted for observed error estimates. The substitution is justified by the lack of consistency among the most and least variable genes at each time point. Regressed error estimates were abundance-dependent pooled error estimates that represented a median error estimate from a window of genes of similar abundance to the gene of interest. A randomization test was used to compute the probability Pg of the observed F statistic for gene g under the null hypothesis that developmental time had no effect on expression. P-values were not corrected for multiple testing. [cgc5767]:expression_class_SM
  Transcripts depleted in purified oocyte P bodies comparing to in whole oocytes. DESeq2, FDR < 0.05, fold change > 2. WBPaper00065975:P-body_vs_oocyte_depleted
  Transcripts detected in body muscle nuclei according to a nuclear FACS-based strategy. Cufflinks WBPaper00065120:body-muscle-transcriptome
Temprature shift to 28C for 24 hours. Transcripts that showed significantly decreased expression after animals were exposed to 28C temperature for 24 hours. Differentially expressed genes wereidentified using DESeq (v.1.18.0) by normalizing readsbased on the negative binomial distribution method andcomparing each HS timepoint to the 0-h control. WBPaper00061341:28C_24h_downregulated
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:dopaminergic-neurons_L3-L4-larva_expressed
  Genes that showed expression levels higher than the corresponding reference sample (L3/L4 all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:hypodermis_L3-L4-larva_expressed
  Transcripts that showed significantly decreased expression in eat-2(ad1116) comparing to in N2 at 3-days post L4 adult hermaphrodite animals. DESeq2(v1.14.1), fold change > 2, p-value < 0.05 WBPaper00055354:eat-2(ad1116)_downregulated
  Transcripts detected in germline isolated from day-1 adult hermaphrodite animals. All three experiments have CPM >= 1. WBPaper00067147:germline_expressed
  Genes that were not enriched in either spermatogenic fem-3(q96gf) nor oogenic fog-2(q71) gonads, according to RNAseq analysis. To identify differentially expressed transcripts, authors used R/Bioconductor package DESeq. WBPaper00045521:Gender_Neutral
Starvation Transcripts that showed significantly altered expression by starvation with 100 mM salt (NaCl) DESeq(version 1.10.1), FDR < 0.05. WBPaper00050726:starvation_regulated_LowSalt
  Proteins identified in extracellular vesicle. N.A. WBPaper00062669:extracellular-vesicle_protein
Starvation 48 hours at L1 arrest Transcripts that showed significantly increased expression in starved N2 animals (48 hours at L1 arrest) Fold change > 2. WBPaper00064005:starvation_upregulated_N2_mRNA
Bacteria infection: Enterococcus faecalis OG1RF. Exposure for 16 hours. Transcripts that showed significantly decreased expression in hpx-2(dg047) after animals were exposed to E. faecalis OG1RF for 16 hours comparing to exposure to E. Coli OP50. Cuffcompare and Cuffdiff WBPaper00056090:E.faecalis_downregulated_hpx-2(dg047)
  Genes found to be regulated by low-copy overexpression of sir-2.1 with p < 0.014. N.A. WBPaper00026929:sir-2.1_overexpression_regulated
  TGF- Dauer pathway adult transcriptional targets. Results obtained by comparing the microarray results of the dauer-constitutive mutants daf-7(e1372), daf-7(m62), and daf-1(m40) with dauer-defective mutants daf-3(mgDf90), daf-5(e1386), and daf-7(e1372);daf-3(mgDf90) double mutants at the permissive temperature, 20C, on the first day of adulthood. SAM WBPaper00031040:TGF-beta_adult_downregulated
  Genes expressed in N2. Expressed transcripts were identified on the basis of a Present call in 3 out of 4 N2 experiments as determined by Affymetrix MAS 5.0. WBPaper00025141:N2_Expressed_Genes
  Transcripts unqiuely expressed in intestine, according to PAT-Seq analysis using Pges-1-GFP-3XFLAG mRNA tagging. Cufflinks FPKM value >=1. WBPaper00050990:intestine_enriched
  Transcripts that showed significantly increased expression in emb-4(hc60) comparing to in N2. DESeq2 WBPaper00052884:emb-4(hc60)_upregulated
  Genes that showed expression levels higher than the corresponding reference sample (Young adult all cell reference). A Mann-Whitney U test with an empirical background model and FDR correction for multiple testing was used to detect expressed transcripts (Benjamini and Hochberg 1995). Genes and TARs with an FDR <= 0.05 were reported as expressed above background. Authors detected differentially expressed transcripts using a method based on linear models. Genes and TARs were called differentially expressed if the FDR was <= 0.05 and the fold change (FC) >= 2.0. To more strictly correct for potential false-positives resulting from multiple sample comparisons, authors divided individual FDR estimates by the number of samplesor sample comparisons, respectively. This resulted in an adjusted FDR of 1.3 * 0.0001 for expression above background and of 7.4 * 0.0001 for differential expression. Authors called genes selectively enriched in a given tissue if they met the following requirements: (1) enriched expression in a given tissue (FDR <= 0.05 and FC >= 2.0), (2) fold change versus reference among the upper 40% of the positive FC range observed for this gene across all tissues, and (3) fold-change entropy among the lower 40% of the distribution observed for all genes. WBPaper00037950:CEP-sheath-cells_Day1-adult_expressed
  Transcripts enriched in germline by comparing dissected germline tissue with dissected intestine tissue, both injected with empty RNAi vector. Genes were determined germline-enriched if the lowest expression value (log2(FPKM+1)) observed in the germline empty vector samples was at least 2-fold higher than the highest expression value observed in the intestine empty vector samples. WBPaper00051039:germline_enriched

5 Expression Patterns

Remark Reporter Gene Primary Identifier Pattern Subcellular Localization
    Expr1031166 Tiling arrays expression graphs  
    Expr2012370 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1014355 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/levin2012  
    Expr2030606 Single cell embryonic expression. Only cell types with an expression fraction of greater 0.2 of the maximum expressed fraction are labeled (Full data can be downloaded from http://caltech.wormbase.org/pub/wormbase/datasets-published/packer2019/). The colors represent the broad cell class to which the cell type has been assigned. The size of the point is proportional to the log2 of the numbers of cells in the dataset of that cell type. Interactive visualizations are available as a web app (https://cello.shinyapps.io/celegans/) and can also be installed as an R package (https://github.com/qinzhu/VisCello.celegans).  
    Expr1156738 Developmental gene expression time-course. Raw data can be downloaded from ftp://caltech.wormbase.org/pub/wormbase/datasets-published/hashimshony2015  

24 GO Annotation

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables
  involved_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

9 Homologues

Type
orthologue
orthologue
least diverged orthologue
least diverged orthologue
orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue
least diverged orthologue

1 Locations


Feature . Primary Identifier
Start End Strand
WBGene00002001 10858957 10861284 1

24 Ontology Annotations

Annotation Extension Qualifier
  located_in
  located_in
  located_in
  located_in
  enables
  enables
  enables
  enables
  involved_in
  involved_in
  enables
  enables
  enables
  enables
  enables
  enables
  involved_in
  located_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in
  involved_in

0 Regulates Expr Cluster

1 Sequence

Length
2328

1 Sequence Ontology Term

Identifier Name Description
gene  

0 Strains

1 Upstream Intergenic Region

WormBase ID Name Sequence Name Length (nt) Chromosome Location Organism
intergenic_region_chrIV_10858497..10858956   460 IV: 10858497-10858956 Caenorhabditis elegans