KIAA0895 is a protein that in humans (Homo sapiens) is encoded by the KIAA0895 gene. Its aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik.[5] The gene is located at 7p14.2.[6]

KIAA0895
Identifiers
Aliases: KIAA0895, 9530077C05Rik, 1110003N12Rik, Kiaa0895, mKIAA0895
External IDs: MGI: 1915533; HomoloGene: 19280; GeneCards: KIAA0895; OMA: KIAA0895 - orthologs
Orthologs
Species: Human, Mouse
RefSeq (mRNA): NM_026739, NM_183273, NM_001378982, NM_001378983
RefSeq (protein): NP_081015, NP_001365911, NP_001365912
Location (UCSC): Chr 7: 36.32 – 36.39 Mb (human); Chr 9: 22.32 – 22.36 Mb (mouse)
PubMed search: [3][4]

Research into the KIAA proteins has shown that they resemble the products of known genes with functions related to cell signaling/communication, cell structure/motility and nucleic acid management.[7]

Gene


Locus


The KIAA0895 gene is located at 7p14.2.[6] The genomic DNA is 65,976 base pairs long,[8] and the longest mRNA that it produces is 4,463 bases long.

It can be transcribed into 15 transcript variants, which in turn can produce 13 different isoforms of the protein.[9]

 
The location of the KIAA0895 gene on chromosome 7

Gene neighborhood


KIAA0895 is surrounded by the following genes on chromosome 7:[8]

Size of gene


The gene encoding the KIAA0895 protein is 65,975 nucleotides long, spanning nucleotides 36324150 to 36390125, and has seven exons.
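The two gene lengths quoted in this article (65,976 base pairs in the Locus section and 65,975 nucleotides here) differ by one, which is what would be expected if one figure counts both endpoint positions and the other is the plain difference between the coordinates. The following minimal sketch illustrates the arithmetic; the function and constant names are illustrative and do not come from any cited source.

```python
# Coordinates quoted in the text (chromosome 7). Names are illustrative only.
GENE_START = 36_324_150
GENE_END = 36_390_125


def span_exclusive(start: int, end: int) -> int:
    """Plain difference between two coordinates (end minus start)."""
    return end - start


def span_inclusive(start: int, end: int) -> int:
    """Number of positions when both endpoints are counted."""
    return end - start + 1


print(span_exclusive(GENE_START, GENE_END))  # 65975
print(span_inclusive(GENE_START, GENE_END))  # 65976
```

Which convention a given database uses is not stated in the sources cited here, so the explanation above is an assumption rather than a documented fact.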

mRNA


NCBI lists ten protein isoforms for KIAA0895:[6]

  • NP_001093895.1
  • EAW94064.1
  • NP_056129.2
  • NP_001186636.1
  • NP_001186635.1
  • XP_005249746.1
  • EAW94065.1
  • XP_024302470.1
  • NP_001186637.1
  • NP_001287885.1

Protein


The longest protein isoform produced by the KIAA0895 gene is termed LOC23366 isoform 1 and is 520 amino acids long.[10] Its predicted molecular weight is 61 kDa, and its theoretical isoelectric point is 10.[11]
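A predicted molecular weight of this kind is typically obtained by summing the average masses of the residues in the sequence and adding one water molecule for the free termini. The sketch below shows that calculation under stated assumptions: the residue masses are standard average values, the example peptide is hypothetical (it is not the KIAA0895 sequence, which is not reproduced in this article), and the function name is illustrative rather than the interface of the cited ExPASy tool.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight(sequence: str) -> float:
    """Approximate average molecular weight of an unmodified peptide, in Da."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER_MASS


# Hypothetical peptide used purely for illustration.
example_peptide = "MKRRKLSE"
print(f"{molecular_weight(example_peptide):.1f} Da")

# Average mass per residue implied by the figures in the text (61 kDa, 520 aa).
print(f"{61_000 / 520:.1f} Da per residue")
```

The figures quoted in the text imply an average of roughly 117 Da per residue.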

Amino acid composition


KIAA0895 is semi-enriched in the positively charged residues lysine and arginine, and in charged residues overall (lysine, arginine, glutamic acid and aspartic acid).[12] It is semi-depleted in the non-polar residues alanine, glycine and proline.[12]

The charge distribution analysis shows no negative or mixed charge clusters, but it does show one positive charge cluster from amino acids 12 to 36.[12]
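The idea behind a composition and charge-cluster analysis can be illustrated with a short sketch that computes the fraction of charged residues and scans for windows dense in lysine and arginine. This is a simplified sliding-window count, not the statistical method used by SAPS; the window size, threshold, function names, and example sequence are all assumptions chosen for illustration.

```python
POSITIVE = set("KR")  # lysine, arginine
NEGATIVE = set("DE")  # aspartic acid, glutamic acid


def charged_fractions(sequence: str) -> tuple[float, float]:
    """Return (positive fraction, negative fraction) of a protein sequence."""
    seq = sequence.upper()
    positive = sum(aa in POSITIVE for aa in seq)
    negative = sum(aa in NEGATIVE for aa in seq)
    return positive / len(seq), negative / len(seq)


def positive_windows(sequence: str, window: int = 10, min_positive: int = 5):
    """Return 1-based (start, end) windows holding at least min_positive K/R."""
    seq = sequence.upper()
    hits = []
    for i in range(len(seq) - window + 1):
        if sum(aa in POSITIVE for aa in seq[i:i + window]) >= min_positive:
            hits.append((i + 1, i + window))
    return hits


# Hypothetical sequence, not the KIAA0895 protein.
toy_sequence = "MSTAGRKRRKGRKLSEDAGLPAVE"
print(charged_fractions(toy_sequence))
print(positive_windows(toy_sequence))
```

Overlapping windows returned by such a scan would normally be merged into a single reported cluster.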

Domain of unknown function 1704
Identifiers
Symbol: DUF1704
Pfam: PF08014
InterPro: IPR012548

Regions


LOC23366 contains a protein domain of unknown function called DUF1704.[13] It also contains a region of low complexity from position 120 to position 150 in the protein,[14] and an arginine-rich area from position 12 to position 51.[15]

Promoters


A promoter region sequence was identified using ElDorado by Genomatix.[16] The most likely promoter for KIAA0895 spans positions 36389926 to 36391149, a length of 1,224 base pairs.[16]
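The promoter length follows from its coordinates when both endpoints are counted, and the same coordinates can be compared with the gene span given earlier in the article. The sketch below performs both calculations. It assumes that the promoter and gene coordinates refer to the same genome assembly, which the sources cited here do not state; the names are illustrative.

```python
# Coordinates quoted in the text. Treating them as belonging to the same
# assembly is an assumption made for this illustration.
GENE = (36_324_150, 36_390_125)
PROMOTER = (36_389_926, 36_391_149)


def length_inclusive(interval: tuple[int, int]) -> int:
    """Number of positions in an interval, counting both endpoints."""
    start, end = interval
    return end - start + 1


def overlap_inclusive(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


print(length_inclusive(PROMOTER))         # 1224
print(overlap_inclusive(GENE, PROMOTER))  # 200
```

Under that assumption, the predicted promoter overlaps the last 200 positions of the annotated gene span and extends beyond it.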

 
The predicted serine, threonine, and tyrosine phosphorylation sites of the KIAA0895 protein

Post-translational modification


KIAA0895 is predicted to undergo phosphorylation at several serines, threonines, and tyrosines throughout its structure.[17] Phosphorylation at these sites may regulate the protein's activity. In many enzymes and receptors, phosphorylation induces a conformational change that activates or deactivates them.

Stem loops


The stem-loop structure predicted for the 5' UTR by the Mfold web server shows a lack of conservation, which suggests that these stems and loops are not used for translation regulation.[18] Its predicted ΔG was -12.90 kcal/mol, with three loops.[18] The stem-loop structure predicted for the 3' UTR with the same tool is conserved, which suggests that it may be used for translation regulation.[18] Its predicted ΔG was -636.80 kcal/mol.[18]

Tertiary structure


KIAA0895 is predicted to have a tertiary structure containing alpha helices and beta sheets.[19]

 
Proposed Tertiary Structure for KIAA0895.
 
Proposed Tertiary Structure for 24% of KIAA0895. Image coloured by rainbow N → C terminus.

Interacting proteins


Three proteins are likely to interact with KIAA0895: ELAVL1,[20] VATA,[21] and GLYM.[21] Experimental evidence for these interactions is reported in the cited databases.

Expression


KIAA0895 is most highly expressed in the testis; it is also strongly expressed in the kidneys, adrenal glands, and brain.[6]

 
RNA-seq tissue data is reported as mean TPM (transcripts per million).
 
The mean RPKM values for KIAA0895 in 27 different tissue types.
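The two figures above use different units, TPM and RPKM. Within a single sample, RPKM values can be rescaled to TPM by dividing each gene's RPKM by the sum of RPKM over all genes in that sample and multiplying by one million. The sketch below shows the conversion with made-up numbers; the function name is illustrative, and the values are not KIAA0895 data.

```python
def rpkm_to_tpm(rpkm_values: list[float]) -> list[float]:
    """Rescale the RPKM values of all genes in ONE sample so they sum to 1e6."""
    total = sum(rpkm_values)
    if total == 0:
        return [0.0 for _ in rpkm_values]
    return [value / total * 1_000_000 for value in rpkm_values]


# Made-up RPKM values for four genes in a single hypothetical sample.
sample_rpkm = [2.0, 10.0, 0.5, 37.5]
sample_tpm = rpkm_to_tpm(sample_rpkm)
print(sample_tpm)
print(sum(sample_tpm))
```

The conversion needs the values for every gene in the sample, so a single gene's mean RPKM across tissues cannot be turned into TPM on its own.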

Per health state


An EST profile from NCBI indicates that KIAA0895 is strongly expressed in cervical tumors and bladder carcinoma.[22]

 
An EST profile was created by NCBI. EST profiles show approximate gene expression patterns as inferred from EST counts and the cDNA library sources.


Homologs and orthologs


KIAA0895 has more than 228 orthologs,[6] which have been found in mammals and other eukaryotes.[23] Homologs are listed for 9 species.[6] The full list of organisms in which homologs have been found is given below.

Paralogs


KIAA0895 has 7 paralogs in Homo sapiens:[23]

  • Unnamed protein product
  • Uncharacterized protein KIAA0895-like
  • hCG28832, isoform CRA_a
  • Hypothetical protein
  • Unnamed protein product
  • hCG38687, isoform CRA_a, partial
  • hCG38687, isoform CRA_b

References

  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000164542. Ensembl, May 2017
  2. ^ a b c GRCm38: Ensembl release 89: ENSMUSG00000036411. Ensembl, May 2017
  3. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. ^ "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. ^ "GeneCards: KIAA0895 Gene". Retrieved 2011-04-24.
  6. ^ a b c d e f "KIAA0895 KIAA0895 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-24.
  7. ^ Nagase T, Ishikawa K, Suyama M, Kikuno R, Hirosawa M, Miyajima N, Tanaka A, Kotani H, Nomura N, Ohara O (December 1998). "Prediction of the coding sequences of unidentified human genes. XII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro". DNA Research. 5 (6): 355–64. doi:10.1093/dnares/5.6.355. PMID 10048485.
  8. ^ a b "Entrez Gene: KIAA0895". Retrieved 2011-04-24.
  9. ^ "NCBI AceView: KIAA0895". Retrieved 2011-04-24.
  10. ^ "NCBI Protein: LOC23366 isoform 1". Retrieved 2011-05-09.
  11. ^ a b "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2019-04-17.
  12. ^ a b c d e "SAPS < Sequence Statistics < EMBL-EBI". www.ebi.ac.uk. Retrieved 2019-04-17.
  13. ^ "Wellcome Trust Sanger Institute, Pfam". Archived from the original on 2014-06-17. Retrieved 2011-05-09.
  14. ^ "MyHits Dotlet". Retrieved 2011-05-09.
  15. ^ "Uniprot". Retrieved 2011-05-09.
  16. ^ a b "ElDorado Introduction". www.genomatix.de. Archived from the original on 2016-06-02. Retrieved 2019-05-13.
  17. ^ "DTU Center for Biological Sciences, NetPhos". Retrieved 2011-05-09.
  18. ^ a b c d "The Mfold Web Server | mfold.rit.albany.edu". unafold.rna.albany.edu. Retrieved 2019-04-17.
  19. ^ "PHYRE2 Protein Fold Recognition Server". www.sbg.bio.ic.ac.uk. Retrieved 2019-04-20.
  20. ^ "mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2019-04-20.
  21. ^ a b c "2 binary interactions found for search term KIAA0895". IntAct Molecular Interaction Database. Retrieved 2019-04-20.
  22. ^ "Tissue expression of KIAA0895 - Summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2019-05-13.
  23. ^ a b "NCBI BLAST". Retrieved 2011-05-09.