SinEx DB 2.0 update 2020: database for eukaryotic single-exon coding sequences

Research output: Contribution to journalJournal articleResearchpeer-review

Documents

  • baab002

    Final published version, 1.29 MB, PDF document

Single-exon coding sequences (CDSs), also known as 'single-exon genes' (SEGs), are defined as nuclear, protein-coding genes that lack introns in their CDSs. They have been studied not only to determine their origin and evolution but also because their expression has been linked to several types of human cancers and neurological/developmental disorders, and many exhibit tissue-specific transcription. We developed SinEx DB that houses DNA and protein sequence information of SEGs from 10 mammalian genomes including human. SinEx DB includes their functional predictions (KOG (euKaryotic Orthologous Groups)) and the relative distribution of these functions within species. Here, we report SinEx 2.0, a major update of SinEx DB that includes information of the occurrence, distribution and functional prediction of SEGs from 60 completely sequenced eukaryotic genomes, representing animals, fungi, protists and plants. The information is stored in a relational database built with MySQL Server 5.7, and the complete dataset of SEG sequences and their GO (Gene Ontology) functional assignations are available for downloading. SinEx DB 2.0 was built with a novel pipeline that helps disambiguate single-exon isoforms from SEGs. SinEx DB 2.0 is the largest available database for SEGs and provides a rich source of information for advancing our understanding of the evolution, function of SEGs and their associations with disorders including cancers and neurological and developmental diseases.

Original languageEnglish
Article numberbaab002
JournalDatabase: The Journal of Biological Databases and Curation
Volume2021
Number of pages5
ISSN1758-0463
DOIs
Publication statusPublished - 2021

    Research areas

  • INTRONLESS, EXPRESSION, EVOLUTION

Number of downloads are based on statistics from Google Scholar and www.ku.dk


No data available

ID: 272171448