LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 5 of total 5

Search options

  1. Article: Genome-wide profiling of genetic variation at tandem repeat from long reads.

    Jam, Helyaneh Ziaei / Zook, Justin M / Javadzadeh, Sara / Park, Jonghun / Sehgal, Aarushi / Gymrek, Melissa

    bioRxiv : the preprint server for biology

    2024  

    Abstract: Tandem repeats are frequent across the human genome, and variation in repeat length has been linked to a variety of traits. Recent improvements in long read sequencing technologies have the potential to greatly improve TR analysis, especially for long or ...

    Abstract Tandem repeats are frequent across the human genome, and variation in repeat length has been linked to a variety of traits. Recent improvements in long read sequencing technologies have the potential to greatly improve TR analysis, especially for long or complex repeats. Here we introduce LongTR, which accurately genotypes tandem repeats from high fidelity long reads available from both PacBio and Oxford Nanopore Technologies. LongTR is freely available at https://github.com/gymrek-lab/longtr.
    Language English
    Publishing date 2024-01-23
    Publishing country United States
    Document type Preprint
    DOI 10.1101/2024.01.20.576266
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  2. Article ; Online: WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans

    Lundström, Oxana (Sachenkova) / Adriaan Verbiest, Max / Xia, Feifei / Jam, Helyaneh Ziaei / Zlobec, Inti / Anisimova, Maria / Gymrek, Melissa

    Journal of Molecular Biology. 2023 Sept. 07, p.168260-

    2023  , Page(s) 168260–

    Abstract: Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic ... ...

    Abstract Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteristics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.
    Keywords Internet ; autism ; bioinformatics ; colorectal neoplasms ; data collection ; databases ; gene expression ; genetic variation ; genome ; genome-wide association study ; genomics ; humans ; microsatellite repeats ; molecular biology ; risk ; human genetic variation ; short tandem repeats ; database ; API ; web portal
    Language English
    Dates of publication 2023-0907
    Publishing place Elsevier Ltd
    Document type Article ; Online
    Note Pre-press version
    ZDB-ID 80229-3
    ISSN 1089-8638 ; 0022-2836
    ISSN (online) 1089-8638
    ISSN 0022-2836
    DOI 10.1016/j.jmb.2023.168260
    Database NAL-Catalogue (AGRICOLA)

    More links

    Kategorien

  3. Article ; Online: WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans.

    Lundström, Oxana Sachenkova / Adriaan Verbiest, Max / Xia, Feifei / Jam, Helyaneh Ziaei / Zlobec, Inti / Anisimova, Maria / Gymrek, Melissa

    Journal of molecular biology

    2023  Volume 435, Issue 20, Page(s) 168260

    Abstract: Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic ... ...

    Abstract Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hypervariable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteristics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.
    MeSH term(s) Humans ; Computational Biology ; Genotype ; Microsatellite Repeats/genetics ; Genome, Human ; Databases, Genetic ; Genome-Wide Association Study ; Genetic Variation ; Datasets as Topic ; Colorectal Neoplasms/genetics
    Language English
    Publishing date 2023-09-07
    Publishing country Netherlands
    Document type Journal Article ; Research Support, N.I.H., Extramural ; Research Support, Non-U.S. Gov't
    ZDB-ID 80229-3
    ISSN 1089-8638 ; 0022-2836
    ISSN (online) 1089-8638
    ISSN 0022-2836
    DOI 10.1016/j.jmb.2023.168260
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Article: Benchmarking of small and large variants across tandem repeats.

    English, Adam / Dolzhenko, Egor / Jam, Helyaneh Ziaei / Mckenzie, Sean / Olson, Nathan D / De Coster, Wouter / Park, Jonghun / Gu, Bida / Wagner, Justin / Eberle, Michael A / Gymrek, Melissa / Chaisson, Mark J P / Zook, Justin M / Sedlazeck, Fritz J

    bioRxiv : the preprint server for biology

    2023  

    Abstract: Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits, and are linked to over 60 disease phenotypes. However, their complexity often excludes them from at-scale studies due to challenges with ... ...

    Abstract Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits, and are linked to over 60 disease phenotypes. However, their complexity often excludes them from at-scale studies due to challenges with variant calling, representation, and lack of a genome-wide standard. To promote TR methods development, we create a comprehensive catalog of TR regions and explore its properties across 86 samples. We then curate variants from the GIAB HG002 individual to create a tandem repeat benchmark. We also present a variant comparison method that handles small and large alleles and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ∼24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 TR benchmark. We work with the GIAB community to demonstrate the utility of this benchmark across short and long read technologies.
    Language English
    Publishing date 2023-11-01
    Publishing country United States
    Document type Preprint
    DOI 10.1101/2023.10.29.564632
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  5. Article: A deep population reference panel of tandem repeat variation.

    Jam, Helyaneh Ziaei / Li, Yang / DeVito, Ross / Mousavi, Nima / Ma, Nichole / Lujumba, Ibra / Adam, Yagoub / Maksimov, Mikhail / Huang, Bonnie / Dolzhenko, Egor / Qiu, Yunjiang / Kakembo, Fredrick Elishama / Joseph, Habi / Onyido, Blessing / Adeyemi, Jumoke / Bakhtiari, Mehrdad / Park, Jonghun / Javadzadeh, Sara / Jjingo, Daudi /
    Adebiyi, Ezekiel / Bafna, Vineet / Gymrek, Melissa

    bioRxiv : the preprint server for biology

    2023  

    Abstract: Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3,550 ... ...

    Abstract Tandem repeats (TRs) represent one of the largest sources of genetic variation in humans and are implicated in a range of phenotypes. Here we present a deep characterization of TR variation based on high coverage whole genome sequencing from 3,550 diverse individuals from the 1000 Genomes Project and H3Africa cohorts. We develop a method, EnsembleTR, to integrate genotypes from four separate methods resulting in high-quality genotypes at more than 1.7 million TR loci. Our catalog reveals novel sequence features influencing TR heterozygosity, identifies population-specific trinucleotide expansions, and finds hundreds of novel eQTL signals. Finally, we generate a phased haplotype panel which can be used to impute most TRs from nearby single nucleotide polymorphisms (SNPs) with high accuracy. Overall, the TR genotypes and reference haplotype panel generated here will serve as valuable resources for future genome-wide and population-wide studies of TRs and their role in human phenotypes.
    Language English
    Publishing date 2023-03-12
    Publishing country United States
    Document type Preprint
    DOI 10.1101/2023.03.09.531600
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

To top