Timothée Cezard

European Bioinformatics Institute

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Trends

Author Order

Document Type

Co-Authors

Nina Thiessen

Berlin Institute of Health at Charité - Universitätsmedizin Berlin

Steven J.M. Jones

Children's Hospital of Philadelphia

Martin Hirst

University of British Columbia

Karim Gharbi

University of Edinburgh

Marco A. Marra

Canada's Michael Smith Genome Sciences Centre

Richard Varhol

Curtin University

Ryan D. Morin

Simon Fraser University

Kristen M. Smith

Amgen (United States)

Freda D. Miller

University of Toronto

Loen M. Hansford

Children's Cancer Institute Australia

Cooperative Institutions

BC Cancer Agency

University of British Columbia

European Bioinformatics Institute

Canada's Michael Smith Genome Sciences Centre

University of Edinburgh

Wellcome Sanger Institute

University College London

University of Toronto

Wellcome Trust

Genome British Columbia

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

Identification and characterization of Hoxa9 binding sites in hematopoietic cells

Blood (2011)

Yongsheng Huang Kajal Sitwala Joel C. Bronstein Daniel Steven Sanders Monisha Dandekar

10.1182/blood-2011-03-341081

Cite

Citations (183)

Mass production of SNP markers in a nonmodel passerine bird through RAD sequencing and contig mapping to the zebra finch genome

Molecular Ecology Resources (2013)

Yann Bourgeois Émeline Lhuillier Timothée Cezard Joris A. M. Bertrand Boris Delahaie

Abstract Here, we present an adaptation of restriction‐site‐associated DNA sequencing ( RAD ‐seq) to the I llumina H i S eq2000 technology that we used to produce SNP markers in very large quantities at low cost per unit in the R éunion grey white‐eye ( Z osterops borbonicus ), a nonmodel passerine bird species with no reference genome. We sequenced a set of six pools of 18–25 individuals using a single sequencing lane. This allowed us to build around 600 000 contigs, among which at least 386 000 could be mapped to the zebra finch ( T aeniopygia guttata ) genome. This yielded more than 80 000 SNP s that could be mapped unambiguously and are evenly distributed across the genome. Thus, our approach provides a good illustration of the high potential of paired‐end RAD sequencing of pooled DNA samples combined with comparative assembly to the zebra finch genome to build large contigs and characterize vast numbers of informative SNP s in nonmodel passerine bird species in a very efficient and cost‐effective way.

Sequence assembly

10.1111/1755-0998.12137

Cite

Citations (23)

Supplementary Table S1 from System-Level Analysis of Neuroblastoma Tumor–Initiating Cells Implicates AURKB as a Novel Drug Target for Neuroblastoma

Olena Morozova Milijana Vojvodic Natalie Grinshtein Loen M. Hansford Kim M. Blakely

Supplementary Table S1.

Table (database)

10.1158/1078-0432.22440813

Cite

Citations (0)

The effect ofRADallele dropout on the estimation of genetic variation within and between populations

Molecular Ecology (2012)

Mathieu Gautier Karim Gharbi Timothée Cezard Julien Foucaud Carole Kerdelhué

Inexpensive short-read sequencing technologies applied to reduced representation genomes is revolutionizing genetic research, especially population genetics analysis, by allowing the genotyping of massive numbers of single-nucleotide polymorphisms (SNP) for large numbers of individuals and populations. Restriction site-associated DNA (RAD) sequencing is a recent technique based on the characterization of genomic regions flanking restriction sites. One of its potential drawbacks is the presence of polymorphism within the restriction site, which makes it impossible to observe the associated SNP allele (i.e. allele dropout, ADO). To investigate the effect of ADO on genetic variation estimated from RAD markers, we first mathematically derived measures of the effect of ADO on allele frequencies as a function of different parameters within a single population. We then used RAD data sets simulated using a coalescence model to investigate the magnitude of biases induced by ADO on the estimation of expected heterozygosity and F(ST) under a simple demographic model of divergence between two populations. We found that ADO tends to overestimate genetic variation both within and between populations. Assuming a mutation rate per nucleotide between 10(-9) and 10(-8), this bias remained low for most studied combinations of divergence time and effective population size, except for large effective population sizes. Averaging F(ST) values over multiple SNPs, for example, by sliding window analysis, did not correct ADO biases. We briefly discuss possible solutions to filter the most problematic cases of ADO using read coverage to detect markers with a large excess of null alleles.

Effective population size

Tag SNP

10.1111/mec.12089

Cite

Citations (292)

Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing

BMC Genomics (2014)

Serap Gonen Natalie R Lowe Timothée Cezard Karim Gharbi Stephen Bishop

Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families.Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays.This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication.

SNP genotyping

Tag SNP

Genetic linkage

Molecular Inversion Probe

Linkage (software)

10.1186/1471-2164-15-166

Cite

Citations (137)

Exome Sequencing and Linkage Analysis Implicates Two Candidate Genes On Chromosome 3p in Familial Hodgkin Lymphoma

Blood (2012)

Alastair Lawrie Timothée Cezard Dominic Culligan Mark A. Vickers

dbSNP

1000 Genomes Project

SNP array

Tag SNP

Exome

SNP genotyping

10.1182/blood.v120.21.53.53

Cite

Citations (2)

Data from System-Level Analysis of Neuroblastoma Tumor–Initiating Cells Implicates AURKB as a Novel Drug Target for Neuroblastoma

Olena Morozova Milijana Vojvodic Natalie Grinshtein Loen M. Hansford Kim M. Blakely

<div>AbstractPurpose: Neuroblastoma (NB) is an aggressive tumor of the developing peripheral nervous system that remains difficult to cure in the advanced stages. The poor prognosis for high-risk NB patients is associated with common disease recurrences that fail to respond to available therapies. NB tumor-initiating cells (TICs), isolated from metastases and primary tumors, may escape treatment and contribute to tumor relapse. New therapies that target the TICs may therefore prevent or treat tumor recurrences.Experimental Design: We undertook a system-level characterization of NB TICs to identify potential drug targets against recurrent NB. We used next-generation RNA sequencing and/or human exon arrays to profile the transcriptomes of 11 NB TIC lines from six NB patients, revealing genes that are highly expressed in the TICs compared with normal neural crest-like cells and unrelated cancer tissues. We used gel-free two-dimensional liquid chromatography coupled to shotgun tandem mass spectrometry to confirm the presence of proteins corresponding to the most abundant TIC-enriched transcripts, thereby providing validation to the gene expression result.Results: Our study revealed that genes in the BRCA1 signaling pathway are frequently misexpressed in NB TICs and implicated Aurora B kinase as a potential drug target for NB therapy. Treatment with a selective AURKB inhibitor was cytotoxic to NB TICs but not to the normal neural crest-like cells.Conclusion: This work provides the first high-resolution system-level analysis of the transcriptomes of 11 primary human NB TICs and identifies a set of candidate NB TIC-enriched transcripts for further development as therapeutic targets. Clin Cancer Res; 16(18); 4572–82. ©2010 AACR.</div>

10.1158/1078-0432.c.6518082

Cite

Citations (0)

A newly developed genetic sex marker and its application to understanding chemically induced feminisation in roach (Rutilus rutilus)

Molecular Ecology Resources (2020)

Anke Lange Josephine R. Paris Karim Gharbi Timothée Cezard Shinichi Miyagawa

Abstract Oestrogenic wastewater treatment works (WwTW) effluents discharged into UK rivers have been shown to affect sexual development, including inducing intersex, in wild roach ( Rutilus rutilus ). This can result in a reduced breeding capability with potential population level impacts. In the absence of a sex probe for roach it has not been possible to confirm whether intersex fish in the wild arise from genetic males or females, or whether sex reversal occurs in the wild, as this condition can be induced experimentally in controlled exposures to WwTW effluents and a steroidal oestrogen. Using restriction site‐associated DNA sequencing (RAD‐seq), we identified a candidate for a genetic sex marker and validated this marker as a sex probe through PCR analyses of samples from wild roach populations from nonpolluted rivers. We also applied the sex marker to samples from roach exposed experimentally to oestrogen and oestrogenic effluents to confirm suspected phenotypic sex reversal from males to females in some treatments, and also that sex‐reversed males are able to breed as females. We then show, unequivocally, that intersex in wild roach populations results from feminisation of males, but find no strong evidence for complete sex reversal in wild roach at river sites contaminated with oestrogens. The discovered marker has utility for studies in roach on chemical effects, wild stock assessments, and reducing the number of fish used where only one sex is required for experimentation. Furthermore, we show that the marker can be applied nondestructively using a fin clip or skin swab, with animal welfare benefits.

Rutilus

Sex reversal

Sexual Differentiation

10.1111/1755-0998.13166

Cite

Citations (8)

Recommendations for the formatting of Variant Call Format (VCF) files to make plant genotyping data FAIR

F1000Research (2022)

Sebastian Beier Anne Fiebig Cyril Pommier Isuru Liyanage Matthias Lange

In this opinion article, we discuss the formatting of files from (plant) genotyping studies, in particular the formatting of metadata in Variant Call Format (VCF) files. The flexibility of the VCF format specification facilitates its use as a generic interchange format across domains but can lead to inconsistency between files in the presentation of metadata. To enable fully autonomous machine actionable data flow, generic elements need to be further specified. We strongly support the merits of the FAIR principles and see the need to facilitate them also through technical implementation specifications. They form a basis for the proposed VCF extensions here. We have learned from the existing application of VCF that the definition of relevant metadata using controlled standards, vocabulary and the consistent use of cross-references via resolvable identifiers (machine-readable) are particularly necessary and propose their encoding. VCF is an established standard for the exchange and publication of genotyping data. Other data formats are also used to capture variant data (for example, the HapMap and the gVCF formats), but none currently have the reach of VCF. For the sake of simplicity, we will only discuss VCF and our recommendations for its use, but these recommendations could also be applied to gVCF. However, the part of the VCF standard relating to metadata (as opposed to the actual variant calls) defines a syntactic format but no vocabulary, unique identifier or recommended content. In practice, often only sparse descriptive metadata is included. When descriptive metadata is provided, proprietary metadata fields are frequently added that have not been agreed upon within the community which may limit long-term and comprehensive interoperability. To address this, we propose recommendations for supplying and encoding metadata, focusing on use cases from plant sciences. We expect there to be overlap, but also divergence, with the needs of other domains.

Disk formatting

10.12688/f1000research.109080.2

Cite

Citations (4)

Mobilisation and analyses of publicly available SARS-CoV-2 data for pandemic responses

bioRxiv (Cold Spring Harbor Laboratory) (2023)

Nadim Rahman Colman O’Cathail Ahmad Zyoud Alexey Sokolov Bas B. Oude Munnink

Abstract The COVID-19 pandemic has seen large-scale pathogen genomic sequencing efforts, becoming part of the toolbox for surveillance and epidemic research. This resulted in an unprecedented level of data sharing to open repositories, which has actively supported the identification of SARS-CoV-2 structure, molecular interactions, mutations and variants, and facilitated vaccine development and drug reuse studies and design. The European COVID-19 Data Platform was launched to support this data sharing, and has resulted in the deposition of several million SARS-CoV-2 raw reads. In this paper we describe (1) open data sharing, (2) tools for submission, analysis, visualisation and data claiming (e.g. ORCiD), (3) the systematic analysis of these datasets, at scale via the SARS-CoV-2 Data Hubs as well as (4) lessons learned. As a component of the Platform, the SARS-CoV-2 Data Hubs enabled the extension and set up of infrastructure that we intend to use more widely in the future for pathogen surveillance and pandemic preparedness.

Pandemic

Toolbox

Data Sharing

Identification

Preparedness

Dashboard

10.1101/2023.04.19.537514

Cite

Citations (7)