Population Genomic Diversity of France (POPGEN)

Having access to DNA sequences from individuals that share common ancestry with patients is of interest when analysing individual genomes for diagnoses. Information regarding allele frequency distribution in the same geographic areas where patients have ancestry will be necessary to help select the variants that are the most likely involved in disease and should hence be tested in functional assays.

To provide such a general population panel for France, the POPGEN project will select 10,000 individuals from CONSTANCES cohort with ancestry in different regions of France outside western part. These participants will have their DNA extracted from salivary kits and genotyped using SNP-chip. Among these 10,000 individuals, 4,000 individuals will have their whole genome sequenced. This study is one of the four pilot projects of the France Genomic Medicine plan (FMG 2025). The FMG2025 plan aims at introducing genome sequencing in the routine clinical practice to accelerate and improve diagnoses. The POPGEN project will provide the required references from the general French population to help filter out common variants from the genomes of patients.

Study Overview

Status

Active, not recruiting

Detailed Description

The development of high throughput sequencing technologies has opened-up new possibilities to sequence the human genome and identify all genetic variants in individual genome. In each genome, more than four million differences from the reference sequence, i.e. variants, are found and the real challenge is to find "the needles in stacks of needles". A majority of these variants are neutral with no impact on individual phenotype but some of them could impair individual health and lead to disease. Guidelines have been proposed for implicating sequence variants in diseases and limiting false positive reports. In these guidelines, emphasis is put on the requirement to compare the distributions of variants between patients and large control datasets matched as closely as possible to the patients in terms of ancestry.

Several large reference datasets such as those from the 1000 Genomes Project Consortium, the Exome Sequencing Project (ESP) or the Exome Aggregation Consortium (ExAC) are publicly available and give information on the variants found in the exomes or whole genomes of individuals with ancestries in different populations and their respective frequencies. Europeans are well represented in this database. However, there is no information on the geographic region in Europe where individuals are originating from, except for the Finns that are considered separately from the rest of Europe. Previous studies using SNP-chips have shown however that there exist some differences in allele frequencies between continental European populations and that these differences could lead to false positive results in association studies. These allele frequency differences at common variants are also detectable between regions within a country as found by several studies on different European populations, including France. It is very important to describe these geographic fine-scale stratification patterns to allow an efficient matching of cases and controls in association studies. This is especially true when the interest is on rare variants as these variants are, for the majority, young variants that have recently appeared in local populations and not spread over large geographic regions.

Having access to DNA sequences from individuals that share common ancestry with patients is also of interest when analysing individual genomes for diagnoses. Information regarding allele frequency distribution in the same geographic areas where patients have ancestry will therefore be necessary to help select the variants that are the most likely involved in disease and should hence be tested in functional assays.

A first project focusing on the western part of France is ongoing at Institut du Thorax in Nantes. Preliminary results have shown that using this panel of individuals improve variant filtering in the genomes of patients with similar ancestries. Efforts should therefore be made to cover other geographic regions of France.

To provide such a general population panel for France, the POPGEN project will select 10,000 individuals from CONSTANCES cohort with ancestry in different regions of France outside western part. These participants will have their DNA extracted from salivary kits and genotyped using SNP-chip. Among these 10,000 individuals, 4,000 individuals will have their whole genome sequenced. This study is one of the four pilot projects of the France Genomic Medicine plan (FMG 2025). The FMG2025 plan aims at introducing genome sequencing in the routine clinical practice to accelerate and improve diagnoses. The POPGEN project will provide the required references from the general French population to help filter out common variants from the genomes of patients.

Study Type

Observational

Enrollment (Actual)

10250

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

      • Brest, France
        • Inserm - UMR1078 GGB

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

18 years and older (Adult, Older Adult)

Accepts Healthy Volunteers

Yes

Sampling Method

Non-Probability Sample

Study Population

Participants will be recruited among CONSTANCES cohort volunteers who agreed transmission of their data and provided information about their parents and grandparent birthplaces. Among all those individuals, 15,000 will be randomly selected according the following criteria:

  • The place of birth of the ascendants is known and clustered in a restricted geographic area (maximum distance between birth places will be defined after analysis of their distribution in the entire dataset),
  • Demography distribution in metropolitan France.

Description

Inclusion Criteria:

  • participant included in CONSTANCES Cohort and have agreed to transmit their data for research purposes,
  • participant meeting the geographic criteria of the study,
  • participant who has given his consent for participating to this study.

Exclusion Criteria:

  • participant who do not have sent back their informed consent or the informed consent is non-complying
  • participant who do not to have provided a written free informed consent, such as for individuals placed under tutorship or guardianship.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

  • Observational Models: Ecologic or Community
  • Time Perspectives: Prospective

Cohorts and Interventions

Group / Cohort
Intervention / Treatment
Single arm

There is a unique arm in which all the participants will be included. The intervention consists in the collection of a salivary sample Volunteers who agreed to participate will have to collect their saliva with the self-collection device. They will then send back their saliva sample and a dated and signed copy of the informed consent form in the pre-paid return envelope.

All DNA of the saliva samples will be automatically extracted, then DNA samples will be genotyped and a subset of 4,000 DNA will be sequenced.

The genotyping will consist of measurement of general genetic variation, including the Single Nucleotide Polymorphisms. The SNP genotyping will be carried out using Illumina high density chips, in CNRGH production platform.

Sequencing will be performed in order to reach a mean coverage of 30X for each sample and a minimum of 25X mean coverage.

Finally, a bioinformatics analysis will be performed on sequencing data.

Volunteers who agreed to participate will have to collect their saliva with the self-collection device. They will then send back their saliva sample and a dated and signed copy of the informed consent form in the pre-paid return envelope.

All DNA of the saliva samples will be automatically extracted, then DNA samples will be genotyped and 4,000 of them will be sequenced.

The genotyping will consist of measurement of general genetic variation, including the Single Nucleotide Polymorphisms. The SNP genotyping will be carried out using Illumina high density chips, in CNRGH production platform.

A subset of 4,000 individuals will be selected to provide a homogeneous geographic coverage and to avoid inclusion of related individuals who will share large genomic regions. Sequencing will be performed in order to reach a mean coverage of 30X for each sample and a minimum of 25X mean coverage.

Finally, a bioinformatics analysis will be performed on sequencing data.

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Genotypes and allele frequencies of the participants
Time Frame: Through study completion, an average of 1 year
4,000 individuals will be selected based on the places of birth of their grandparents in order to ensure a homogeneous coverage of the different geographic regions. The DNA of these 4,000 individuals will be sequenced. Genotypes and allele frequencies at the different genomic positions where variations will be observed in the POPGEN population will be computed by simple counting of their occurences in the dataset.
Through study completion, an average of 1 year

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Investigators

  • Study Director: Emmanuelle GENIN, Researcher, Inserm - UMR1078 GGB

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

March 31, 2021

Primary Completion (Actual)

February 19, 2022

Study Completion (Estimated)

March 31, 2031

Study Registration Dates

First Submitted

November 14, 2019

First Submitted That Met QC Criteria

November 27, 2019

First Posted (Actual)

December 3, 2019

Study Record Updates

Last Update Posted (Actual)

March 31, 2026

Last Update Submitted That Met QC Criteria

March 26, 2026

Last Verified

March 1, 2026

More Information

Terms related to this study

Other Study ID Numbers

  • C19-39
  • 2019-A01862-55 (Other Identifier: ANSM)

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

product manufactured in and exported from the U.S.

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Genetics, Population

Clinical Trials on Collection of a salivary sample

Subscribe