Exploratory Data Analysis for Disease Pedigrees and Cancer Genetics

April 4, 2018 updated by: National Human Genome Research Institute (NHGRI)

The researchers have developed and distributed several software packages for pedigree analysis (FAST, CASPAR, PedHunter, IIC) and cancer genetics (oncotree, METREX). Users who need assistance with the software or who want to see new features added often send the researchers data files that include human data. The information is normally coded, and the researchers do not have access to the identification of the people whose information is in the files. Sometimes the content of the files gives rise to collaborations between the software developers and the providers of the files. Because concerns over the confidentiality of medical information have increased significantly over the past few years, the researchers must apply for exemptions from detailed ethics committee oversight for every data set they receive. This process is cumbersome and makes it difficult to assist software users. The amount of information required to apply for an exemption also poses a barrier to collaborations.

A full protocol will subject all data sets to ethics committee oversight without the need for individual exemption requests, enabling the researchers to assist users with software problems and to collaborate with other researchers.

From January 1, 2000, through December 15, 2001, the researchers received 71 requests for assistance, 19 of which included data files. None of the data files had any names or patient identifiers. Of these 19, in 8 cases the researchers sent back modified output files. In two of these eight cases, the researchers could see results of research interest; one of them concerned human data. In 2 of the 19 cases, the researchers sent back modified input files; in one such case, they established a collaboration with the originator of the files. In sum, most requests come under the heading of customer service, with no research contents. A few, however, do lead to research results or collaborations, for which ethics committee oversight is required.

Over the three-year time frame of this protocol, the researchers anticipate receiving data on a maximum of 10,000 individuals. They have modified their software documentation to explicitly instruct users to make sure the data files they send have no names. Should they receive files with names, they will delete the files and ask the originator to resubmit them with names encoded. Users submit data through unencrypted e-mail. The data are stored in password-protected computers at the National Institutes of Health.

Study Overview

Status

Completed

Conditions

Detailed Description

We have developed software packages for pedigree analysis (FAST, CASPAR, PedHunter, IIC) and cancer genetics (oncotrees, METREX). The purpose of this protocol is to allow us to work on software problems reports that may contain human subjects data in them. The data should be coded, by which we mean that if there are any links between identifiers and names, we do not possess the links to decode the names. Occasionally our assistance leads to a more formal research collaboration. This protocol seeks to clarify the guidelines under which we can provide assistance to users of our human genetics software and to establish a formal procedure under which we can seek IRB approval for the serendipitous collaborations that arise from providing that assistance. We cannot predict the sizes of samples or the diseases studied in the data sets sent to us, so most of the medical aspects of this protocol are necessarily general. We rely on the data being coded and the collectors of the data having their own institutional approvals to protect against most risks. The scientific aspects of investigating problem reports cannot be hypothesis driven because we cannot guess what problems will arise. On the engineering side, the basic hypotheses are that: 1) our software is likely to contain some bugs or other weaknesses, which cannot be easily found except by having others use the software and 2) a good way to improve the functionality of the software is to encourage users to submit problem reports and other suggestions.

This protocol has been in effect since early 2002. The only amendments during that time were to set up three collaborations, as described in Sections 4.6 and 4.7 and 4.8. The protocol has been quite useful and no changes are proposed in procedures.

Study Type

Observational

Enrollment (Actual)

13385

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

United States
- Maryland
  - Bethesda, Maryland, United States, 20892
    - National Human Genome Research Institute (NHGRI), 9000 Rockville Pike

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Child
Adult
Older Adult

Accepts Healthy Volunteers

Genders Eligible for Study

All

Description

INCLUSION/EXCLUSION CRITERIA:

User Requests:

When receiving data sets for problem reports and/or new feature requests, all data sets would be accepted. We would not consider how the data were collected. The reasons for this are: the medical aspects of the data set are irrelevant to the reason we receiving the data; we cannot respond promptly to problem reports, if we have to get details on how the data were collected; the data are never used by us for any research into the traits being studies by the researchers who collected data.

Collaborations:

For collaborative studies, we would request details of how the data were collected, including evidence of approval by a local ethics board. We would submit to the NHGRI IRB an amendment describing the new collaboration. That amendment would necessarily include a formal indication that the collaborating research group has permission to collect and analyze the human data that they present to us (in coded and summarized format). For collaborators in the United States that permission would consist of an IRB-approved protocol or exemption from the collaborator's institution. For collaborators outside the United States the permission would be one of the types of agreements currently supported by the NIH Office of Human Subjects Research.

If we do not see evidence of appropriate permission to collect the data or the IRB turns down our proposed amendment, then we would exclude ourselves from participating in the proposed collaboration.

There are two other circumstances under which we have also excluded collaborating on analysis of data sets in the past and may do so in the future. One circumstance was where we did not feel that the proposed data set could possibly give sufficient statistical power to detect anything interesting. The other circumstance was where we had an existing collaboration with one research group, and a competing group asked us to collaborate also.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

National Human Genome Research Institute (NHGRI)

Investigators

Principal Investigator: Alejandro A Schaffer, Ph.D., National Human Genome Research Institute (NHGRI)

Publications and helpful links

The person responsible for entering information about the study voluntarily provides these publications. These may be about anything related to the study.

General Publications

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start

March 19, 2002

Study Completion

May 9, 2017

Study Registration Dates

First Submitted

June 19, 2006

First Submitted That Met QC Criteria

June 19, 2006

First Posted (Estimate)

June 21, 2006

Study Record Updates

Last Update Posted (Actual)

April 5, 2018

Last Update Submitted That Met QC Criteria

April 4, 2018

Last Verified

May 9, 2017

More Information

Terms related to this study

Keywords

Other Study ID Numbers

999902152
02-HG-N152

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Data Analysis

National Taiwan University Hospital

Unknown

Identification of Time-invariant EEG Signals for Brain-Computer Interface

Healthy Subjects | EEG Data Analysis

Taiwan
University of Oxford

Completed

Ethical Challenges of Consent in Data Sharing (Data Sharing)

Data Sharing

Thailand
Universitaire Ziekenhuizen KU Leuven

Unknown

Data Collection for CAD Evaluation

Data Collection

Belgium
Sheba Medical Center

Recruiting

Testing the Effectiveness of Two Hospitalization Alternatives Compared to Psychiatric Hospitalization (HOSP-ALT) (HOSP-ALT)

Data Collection | Questionnaires

Israel
University of Cincinnati

Active, not recruiting

The Wright Center of Innovation Imaging Archive (WIIA)

Medical Imaging Data

United States
University of Calgary
M.S.I. Foundation

Completed

Does Internet Data Collection Improve Cohort Retention?

Internet Data Collection

Canada
University of Portsmouth
Portsmouth Hospitals NHS Trust

Not yet recruiting

Healthy Data: Improving Health Information Quality Using Intelligent Systems

Data Quality
IntelligentUltrasound Limited

Completed

ScanNav Anatomy PNB Data Collection Study

Data Collection

United Kingdom
Stryker Neurovascular

Completed

Trevo® Retriever Registry (China)

Real World Data in China

China
University of Michigan
Food and Drug Administration (FDA)

Recruiting

Pharmacokinetic Study of Single-Dose Modified Release Glipizide in Healthy Volunteers

Human Gastrointestinal Physiology Data

United States

Exploratory Data Analysis for Disease Pedigrees and Cancer Genetics

Study Overview

Status

Conditions

Detailed Description

Study Type

Enrollment (Actual)

Contacts and Locations

Study Locations

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Genders Eligible for Study

Description

Study Plan

How is the study designed?

Collaborators and Investigators

Sponsor

Investigators

Publications and helpful links

General Publications

Study record dates

Study Major Dates

Study Start

Study Completion

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Estimate)

Study Record Updates

Last Update Posted (Actual)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Keywords

Other Study ID Numbers

Clinical Trials on Data Analysis

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Slovakia

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations