Sunday, November 14, 2010

S08 Clinical Information Warehouses. notes. #AMIA2010 #research Kamal/Payne-OSUMC, Weng-Columbia, Batal-Pitt

J. Kamal  Ohio State UMC data warehouse 
13+ year project
Information Warehouse - IW
IW Data Marts: Business, Clinical, Research, External
Started with financial. 
Web scorecards, dashboards, NLP, data mining.
Powerful ad hoc query tools.
Oracle BI Discoverer.
iCount - User Query tool for aggregates -- aggregate counts of patient cohorts to define patient populations for research grants; tied to standard terminology; all users have access. No IRB needed.
IW has >4K users 
Queries increasing exponentially.
Users: Administrators, managers, clinicinas researchers, finance.
50 dashboards of approx 1K indicators. Powerful visual reports.
Research portal, Investigator profile
CIO dashboard.
MRSA reporting
Restraint Orders
Diabetes mgmt
Genetic data storage and analysis
Patient tracker automates patient flow recording
Managers notified of idle processes

Data driven issue awareness
More scorecards
Physician Profiling system
De-identified IW
Hagop Mekhjian MD Chief Medical Officer
Posters 14, 41, 102, 46, 91, ... (11+ posters)
IW at OSUMC has Honest Broker status; Two protocols - internal+ external - worked with IRB and Research Office to 
make processes robustly meet the criteria for non-human subject research.

C. Weng et al. Comparing Effectiveness of Clinical Registry vs. Clinical Data Warehouse for Supporting Clinical Trial Recruitment: Case Study
Columbia University
TECOS Trial Evaluating Cardiovasc Outcomes w Sitagliptin

Diabetes Registry created in 2005, contains 5K patients, few variables A1C, urine microalb., LDL chol
Clinical Data Warehouse
Too many false positives recruiting with Registry, PI turned to Informatics/Data Warehouse
Warehouse had 2X the # of variables of interest vs. Registry (8/12 vs. 4/12).
Temporarily Ineligible (awaiting result)
Definitely Ineligible
Potentially Eligible > recent visit, must be referred by their primary care provider.
Confirmed Eligible
Target n=60
                                 Registry        Warehouse
Potentially Eligible        2033             100   
Confirmed Eligible            29(6.6%)     31 (31%)
nonconsent rate                 S  i  m  i  l  a  r
No working days               74              59
Patients/wk                        1              2.5
Total in 3 months              14              30  

Registry lacked rich clinical data for evaluating exclusion criteria
Columbia Study site rated #3 of over 300+ sites worldwide.
Warehouse needs sophisticated query skills.
Registry has better quality for disease-specific markers.
CDW  more effective in excluding the ineligible.
Dynamically generated registries linked to CDW can lead to promising recruitment solutions,
Planning a dynamic Protocol-specific screening tool that utilizes CDW and Registry
PI Weng NLM R01 

Payne P. TRITON Project
Integrative Translational Research Information Management Platform
CLL Research Consortium
NCI funded P01 fo rCLL
n=5,000 cohort

Modernize legacy information system
provide tools to increase in research productivity
-data exchange
-bio-specimen mgmt
-adverse event detection
-protocol driven Decision support
-integraitve query and data aggregation
Provide community access to technologyes

Systems Design:
Open source, standards compliant
portal based approach
GWT environment "Google look and feel"
Current Project Status

billing, budgeting, trial mgmt, close-out and reporting

Model-driven design/architecture:
Real world protocols > represent as a logical model>map to standards and data sources

Some lessons:
Don't use just PIs - get research staff "on the ground".
Use Project Management methods.
Organization perception is just as important as technical functionality. 

Mining Clinical Data using Minimal Predictive Rules
Batal I. and Hauskrecht M., University of Pittsburgh
Rule Induction methods represent knowledge
Association Rules more complete than greedy algorithsm - eg. trees; generate a lot of rules
Add correlation measure - statistical test or interestingness measure (evaluates each rule individually
We should consider the nested structure of the rules.
Minimal Predictive Rule [MPR]:
predicts class significantly better than the subrules
Lossy pruning speeds the mining, at the risk of some missing MPRs.
A-MPR prunes 98% of the search space without changing AUC.
An efficient algorithm for approximate mining.

Friday, November 5, 2010

T. Kato on Epigenetics of bipolar disorder #ASHG2010 notes

Epigenetics of Bipolar disorder
T. Kato

Previous association studies:
Ferreira et al Nat Genet 2008
PBRM1 Nat Genet 2010
GABA A R alpha 1

MZ discordance for bipolar disorder:
1. Point mustation
2. CNV
3. mosaicism

Methods - unbiased approach RED, RAPD, RDA, MS-RDA
Two pairs MZ twins discordant for bipolar disorder
488K CNV
MBD2B MBD3L1 conjugated beads Active Motif
Tiling arrays

Promoter wide DNA methylation analysis p<10^-4 by MAT
Bisulfite sequencing of 13 regions overlapping with CpG island to validate; refined p<10^-6 
HTTLPR DMR upstream 25,590,000(?correct location -- nocx not sure about this position) -- seen in ss genotype only
Beach 2010, van  IJzendoorn Biol Psy 2010
Philibert group 2010 SLC6A4 papers
see also poster 2527W Hiroko Sugawara

Posted via email from nocx

Wednesday, November 3, 2010

D. Mehta: Polymorphisms in FKBPS define biologically distinct subtypes of PTSD #ASHG2010 notes

Divya Mehta  et al
Polymorphisms in FKBPS define biologically distinct subtypes of PTSD: evidence from endocrine and gene expression studies

Individuals with PTSD have supersensitive GR.
FKBP5 is a cochaperone, part of a mature heteroGR complex, and regulator of the heteroGR complex.
FKBP5 SNPs interact with child abuse to predict PTSD symptoms. 
Binder et al. 2008; Yehuda group paper.

African American sample from suburban Atlanta.
218 participants with trauma history.
PSS (PTSD measure)
183 transcripts associated with trauma scores, none previously described.
Then they stratified transcript results by genotype*PTSD --significant interaction, no main effect.
32 significant interactions, transcripts previously identified, associated with PTSD symptoms X FKBP5 SNP rs 92950158.
Opposite effects of transcripts in carriers of risk and protective markers.

Complmented with a literature based - pathway mining approach. 
Use intermediate phenotypes.
use co-expression networks.

Posted via email from nocx

Nancy Cox: Are we looking in the wrong places? #ASHG2010 notes

Follow up to GWAS:
Sequence regions near most significant SNPs, sequence all local genes 

Transcriptome studies using lymphoblastoid cell lines -  PLoS Genetics 2010
SNPs with highest eQTLs are enriched for WTCCC Crohn's susceptibility loci
+other loci too. Pattern different in CAD, Hypertension, T2D.
Enrichment of locally acting eQTLs versus distantly acting eQTLs.
T2D had most dramatic enrichment in a late bin-- transcriptome studies using adipose and muscle tissue.
Most QTLs are distant regulators of gene expression.
Took top 1K SNPs in WTCCC T2D studies. Vast majority enriched for eQTLs acting distantly.
Cancer mostly local regulators.
T1D more local than distant.
Distant regulators quite often on different chromosomes.
Will different phenotypes have different architecture WRT distant vs. local regulators?
Does tissue specificitiy play a role?
Has implications for how we think about sequencing studies.

Posted via email from nocx

Mingyao Li Plenary on RNA Editing #ASHG2010

RNA-DNA differences = RDD
RDD are not traditional RNA editing or transcription infidelity
n=27 CEU
RNA sequences of B cells
DNA sequences from publicly available
Each person has 3.8K RDDs+/-1.6K
20K RDD sites in 4500 genes

Posted via email from nocx