STR DNA Data Analysis. Genetic Analysis

Size: px
Start display at page:

Download "STR DNA Data Analysis. Genetic Analysis"

Transcription

1 STR DNA Data Analysis Genetic Analysis

2 Once a DNA profile is generated, how does one interpret the data? DNA data analysis is based on alleles detected DNA data analysis rarely can tell you time of deposit on inanimate objects DNA data analysis rarely can tell you exact circumstances without reconstruction but is an aid to interpretation DNA data can be an aid to crime scene reconstruction in positioning individuals in certain scene locations DNA data can establish if an individual is often likely to have ever been at a scene if sufficient DNA data is present

3 How do we go from raw collected data to processed DNA profiles? Steps in Data Interpretation

4 Steps in Analysis: Evidentiary DNA profiles are analyzed first independently of the known reference sample Was a profile obtained? If not, write the report as no results obtained Was a full DNA profile (all the loci obtained)? Was a partial DNA profile obtained? (only some of the alleles are present) Is the profile a single source? (no more than 2 alleles per locus at all the loci) Is the profile a mixture? (evidence at more than 1 locus of 3 or more alleles) For this class, we will discuss mostly single source samples These relate to known reference samples, convicted offender database samples, paternity casework, human remains identification

5 Definitions: Locus a region on a chromosome tested for genetic information Allele a variant form of DNA that differs between individuals in a population Genetic inheritance 50% of DNA from each parent; therefore, a chromosome pair consists of 1 allele derived from each parent Homozygous each parent donates the same allele (e.g. 11, 11) Heterozygous each parent donates a different allele (e.g. 11, 12) Genetic mutation a region of DNA is missing or duplicated so a rare result is observed in the DNA profile PCR artifact known visible features to the raw data that can be edited out based on classification of artifact (e.g. stutter, minus A/nontemplate addition)

6 What does an STR appear as on a strand of DNA? Whole number allele values vs. micro variants

7 What is a peak? A peak is defined by a visible thin apex a few pixels wide that has consistent sharp shape and resolution Not every fluorescence above baseline is consistent with being a peak Lint, urea crystals, air bubbles etc. can create changes in baseline and must be reviewed by the analyst raised baseline, shoulder

8 Threshold values - degradation: Peaks above threshold are valid to report Peaks below threshold must be noted in some way to alert the reviewer that another person may be present, however, the amount is so small that the laboratory has no confidence in labeling the allele calls Individual labs set their own thresholds

9 Identifying single source vs. mixtures in data

10 How does the software establish allele calls? GeneMapper Software DNA fragments are compared to standard curve of the internal size standard Fragment size is then crosscompared to allelic ladder to assign a numeric repeat value

11 Some established PCR artifacts: Stutter Spectral pull up Nontemplate addition (minus A) Amplification product balance Stochastic effects (low template quantity) Allele drop out (stochastic effect) Allele drop in (stochastic effect)

12 Stutter calculations and filters: Formula for recognizing stutter is n - 4 where n is the size (bp) of the main peak Software filters out allele labeling for all peaks at the n 4 position that are typically 10% of the value of the main peak (peak height ratio calculation) (10% rule)

13

14 Spectral pull up: Collection software has a matrix (virtual filter) that instructs the CCD camera in the DNA sequencer as to what true red, green, yellow etc. represents and gates the collection channel for that color Spectral pull up is when sample fluorescence is too high and bleeds through into a different color channel This can look like a peak but is not a real peak and data must be reviewed

15

16 Minus A or non template addition is due to overload of DNA template such that the Taq polymerase enzyme cannot add the adenine to the end of the DNA fragment (n 1)

17 Amplification product balance:

18 Stochastic effects: These are fairly straightforward for single source samples Stochastic effects are very difficult with complex mixtures and low level templates Extreme caution in interpreting samples at low level due to chance of false homozygotes or peaks below analytical threshold that could be exculpatory

19 DNA Data Interpretation Can be very straight forward and is attempted file by file before comparing to reference samples in a case Can be extremely complicated if data is complex Inconclusive or uninterpretable data do occur and should be reported a such Exclusions like all forensic testing are absolute; inclusions require some statistical analyses to give meaning or weight to the match Most laboratories do not report match if fewer than 3 loci match due to coincidental matching in a population