The Rule of Probabilities: A Practical Approach for Applying Bayes' Rule to the Analysis of DNA Evidence
Summary (5 min read)
INTRODUCTION
- With the recent Supreme Court decision allowing the collection of DNA samples from any person arrested and detained for a serious offense, it seems inevitable that the justice system will collect and use large DNA databases.
- There is concern that as database size increases, so too will the rate of false positives, and thus innocent people will be convicted when their DNA matches evidence left at a crime scene.
- The authors will show how both the prior probability and the relevant database size can be estimated under alternative assumptions, assumptions that are appropriately open to literal and figurative cross-examination, so as to assure the robustness of the bottom-line conclusion: the defendant was or was not the true source of the crime scene evidence.
I. THE ISLAND OF EDEN
- Imagine that a singular crime has been committed on the otherwise idyllic island of Eden.
- All of the other people in the population have been ruled out by the lack of a DNA match. Prior to the test, both Mr. Baker and Mr. Fisher were equally likely to have been the criminal.
- Had the population been above 51,294, a test with a one-in-a-million chance of a false positive would lead to more than a five percent chance that at least one person would match even when everyone is innocent.
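The 51,294 threshold can be checked directly: it is the smallest population size N for which 1 − (1 − r)^N exceeds 5% when r is one in a million. A minimal sketch (assuming independent tests):

```python
import math

def p_any_match(n: int, r: float) -> float:
    # Chance that at least one of n innocent people matches by coincidence,
    # given a per-person random match probability r (independence assumed)
    return 1.0 - (1.0 - r) ** n

r = 1e-6  # one-in-a-million random match probability
# Smallest population at which the chance of a coincidental match tops 5%:
n_star = math.ceil(math.log(0.95) / math.log(1.0 - r))  # -> 51294
```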
- What matters is the probability that the guilty party is in the database.
- The defendant might have been convicted even without the confirmation of DNA evidence.
II. BAYES FOR AN ERA OF BIG DATA
- Court cases introducing DNA evidence have traditionally focused on three different numbers: 1. The random match probability: the probability that a randomly selected person will be a DNA match.
- Indeed, if the expected number of innocent matches in the database were two, the authors would not say that the chance of a database match is 200% and thereby violate a fundamental tenet of probability that all probabilities must be at or below one.
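The distinction between an expected number of matches and the probability of at least one match can be made concrete. If innocent matches are rare and independent, the standard Poisson approximation (not a formula from the article itself) converts an expected count into a probability that never exceeds one:

```python
import math

def p_at_least_one(expected_matches: float) -> float:
    # Poisson approximation: with an expected count lam of rare, independent
    # matches, P(at least one match) = 1 - exp(-lam), which never exceeds 1
    return 1.0 - math.exp(-expected_matches)

p_at_least_one(2.0)  # ~0.86, not 200%, even though two matches are expected
```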
- This Article describes how, in practical terms, to convert the inputs into the number the trier of fact should care about.
- It will be of enormous help to introduce some notation: S will stand for the result that the defendant is the source of DNA found at the crime scene.
- The authors start with the aggregate probability that someone in the database is the source.
- This probability will tell us a great deal about the posterior source probability with regard to every individual in the database.
- Because the authors are assuming no false negatives, the posterior source probability of all unmatched individuals is zero.
- For large databases, this makes little difference, since it changes the size of the database by only one.
- If there is a single unalibied match, then the posterior database source probability will be entirely focused on that matching individual (and the remaining unmatching individuals in the database will have a zero source probability).
- The rate of false negatives varies depending on the width of the DNA band, or "match window," used to match the suspect to the source.
- While this does not rule out the possibility of a false negative, for that to have happened, the authors would have to have experienced both a false positive and a false negative at the same time.
- The intuition for this formula follows from a Venn diagram:
P(S | M) / P(¬S | M) = [P(S) / P(¬S)] × [P(M | S) / P(M | ¬S)]
- The odds of observing M unalibied matches is the ratio of the probability this would happen when the source is in the database versus the probability this would happen when the source is not in the database (and the M unalibied matches occur by chance).
- The binomial distribution provides the probability for each possible number of heads, from 0 to N.
- If it turns out that these two numbers perfectly coincide, then the authors do not update the prior probabilities.
M : rD(1 − a) (5)
- This likelihood ratio can be restated simply as the ratio of the actual number of unalibied matches relative to the expected number of unalibied, nonsource matches: M : E[M] (6)
- This likelihood ratio indicates how strong the new information is in terms of changing the prior opinion.
- If the likelihood ratio in equation (6) is 10:1, then it is ten times more likely that the M matches observed are the result of the true match being in the database than all being there by luck.
- If their initial view was that it was twice as likely that the database did not contain the true match (prior odds are 1:2), then Bayes' rule tells us (via equation (4)) that putting these together means the new odds are 5:1 in favor of the database containing the true match.
- Bayes' rule says that to derive the updated, posterior odds of the source being in the dataset, all the authors need to do is simply multiply the prior odds by the likelihood ratio of equation (6).
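The odds-form update in the worked example (prior odds 1:2, likelihood ratio 10:1) is a one-line multiplication:

```python
from fractions import Fraction

def posterior_odds(prior_odds: Fraction, likelihood_ratio: Fraction) -> Fraction:
    # Bayes' rule in odds form: posterior odds = prior odds * likelihood ratio
    return prior_odds * likelihood_ratio

# Prior odds 1:2 that the database contains the true source, and a
# likelihood ratio of 10:1 from the observed matches (equation (6)):
odds = posterior_odds(Fraction(1, 2), Fraction(10, 1))  # -> Fraction(5, 1), i.e. 5:1
prob = odds / (1 + odds)  # converting odds to a probability: 5/6
```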
III. COMPARATIVE STATICS
- This Part explores how the source probability of equation (8) changes as the authors change the four underlying variables (a, r, D, and p) while holding M constant.
- The authors also speculate on how these variables are likely to change over time.
- Increasing the random match probability, r, while holding everything else equal decreases their confidence that the matching individual was the source of the forensic DNA.
A. Trawling a Larger Database
- The question that often arises with a database trawl is how to adjust for the size of the database.
- To answer this question, the authors first assume that the two databases are each composed of individuals who, from a Bayesian perspective, have identical prior likelihoods of being the source of the forensic DNA.
- In other words, the larger database has the same average quality as the smaller one in terms of finding matches.
- As it turns out, there are two forces that almost exactly cancel each other out.
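The near-cancellation can be illustrated numerically. Assuming the posterior source probability takes the form p / (p + (1 − p)·r·D·(1 − a)), consistent with the article's equation (8), and assuming for illustration that the prior p grows in proportion to D, the two forces roughly offset when p is small:

```python
def source_probability(p, r, D, a):
    # Posterior source probability given one unalibied match:
    # p / (p + (1 - p) * r * D * (1 - a))   (the article's equation (8))
    return p / (p + (1 - p) * r * D * (1 - a))

r, a = 1e-8, 0.5  # illustrative values, not from the article
probs = []
for D in (1_000_000, 2_000_000, 4_000_000):
    p = 1e-8 * D  # assumption: the prior scales with database size
    probs.append(source_probability(p, r, D, a))
# The three posteriors are nearly identical: the weaker evidence from the
# larger trawl and the larger chance the source is in the database offset.
```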
IV. APPLICATION TO PEOPLE V. COLLINS
- The authors' analysis of DNA evidence can be usefully compared to the use of eyewitness evidence in the famous People v. Collins case.
- Thus, the relevant population should be the number of couples in greater Los Angeles, where the crime was committed.
- If the couple in court is guilty, then the chance some other innocent couple will match is 1 − (1 − r)^T, where T is the number of couples not yet examined by the police.
- If the police had searched the entire population of possible couples and found that the defendants were the only match, then the authors would know that the couple is guilty.
- The fact that they were dead broke just prior to the robbery and yet had unexplained spending right after the robbery should factor into the equation.
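The Collins calculation above can be sketched as a weighted average of the two cases; the prior of guilt and the number of unexamined couples below are hypothetical placeholders:

```python
def p_another_match(r: float, T: int, p_guilty: float) -> float:
    # If the defendants are guilty, another couple matches only by chance:
    # 1 - (1 - r)**T.  If they are innocent, the true couple is among the
    # T others, so another match exists with certainty (probability 1).
    p_by_chance = 1.0 - (1.0 - r) ** T
    return p_guilty * p_by_chance + (1.0 - p_guilty) * 1.0

# r = 1/12,000,000 as in Collins; T and the 50% prior are hypothetical:
p_another_match(1 / 12_000_000, 1_000_000, 0.5)  # ~0.54
```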
V. APPLICATION TO PEOPLE V. PUCKETT
- On February 21, 2008, John Puckett was found guilty of first-degree murder for the 1972 death of Diana Sylvester.
- Smith describes the ambiguous evidentiary record: [Lead homicide investigator].
- The jury also heard of Puckett's three prior rape and assault convictions.
- But that is not his burden (and it would be difficult for most people who had lived in San Francisco to explain in 2008 where they were on a particular night in 1972).
p : 1 − p
- With the new data, the updated (or posterior Bayesian) odds become: 125 × p : 1 × (1 − p), or 125p : (1 − p) (12). Associated with these odds is a probability that Puckett is the guilty party.
- If the authors imagine that proof beyond reasonable doubt requires establishing a source probability at or above 99%, then they can work backward to derive a minimum prior that would produce that posterior probability: 125p / (1 + 124p) ≥ 0.99.
- If the authors believe there is a 44% or higher chance that the guilty party is in the 338,711-felon database, they can conclude that Puckett has a 99% or higher chance of being the person whose DNA was left at the crime scene.
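Working backward from a target posterior is simple algebra: with likelihood ratio L and target probability t, the minimum prior is t / (L − (L − 1)t). A quick check of the 44% figure:

```python
def min_prior_for_posterior(lr: float, target: float) -> float:
    # Posterior probability = lr*p / (lr*p + 1 - p); solving
    # lr*p / (lr*p + 1 - p) >= target for p gives:
    return target / (lr - (lr - 1) * target)

min_prior_for_posterior(125, 0.99)  # ~0.442, i.e. the 44% prior in the text
```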
B. A Model for Calculating Priors
- Certainly a random person on the street would not have a 44% chance of being the guilty party.
- One approach to estimating the prior would be to compare the size of the database to the size of the population without alibis.
- The authors assume that criminals behave in the following manner.
- Again, fraction f are caught and (1 − f) are not.
- Those caught are entered into the database, and those who have escaped conviction twice are still not in the database.
- After each crime, a criminal retires with probability d and continues with probability 1 − d.
- Imagine that f, the chance of getting caught, is 50%.
- In addition, if a random criminal retires with a 39% chance, this says that the average criminal would commit 2.6 crimes before "retiring." This seems like a small number.
- This modeling approach to estimating the prior probability of database guilt-more precisely, the prior probability that someone in the database is the source of the crime scene DNA-has to their knowledge never before been used.
- That fraction and the database size are determined by the probability of being caught.
- Thus, if the database contains felons from all of California rather than just from San Francisco, then moving to Los Angeles is not enough to retire.
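One reading of this retirement model treats each crime as a geometric trial. The sketch below assumes a criminal is caught with probability f on each crime and otherwise retires with probability d; the 50% capture rate is the authors' illustrative figure, and the recursion is this reading's assumption rather than the article's stated formula:

```python
def expected_crimes(d: float) -> float:
    # Geometric model: a criminal retires with probability d after each
    # crime, so the expected career length is 1/d crimes
    return 1.0 / d

def p_ever_caught(f: float, d: float) -> float:
    # Chance of ever being caught (and entered into the database) before
    # retiring: caught with probability f on each crime, and otherwise
    # committing another crime with probability (1 - f) * (1 - d)
    return f / (1.0 - (1.0 - f) * (1.0 - d))

expected_crimes(0.39)     # ~2.6 crimes on average, as in the text
p_ever_caught(0.5, 0.39)  # ~0.72 under these illustrative parameters
```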
VI. EMPIRICIZING THE ALIBI AND PRIOR PROBABILITIES
- The application to People v. Puckett motivates a broader discussion of how to empirically assess the underlying parameters that influence the estimation of the posterior source probability.
- At the other extreme, the authors would assume that Baker comprises 30% of the 40% other category, and thus make no change to the 60% priors.
2. Empiricizing the prior probability
- Of course, there will be and should be reasonable disagreement about what constitutes a similar crime.
- Similarity would have to be with regard to a host of factors-including not just the crime type but also the modus operandi and the characteristics of the defendant.
- A sensible way forward would be to derive alternative priors based on alternative assumptions of what constitutes similar crimes as well as on plausible structural models, and then see if the defendant's source probability is sufficiently high even after combining the likelihood ratio with the most conservative (i.e., lowest) probability estimate within this range.
B. Small Database Trawls
- The analysis above is done under the stylized assumption that everyone in the database has the same prior probability of having been at the crime scene.
- Take the case where a woman, who had a documented history of being a victim of spousal battery, is found murdered.
- The authors again suggest that the prior can be inferred from adjusting the match rate in similar confirmation cases.
- It might be tempting to infer that thirty percent of the time the husband was the source of the forensic DNA.
VII. ADMISSIBILITY
- The authors' goal in this Part is to suggest specific ways an expert might present his or her opinion under existing law and when existing evidentiary rules should change to accommodate a more coherent factfinding process.
- To be admitted, the proposed probability evidence must also be consistent with Rules 104(a), 702, 703, and 403, or their state equivalents.
- However, changes in courts' approaches to similarity with respect to DNA evidence provide some reason for predicting that adjusted match probabilities are increasingly likely to be admissible.
- Courts that consider the relevance of statistical evidence have different philosophies, are influenced by a host of situation-specific variables, and occasionally make rulings that might go the other way.
- Prior data are just as important as the data that allow the authors to update their beliefs.
- What ultimately matters is where the authors end up, and that they arrived at that destination via a path that employs sound logic and reasoning.
CONCLUSION
- In the 2012 presidential election, Nate Silver caused a stir by correctly predicting the winner of all fifty states and the District of Columbia in the general election. Beyond accuracy, Silver's larger impact has been in changing the central polling metric and improving the way that metric is calculated.
- In case 2, each company only has an 80% chance of being found liable for its accidents (as the eyewitness may read the license plate incorrectly), and thus each company only has 80% of the full incentive.
- After Silver, the same evidence can be described as a 97.5% chance that Obama will win the election.
- And, like Silver, the authors advocate that the method of estimating this probability be explicitly Bayesian.
Frequently Asked Questions (6)
Q2. What is the probability of a rapist being overrepresented in the database?
It might, for example, be possible that African Americans, because of disparities in policing, are overrepresented in the database.
Q3. How do you calculate the probability that there would be another couple that matches?
To calculate the correct probability that there would be another couple that matches, it would be necessary to average the two numbers, weighting the 100% result by the chance the couple is innocent and the 1 − (1 − r)^T probability by the chance the couple is guilty.
Q4. How can one reject the conclusion that the true source probability is anything below 19.7%?
With a 30% hit rate based on 100 trawls, the authors show that with 99% confidence one can reject the conclusion that the true source probability is anything below 19.7%.
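The 19.7% figure is consistent with a standard exact one-sided lower confidence bound for a binomial proportion (a Clopper-Pearson-style construction, sketched here by bisection rather than taken from the article):

```python
from math import comb

def binom_sf(k: int, n: int, p: float) -> float:
    # P(X >= k) for X ~ Binomial(n, p)
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

def lower_confidence_bound(k: int, n: int, alpha: float) -> float:
    # One-sided (1 - alpha) lower bound for a binomial proportion: the
    # smallest p at which observing k or more successes is not too
    # surprising, found by bisection (binom_sf is increasing in p)
    lo, hi = 0.0, k / n
    for _ in range(60):
        mid = (lo + hi) / 2
        if binom_sf(k, n, mid) < alpha:
            lo = mid
        else:
            hi = mid
    return hi

lower_confidence_bound(30, 100, 0.01)  # ~0.197, matching the 19.7% figure
```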
Q5. Why have courts been reluctant to introduce evidence about the prior probability of an individual defendant's being?
Courts have been particularly reluctant to introduce evidence about the prior probability of an individual defendant's being the source of forensic DNA because the very process of constructing a prior probability seems inconsistent with the notion of a fair trial.
Q6. What is the probability of a trawl from a larger database?
In this case, the chance that the source match is in the database is unchanged at p, but the expected number of innocent matches has gone up by r. Now the result of a trawl from a larger database is less convincing: Source Probability = P(S | M = 1) = p / (p + (1 − p)r(D + 1)(1 − a)). An increase in D to D + 1, holding p constant, increases the denominator and reduces the posterior source probability.