Message from @realz

Discord ID: 776149928938307584


2020-11-11 18:13:58 UTC  

that at least has enough samples, though the orders-of-magnitude issue might be a problem there

2020-11-11 18:14:58 UTC  

@RobertGrulerEsq Stand up maths (humble pi) isn't just talking about number of samples

2020-11-11 18:15:14 UTC  

he makes an argument about the distribution of the numbers as well

2020-11-11 18:15:38 UTC  

i.e if they don't span several orders of magnitude, then it isn't really that useful as proof

2020-11-11 18:16:21 UTC  

it isn't impossible to split up the data to get a large number of samples (I think I saw 700+ samples from one jurisdiction, might have been Nigrini)

2020-11-11 18:16:32 UTC  

Right but then he references Mark Nigrini as an authority in this stuff who then in his own video uses the exact method of analysis.

2020-11-11 18:16:34 UTC  

but 69 samples is egregiously small

2020-11-11 18:16:49 UTC  

I see what you mean.

2020-11-11 18:17:19 UTC  

I noticed Mark seems to validate the data before the benford's analysis by doing the end digits distribution analysis first before looking at the first digits.

2020-11-11 18:17:35 UTC  

mmm

2020-11-11 18:17:47 UTC  

I can rerun my experiment with 69 samples, it will be quite wild

2020-11-11 18:17:55 UTC  

And I was thinking there must be a lower limit on the sample size.

2020-11-11 18:19:03 UTC  

And maybe the order of magnitude applies within the samples? i.e., some precincts or counties with dozens, hundreds, thousands, ten-thousands, etc. but all those numbers being lumped into the single analysis.

2020-11-11 18:19:12 UTC  

I don't speak math very well not sure if I'm communicating that well.

2020-11-11 18:21:15 UTC  

@RobertGrulerEsq The brief teaching I had in my different forensics class Benford's Law can be used on small data sets that have enough number cycles i.e. sufficient use of 1-9

2020-11-11 18:21:16 UTC  

benford's law test, with 69 (number of counties in PA) samples, (random) normally distributed between 0 and 19000000

https://cdn.discordapp.com/attachments/771201221145919499/776149566645993502/benford-69-samples.gif

2020-11-11 18:21:43 UTC  

PA*

2020-11-11 18:22:09 UTC  

That's pretty amazing you can do that so quickly, I'm jealous of your skills.

2020-11-11 18:22:38 UTC  

I had it ready lol

2020-11-11 18:22:43 UTC  

I just changed the number

2020-11-11 18:23:18 UTC  

Have you run any that have a bell-curve distribution that look like Biden's?

2020-11-11 18:23:31 UTC  

to be clear this is not run on real data

2020-11-11 18:23:39 UTC  

oh

2020-11-11 18:23:44 UTC  

normal distribution is a bell curve

2020-11-11 18:24:13 UTC  

it isn't exactly the same as Biden's bell curve though

2020-11-11 18:24:33 UTC  

it is a bell curve centered on zero (I abs() the negative numbers)

2020-11-11 18:24:40 UTC  

I can center it on some number though

2020-11-11 18:26:26 UTC  

I see. Just wondering if adjusting of the sample size adjusted the curve more in alignment with the thumbnail in the math guy's video above of Biden's numbers.

2020-11-11 18:29:08 UTC  

oh I see

2020-11-11 18:29:27 UTC  

no, increasing the sample size for my experiment makes it more like benford's law

2020-11-11 18:29:34 UTC  

@realz Correct me if I am wrong here, but in using Benford distribution as evidence of voter fraud, you would have to have representative sample sizes from an election verified to be true (baseline) and one that has been confirmed as fraud, and calculate probability based on how much the current election approach the latter, yes?

2020-11-11 18:29:34 UTC  

probably because I center around 0

2020-11-11 18:30:06 UTC  

@Doc that is another good point; to test previous elections for the same thing

2020-11-11 18:30:23 UTC  

(but there are those that might say "it's always been going on")

2020-11-11 18:30:36 UTC  

Right, so you need some sort of baseline.

2020-11-11 18:30:49 UTC  

well that is _yet_ another litmus test for these proofs

2020-11-11 18:30:50 UTC  

because all elections will have some systematic and random defects.

2020-11-11 18:31:08 UTC  

my experiment just shows that with few samples, the expected error for each bar is too high

2020-11-11 18:31:15 UTC  

right

2020-11-11 18:31:18 UTC  

very nice.