Size Matters: Smaller Batches Yield More Efficient Risk-Limiting Audits

Similar documents
Certified Home Brewer Program. Minimum Certification Requirements

A Note on a Test for the Sum of Ranksums*

CIRCLE The Center for Information & Research on Civic Learning & Engagement

THE MEAT PRODUCTS REGULATIONS 2003 SUMMARY GUIDANCE NOTES

Statistics: Final Project Report Chipotle Water Cup: Water or Soda?

Buying Filberts On a Sample Basis

FACTORS DETERMINING UNITED STATES IMPORTS OF COFFEE

Slide 1. Slide 2. A Closer Look At Crediting Fruits. Why do we credit foods? Ensuring Meals Served To Students Are Reimbursable

BILL NUMBER: AB 727 BILL TEXT AMENDED IN ASSEMBLY MARCH 25, 2011 FEBRUARY 17, 2011

Wine-Tasting by Numbers: Using Binary Logistic Regression to Reveal the Preferences of Experts

Influence of GA 3 Sizing Sprays on Ruby Seedless

Fleurieu zone (other)

Washed agar gave such satisfactory results in the milk-powder. briefly the results of this work and to show the effect of washing

Name: Adapted from Mathalicious.com DOMINO EFFECT

Online Appendix to. Are Two heads Better Than One: Team versus Individual Play in Signaling Games. David C. Cooper and John H.

Level 2 Mathematics and Statistics, 2016

Dining Your Way into Reading

codex alimentarius commission FOOD AND AGRICULTURE ORGANIZATION OF THE UNITED NATIONS

All Adult and Youth cooks must sign up through the ICS website to be recognized as official ICS members.

food and drink Nutritionists advise us to eat at least 5 fruit or vegetable portions a day. What does a portion look like?

The 2006 Economic Impact of Nebraska Wineries and Grape Growers

Recipe Adjustment Factor Method

Stocktaking in. memsec EPoS. Categories

BarkeepOnline Managing Recipes

Hamburger Pork Chop Deli Ham Chicken Wing $6.46 $4.95 $4.03 $3.50 $1.83 $1.93 $1.71 $2.78

Country of Origin Food Labelling - Factsheet. Reforms to country of origin food labels

Name: Hour: Review: 1. What are the three elements that you need to measure to guarantee a successful recipe?

Brady Center to Prevent Gun Violence

Timmie s tops in customer satisfaction

Cost of Establishment and Operation Cold-Hardy Grapes in the Thousand Islands Region

Labor Supply of Married Couples in the Formal and Informal Sectors in Thailand

Costa Rica: In Depth Coffee Report: COFFEE INDUSTRY STRUCTURE

United States Standards for Grades of Walnuts (Juglans regia) in the Shell

Summary Report Survey on Community Perceptions of Wine Businesses

SA Winegrape Crush Survey Regional Summary Report 2017 South Australia - other

Multiple Imputation for Missing Data in KLoSA

Mischa Bassett F&N 453. Individual Project. Effect of Various Butters on the Physical Properties of Biscuits. November 20, 2006

THE COBURN COMPANY, INC. PO Box 147 Whitewater, WI U.S.A. Phone: Fax:

QUALITY DESCRIPTOR / REPRESENTATIONS GUIDELINES FOR THE

Napa County Planning Commission Board Agenda Letter

Release #2461 Release Date: Thursday, February 20, 2014

Grocery List (Step 2)

HONDURAS. A Quick Scan on Improving the Economic Viability of Coffee Farming A QUICK SCAN ON IMPROVING THE ECONOMIC VIABILITY OF COFFEE FARMING

The Bottled Water Scam

Archdiocese of New York Practice Items

Module 6. Yield and Fruit Size. Presenter: Stephan Verreynne

9: MyPlate Dairy Group

Estimating and Adjusting Crop Weight in Finger Lakes Vineyards

Standardization Portion Control

Mathacle. PSet Algebra, Ratios Level Number Name: Date: Solve.

18 May Primary Production Select Committee Parliament Buildings Wellington

Predicting Fruitset Model Philip Schwallier, Amy Irish- Brown, Michigan State University

Access to Affordable and Nutritious Food: Measuring and Understanding Food Deserts and Their Consequences

2016 China Dry Bean Historical production And Estimated planting intentions Analysis

Assignment 60 Marks 1 March, 2018

What's in the box 2 ENGLISH

1/17/manufacturing-jobs-used-to-pay-really-well-notanymore-e/

COLORADO REVISED STATUTES, TITLE 35, AGRICULTURE

COMPARISON OF EMPLOYMENT PROBLEMS OF URBANIZATION IN DISTRICT HEADQUARTERS OF HYDERABAD KARNATAKA REGION A CROSS SECTIONAL STUDY

Adelaide Plains Wine Region

The Contribution made by Beer to the European Economy. Poland - January 2016

Research - Strawberry Nutrition

CAUTION!!! Do not eat anything (Skittles, cylinders, dishes, etc.) associated with the lab!!!

FOR PERSONAL USE. Capacity BROWARD COUNTY ELEMENTARY SCIENCE BENCHMARK PLAN ACTIVITY ASSESSMENT OPPORTUNITIES. Grade 3 Quarter 1 Activity 2

AN ELEMENTARY SCHOOL STUDENT WASTE CASE STUDY Plate Waste Study. Funded by USDA SNAP-Ed, an equal opportunity provider and employer.

Archival copy. For current information, see the OSU Extension Catalog:

STA Module 6 The Normal Distribution

STA Module 6 The Normal Distribution. Learning Objectives. Examples of Normal Curves

Amy Porter FN / 20/ 06 Written Report

Wine Australia Wine.com Data Report. July 21, 2017

BOARD OF ZONING ADJUSTMENT STAFF REPORT Date: June 4, 2018

Supply & Demand for Lake County Wine Grapes. Christian Miller Lake County MOMENTUM April 13, 2015

OREGON AGRICULTURAL COLLEGE EXTENSION SERVICE. RALPH D. HETZEL, Director.

Report to Zespri Innovation Company Ltd. An Analysis of Zespri s 2003 Organic Kiwifruit Database: Factors Affecting Production

THOMA AND NEUBAUER IMPROVED CHAMBER COUNTING YEASTS (TIRAGE)

2016 Gourmia The Steelstone Group Brooklyn, NY

HOW TO OPEN A BUBBLE TEA SHOP

5 Populations Estimating Animal Populations by Using the Mark-Recapture Method

ASSEMBLY, No STATE OF NEW JERSEY. 217th LEGISLATURE PRE-FILED FOR INTRODUCTION IN THE 2016 SESSION

Fractions with Frosting

Algebra 2: Sample Items

MODEL# GMF255 MOKA POT MILK FROTHER. PHOTO OF PRODUCT

FOOD PREPARTION MANAGEMENT. Reading a Recipe

Lauren Paradiso, Ciara Seaver, Jiehao Xie

BRIQUTTE SECTOR IN KENYA. Briquettes have been produced on a small scale in Kenya since the 1970 s.

Directions for Menu Worksheet. General Information:

Harvesting Soybean. Soybean Loss. John Nowatzki Extension Agricultural Machine Systems Specialist

Berkeley Menu Assessment

A.P. Environmental Science. Partners. Mark and Recapture Lab addi. Estimating Population Size

Sales of Prepared Food by Food Service Providers

Silage Corn Variety Trial in Central Arizona

Saturday October 7, 2017

MARIE BRIZARD INTERNATIONAL BARTENDER SEMINAR AND COCKTAIL COMPETITION

MODEL# GCG165 COFFEE GRINDER. PHOTO OF PRODUCT

Beverage Cost Control: Managing for Profit

United States Standards for Grades of Italian Sprouting Broccoli

GALA SPLITTING WASHINGTON TREE FRUIT POSTHARVEST CONFERENCE. March 13 th & 14 th, 2001, Wenatchee, WA PROCEEDINGS, Gala Splitting page 1 of 6

Problem Set #15 Key. Measuring the Effects of Promotion II

Economic Contributions of the Florida Citrus Industry in and for Reduced Production

Wine On-Premise UK 2016

Transcription:

Size Matters: Smaller Batches Yield More Efficient Risk-Limiting Audits Small-Batch Audit Meeting Washington, DC 27 28 March 2010 Philip B. Stark http://statistics.berkeley.edu/~stark This document: http: //statistics.berkeley.edu/~stark/seminars/smallbatch10.pdf 1

Risk-Limiting Audits If the electoral outcome is wrong, there s a known minimum chance of a full hand count (which fixes it), no matter what caused the outcome to be wrong. The risk is the largest chance that an outcome that is wrong won t be fixed. Wrong means the outcome isn t what a full hand count would show. Role of statistics: Less counting when the outcome is right, but still a big chance of a full hand count when outcome is wrong. Bigger reductions are possible when batches are smaller: Need data plumbing and vote tabulation systems (VTSs) that export machine-readable subtotals for small batches ideally, individual ballots (CVRs). 2

California AB 2023 (Saldaña, sponsored by SoS Bowen): http://www.leginfo.ca.gov/pub/09-10/bill/asm/ab_2001-2050/ ab_2023_bill_20100325_amended_asm_v98.html First proposed audit that limits risk! 15560. (a) The Secretary of State is authorized to establish a postcanvass risk-limiting audit pilot program in five or more counties to improve the accuracy of, and public confidence in, election results. The Secretary of State is encouraged to include urban and rural counties; counties from northern, central, and southern California, and counties with various different voting systems. The volunteer counties audit one or more contests after each election in 2011. The Secretary of State reports to the Legislature by March 2012 on the effectiveness, efficiency, and costs of risk-limiting audits. 3

California AB 2023, contd.: (b)(3) Risk-limiting audit means a manual tally employing a statistical method that ensures a large, predetermined minimum chance of requiring a full manual tally whenever a full manual tally would show an electoral outcome that differs from the outcome reported by the vote tabulating device for the audited contest. A risk-limiting audit shall begin with a hand tally of the votes in one or more audit units and shall continue to hand tally votes in additional audit units until there is strong statistical evidence that the electoral outcome is correct. In the event that counting additional audit units does not provide strong statistical evidence that the electoral outcome is correct, the audit shall continue until there has been a full manual tally to determine the correct electoral outcome of the audited contest. Amen! 4

Quantifying the Evidence the Audit Sample Gives What is the biggest chance that if the outcome is wrong the audit would have found as little error as it did? That chance depends on how the sample is drawn and its size batch sizes and reported votes in each batch the errors that are found Chance can be big even if no errors are found if the sample is small or the margin is small or the batches are big. Don t stop counting until that chance is small! 5

Yolo County Measure P, November 2009 Reg. voters ballots precincts batches yes no 38,247 12,675 31 62 3,201 9,465 VBM and in-person ballots were tabulated separately (62 batches). For risk-limit 10%, initial sample size 6 batches; gave 4 distinct batches, 1,437 ballots. 6

Single-ballot audit can greatly reduce burden when the outcome is right. For actual 62 batches (31 IP, 31 VBM), audit required counting 1,437 ballots (11.33% of ballots cast) For risk-limit 10%, would need to look at CVRs for 6 ballots. Less than 0.05% of ballots cast one twentieth of one percent. For risk-limit 1%, would need to look at CVRs for 12 ballots. Less than 0.1% of ballots cast one tenth of one percent. (Assuming sufficiently few errors were found.) 7

Director, Esparto Community Service District, Yolo County Voters could select up to f = 2 candidates. 1 precinct; 988 registered voters; 187 ballots cast. Reg. ballots Jordan Pomeroy Fescenmeyer Moreland under over voters votes votes 988 187 95 80 64 62 57 8 Initial sample 32 ballots, for risk-limit 25%. 8

Jelly Beans 100 4oz bags of various flavors of jelly beans 25lbs in all. Some assorted flavors, some a single flavor. Want to estimate how many coconut beans there are in all. pour the 100 bags into a large pot and stir well. Draw 4oz without looking. Estimate the total number of coconut jelly beans to be the number in the sample, times 100. Select 1 of the 4oz bags at random. Estimate the total number of coconut jelly beans to be the number in that bag, times 100. 9

Jelly Beans, contd. Both estimates are unbiased but the first has much lower variability. Mixing disperses the coconut jelly beans pretty evenly throughout the pot. The sample is likely to contain coconut jelly beans in roughly the same proportion as the 100 bags do overall, so multiplying the number in the sample by 100 gives a reasonably reliable estimate of the total. Single bag is quite likely to have coconut jelly beans in a proportion quite different from the overall proportion, so multiplying that number by 100 could easily be far from the total number of coconut jelly beans among the 100 bags. More efficient to mix the beans before selecting 4oz. Then 4oz suffices to get a reasonably reliable estimate. 10

How salty is the stock? 100 12-ounce cans of stock, of a variety of brands, styles, and types: chicken, beef, vegetable, low salt, regular, etc. We want to know how much salt there is in all 1,200 ounces of stock. Open all the cans, pour the contents into a cauldron, stir well, and remove a tablespoon of the mix. Assay that tablespoon, multiply by the total number of tablespoons in the 100 cans (1T = 0.5oz, so the total number of tablespoons in the 100 cans is 12 100 2 = 2, 400T ). Select a can at random, determine the amount of salt in that can, and multiply by 100. 11

Salty stock, contd. The single tablespoon is extremely likely to have some stock from all 100 cans. The salt is likely to be spread out quite evenly through all the stock in the cauldron; it is very unlikely that the tablespoon will consist almost entirely of salt or almost entirely of water. Rather, the tablespoon is likely to contain salt in roughly the same concentration as the 100 cans do on the whole. Single can selected at random can be quite likely to contain salt in a concentration quite different from the 1,200 ounces of stock as a whole, unless all the cans have nearly identical concentrations of salt. 12

Connection to Election Auditing A vote-tabulation error is like a coconut jelly bean or a fixed quantity of salt. A precinct or other audit batch is like a bag of jelly beans or a can of stock. Drawing the audit sample is like selecting a bag or a 4oz scoop of jelly beans or a tablespoon or can of stock. 1st approach is like single-ballot or small-batch auditing. All the ballots are mixed together well. 2nd approach is like auditing using precincts or other large batches of ballots. 13

Numerical Examples 50,000 ballots, 500 ballots cast in each of 100 precincts. 1,000 (i.e., 2%) tallied incorrectly; 49,000 tallied correctly. What is the chance the sample contains any misinterpreted ballots at all? What is the chance that the percentage of misinterpreted ballots in the sample is at least half the percentage of misinterpreted ballots in the contest as a whole? 14

Sampling schemes draw a single precinct (batch) of 500 ballots at random divide the precincts into 10 batches of 50 ballots each and draw 10 batches at random without replacement from the resulting 1,000 batches draw a simple random sample of 500 ballots at random. 15

Chance of finding at least one misinterpreted ballot errors 1 pct 10 batches of 50 SRS of 500 10 in every pct 100% 100% 99.996% 10 in 98 pcts, 20 in 1 pct 99% 100% 99.996% 20 in 50 pcts 50% 99.9% 99.996% 250 in 4 pcts 4% 33.6% 99.996% 500 in 2 pcts 2% 18.4% 99.996% Phrased differently: if the sample has no misinterpreted ballots, the confidence that no more than 2% of the 50,000 ballots were misinterpreted is: sampling method 1 pct 10 batches of 50 SRS of 500 confidence 2% 18.4% 99.996% 16

Chance percentage of misinterpreted ballots in the sample is at least 1% errors by pct. 1 pct. 10 batches of 50 SRS of 500 10 in every pct 100% 100% 97.2% 10 in 98 pcts, 20 in 1 pct 99% 100% 97.2% 20 in 50 pcts 50% 62.4% 97.2% 250 in 4 pcts 4% 5.7% 97.2% 500 in 2 pcts 2% 18.4% 97.2% Phrased differently: if the sample has 1% misinterpreted ballots, the confidence that no more than 2% of the 50,000 ballots were misinterpreted is: sampling method 1 pct 10 batches of 50 SRS of 500 confidence 2% 18.4% 97.2% 17

What do we need for efficient audits? Laws that allow/require risk-limiting audits (such as CA AB 2023), but mostly... Data plumbing: Structured, small batch data export from VTSs. A way to associate individual CVRs with physical ballots. Reducing counting effort is largely about reducing batch sizes. 18