FOI Request LEX2979

Date: 2022

FOI Request Summary: LEX2979 - Senate Ballot Paper Sampling Methodology

Main Purpose of the FOI Request

The FOI request, identified as LEX2979, sought to obtain the specific document detailing the methodology for the ballot paper sampling process used in the audit of Senate ballot papers. It specifically requested the "advice from the Australian Bureau of Statistics" (ABS) and their "guidance for calculating, analysing and reporting the statistical conclusions that can be drawn," explicitly excluding any general process outlines already published by the AEC.

Documents from the FOI Request

The request resulted in the release of one document:
* "ABS Advice to AEC on sampling methodology"

Main Content from the FOI Request Documents

The released document, "ABS Advice to AEC on sampling methodology," outlines the statistical methodology recommended by the ABS to the Australian Electoral Commission (AEC) for assuring the accuracy of Senate ballot paper processing.

Key content includes:

Objective: To determine the optimal number of ballots to manually check to ensure a high level of confidence that the national error rate in the Senate ballot scanning and data extraction process is low. The primary focus is on "Stage 2 errors" (matching scanned images to extracted data), given that "Stage 1 errors" (scanned images matching physical ballots) were found to be negligible in previous audits.
Recommended Sampling Rates: The ABS recommends specific assurance rates per state/territory for Stage 2 testing, which vary based on the estimated number of Senate forms. Examples include:
- 1 in 3,000 ballots in New South Wales and Victoria.
- 1 in 120 ballots in the Northern Territory.
- This approach is estimated to involve checking 9,895 ballots nationally for the 2021/22 Senate election.
Confidence Targets: The methodology aims to achieve 99% confidence that the observed error rate in the sample for each state will be less than 1%, assuming a true population error rate of 0.45% (derived from the 2019 assurance). This translates to a 99% confidence that the national error rate is less than 6.5 errors per 1,000 ballot papers and less than 10 errors per 1,000 in any given state.
Efficiency Improvements: The proposed methodology is noted to be more efficient than the 2019 internal AEC approach, requiring fewer total ballots for assurance while providing higher confidence in the national error rate. It also allows assurance to occur concurrently with ongoing ballot processing.
Practical Implementation: The document details a "clustered sampling" approach for practical implementation. This involves:
- Selecting a proportion of ballot "bundles" (each containing 50 ballots) at a constant rate per state (e.g., 1 in every 300 bundles in NSW).
- Then, selecting 1 in 10 of the ballots within the chosen bundles for Stage 2 testing.
- Subsequently, selecting 1 in 10 of the Stage 2 sample for Stage 1 testing.
National Error Rate Calculation: Guidance is provided on how to calculate the national error rate when different sampling rates are applied across states, emphasizing the need to weight state-level errors by their proportion of the national population.
Alternate Options Considered: The document also discusses alternative sampling allocation strategies that were evaluated (e.g., constant national sample rate, maximum state margin of error constraint), explaining the rationale for selecting the final recommended approach for its robustness and practical benefits.

In essence, the released document provides the comprehensive statistical and practical guide developed by the ABS for the AEC's Senate ballot paper assurance process, detailing sample sizes, confidence levels, and implementation methods.

AI Generated

The provided FOI documents detail the Australian Bureau of Statistics (ABS) advice to the Australian Electoral Commission (AEC) on the statistical methodology for auditing Senate ballot papers. This technical advice focuses on determining the sample size and assurance rates for checking the accuracy of scanned ballot images against physical papers and the extraction of voter preferences.

From a politically left-leaning perspective, the documents reveal several points concerning social justice and civil liberties, while issues related to environmental impact, wealth distribution, and corporate influence are not directly addressed by the technical scope of this methodology.

Alignment with Progressive Values:

Commitment to Electoral Integrity and Civil Liberties: The core purpose of the ABS advice is to establish a statistically robust method to verify the accuracy of the Senate vote count. This directly aligns with the progressive value of protecting the civil liberty of every citizen to have their vote accurately recorded and counted. The aim for a "high level of confidence" (99%) that error rates are "low" (under 1% observed) demonstrates a commitment to maintaining public trust in democratic processes.
Transparency and Accountability: The very fact that this detailed methodology is accessible through an FOI request highlights a degree of transparency in government operations. This openness, where the public can scrutinize the statistical underpinnings of electoral audits, is crucial for holding government agencies accountable and fostering an informed citizenry, a cornerstone of progressive governance.
Data-Driven Decision Making: The reliance on the ABS, a non-political statistical authority, for expert advice underscores a commitment to objective, data-driven methodologies for election assurance. This approach prioritizes scientific rigor over arbitrary or politically motivated decisions, which enhances fairness and reduces opportunities for manipulation or bias in the electoral process.

Deviations from Progressive Values / Areas for Scrutiny:

Tolerance for Error and Social Justice Implications: While the document frames a 0.45% national error rate for Stage 2 (preference extraction) as "low" and aims to confirm it, a progressive analysis might question the acceptability of such a figure. The 2019 assurance estimated 69,065 errors nationally. Even statistically "low," errors in tens of thousands of ballot papers, particularly in preference allocation, could potentially alter outcomes in very close elections or disproportionately impact the representation of smaller parties, independent candidates, or specific demographics. From a social justice perspective, every vote, and every preference, should ideally be counted without error to ensure precise democratic representation. The question arises whether efficiency (fewer ballots assured than in 2019) is being prioritized over absolute accuracy, especially if errors are not truly random but systematically affect certain voting patterns or regions.
Potential Risks of Clustered Sampling: The proposed "clustered sampling" method, while acknowledged as more efficient, carries the stated risk that "Clustered samples can lead to lower accuracy if errors can also be clustered together." While the ABS states this risk has been balanced, a progressive lens would demand more robust transparency on the specific assessments and mitigation strategies for this risk. If, for example, a technical malfunction or human error affects an entire batch of ballots, concentrated errors could be missed, potentially skewing results in a specific area, which could have implications for the fairness of representation.
Disparities in State Assurance Rates: The recommended assurance rates vary significantly by state/territory (e.g., 1 in 3,000 in NSW/VIC vs. 1 in 120 in NT). While statistically justified for achieving confidence in smaller populations, it means a ballot in the Northern Territory is proportionally much more likely to be checked than one in New South Wales. While this ensures statistical validity across smaller populations, it highlights an unequal level of direct scrutiny per vote across different regions, raising a subtle question about equitable treatment, even if technically sound.
National vs. State-Level Focus: The new allocation explicitly aims for "higher confidence in the national error rate" with fewer total ballots. While efficient, a progressive perspective might question if this national optimization sufficiently prioritizes absolute accuracy and confidence within each state, particularly given observed differences in 2019 state-level error rates (e.g., ACT had zero errors). Applying a national average error rate assumption (0.45%) to all states "in the interests of simplicity" could be seen as an averaging that potentially masks or underplays localized issues or anomalies crucial for local electoral integrity.

In conclusion, these FOI documents demonstrate a technocratic commitment to ensuring electoral integrity through statistical sampling, aligning with progressive values of transparency and data-driven governance. However, they also prompt critical questions from a left-leaning perspective regarding the acceptable tolerance for electoral errors, the potential trade-offs between efficiency and absolute accuracy, and how statistical methods impact the fundamental right to a perfectly counted vote across all communities.

AI Generated

The FOI documents detail the Australian Bureau of Statistics’ (ABS) advice to the Australian Electoral Commission (AEC) on the statistical methodology for auditing Senate ballot papers, specifically focusing on the accuracy of scanning and data extraction processes. From a right-leaning analytical perspective, several key aspects align positively with conservative principles:

Economic Efficiency and Fiscal Responsibility:
The new assurance methodology is highlighted as requiring fewer ballots to be manually checked (9,895 vs. 10,400 in 2019) while simultaneously delivering higher confidence in the national error rate. This represents a direct gain in economic efficiency, achieving a better outcome with fewer resources. The ability to conduct assurance "while processing is ongoing" further contributes to efficiency by streamlining operations and potentially reducing labor costs associated with delays or separate audit phases. The adoption of "clustered sampling" for "logistical efficiency" also points to a pragmatic effort to optimize resource allocation, provided the acknowledged risk to accuracy from error clustering is genuinely offset by the built-in "slack" in the sample, as stated. This focus on optimizing output (confidence) relative to input (ballots assured, processing time) demonstrates a commitment to fiscal responsibility by ensuring taxpayer funds are used effectively for an essential government function.

Individual Liberty:
The core purpose of this methodology is to ensure a low error rate in the counting of Senate ballots. By striving for a high degree of confidence (e.g., 99% confidence that the observed error rate in any state will be less than 1%), the process aims to ensure that the outcome of elections accurately reflects the will of the voters. This is fundamental to individual liberty, as the integrity of the vote directly underpins the democratic process and the legitimate derivation of government power from the consent of the governed. A trustworthy electoral system is vital for citizens to have confidence that their individual vote contributes meaningfully to the selection of their representatives, thereby upholding their democratic rights.

Limited Government:
The documents illustrate an example of government agencies operating within their defined scope and seeking to perform their functions transparently and competently. The AEC, responsible for elections, consults with the ABS, an independent statistical authority, to ensure methodological rigor. This reliance on expert advice and the use of statistically sound practices to ensure accuracy, rather than arbitrary or opaque processes, aligns with the principle of limited government that operates under established rules and transparent accountability. It demonstrates a commitment to precise, data-driven governance in a core administrative function, rather than an expansion of discretionary power.

National Security (Indirectly):
While not directly related to defense or intelligence, a robust and trustworthy electoral system is a pillar of domestic stability. Public confidence in election results is critical to maintaining social cohesion and preventing unrest. By rigorously auditing the ballot counting process and transparently outlining the methodology, the government indirectly contributes to national security by bolstering faith in democratic institutions and mitigating internal divisions that could otherwise be exploited.

Deviation/Area for Scrutiny:
The documents implicitly reveal an existing national error rate (0.45% in 2019) and a tolerance for a continued, albeit low, error rate (e.g., less than 6.5 errors per 1,000 ballots nationally, or less than 1% in each state with 99% confidence). While the proposed method reduces this, a right-leaning perspective might always push for the lowest possible error rate, even zero, to maximize electoral integrity. However, the document acknowledges the practicalities and costs of achieving absolute perfection, justifying the current approach as a "conservative" balance between accuracy and efficiency.

In summary, the FOI documents largely depict government action that aligns with conservative principles of efficiency, fiscal prudence, safeguarding individual liberties through electoral integrity, and operating within defined roles using transparent, data-driven methods.

AI Generated

The released FOI documents, obtained only after a specific request for the underlying methodology beyond public outlines, shed light on the Australian Electoral Commission's (AEC) approach to "assurance" in Senate ballot paper audits. Far from robust oversight, the advice from the Australian Bureau of Statistics (ABS) appears to prioritize administrative convenience and cost-cutting over comprehensive scrutiny of electoral integrity.

Key findings raising red flags include:

Reduced Scrutiny, Not Enhanced Confidence: Despite claims of "higher confidence in the national error rate," the proposed methodology actually reduces the total number of ballots to be assured nationally (9,895 compared to 10,400 in 2019). This reduction occurs even as the total number of Senate forms is estimated to increase (16.095 million vs. 15.184 million). This suggests a deliberate reduction in the proportion of ballots checked, potentially leaving more errors undetected.
Unequal Oversight and Assumed Purity: The new model shifts away from assuring a constant number of ballots per state to a variable rate, leading to "less ballots in the less populous states." While this might contribute to a "national" confidence figure, it means smaller jurisdictions receive significantly less individual scrutiny. This could mask localized issues or systemic problems in less visible areas. Furthermore, the methodology largely assumes a national error rate of 0.45% (based on 2019 data) across all states, despite acknowledging that "the prevalence of stage 2 errors differed by state." This simplification could obscure actual, higher error rates in specific states.
Convenience Over Accuracy: The document repeatedly emphasizes efficiency: "simplifying the implementation," "speed up the assurance," and allowing checks "while processing is ongoing." This drive for speed and convenience is explicitly linked to the adoption of "clustered sampling" of ballots. While the report acknowledges this "can lead to lower accuracy if errors can also be clustered together," it dismisses the risk by claiming "some ‘slack’" was already allowed. This sounds suspiciously like a trade-off where logistical ease trumps the thoroughness needed for electoral integrity.
High Tolerance for Error: The stated confidence levels, even if achieved, permit a significant number of uncorrected errors. For example, there is "99% confidence that nationally there are less than 6.5 errors per 1,000 ballot papers." While sounding small, on 16 million votes, this translates to potentially over 104,000 undetected errors. This indicates a high acceptable threshold for inaccuracy within the electoral system.
"Illustrative Only" Disclaimers: The report undermines its own "statistical statements" by noting they are "illustrative only" and that "Final confidence intervals will depend on the actual error rates found." This suggests that the "high confidence" they promote is based on assumptions rather than guaranteed outcomes, potentially inflating public trust in the audit process.
Minimal Stage 1 Scrutiny: The crucial first stage of testing, ensuring the scanned image matches the physical ballot, receives even less attention: only "1 in 10 of the ballots selected for stage 2 testing." This relies heavily on a 2019 finding of "no errors," which is a risky assumption for future processes.

In essence, these documents reveal an electoral "assurance" process that appears designed to achieve a statistical comfort level with minimal effort and cost, rather than a truly exhaustive and transparent verification of every vote. The emphasis on "efficiency" and "simplicity" (for the AEC) seems to come at the expense of comprehensive scrutiny, unequal treatment of states, and a potentially high tolerance for errors, all while claiming "high confidence" through carefully managed statistical assumptions. This approach risks eroding public trust in the integrity of election outcomes.

AI Generated

These FOI documents clearly demonstrate the government's unwavering commitment to effective governance and the integrity of our democratic processes. They showcase how the Australian Electoral Commission (AEC) proactively sought expert statistical guidance from the Australian Bureau of Statistics (ABS) to enhance the assurance methodology for Senate ballot papers. This collaborative approach ensures that our electoral audits are grounded in robust, scientifically-backed principles.

The resulting methodology outlines a meticulous and highly efficient statistical sampling process, specifically designed to provide a high level of confidence that the national error rate in ballot processing is exceedingly low. Far from being a static process, this new approach builds on valuable lessons from the 2019 election, notably delivering higher national confidence with a reduced sample size and allowing for assurance to be undertaken concurrently with ballot processing. This represents a significant stride in operational efficiency and intelligent resource allocation, ensuring timely and accurate results for the public.

Understanding the diverse electoral landscapes across states and territories, the ABS recommended a strategically tailored assurance rate for each region. This nuanced approach, with rates adjusted to population size, is a testament to the government's dedication to achieving uniform confidence levels and accuracy nationwide, addressing regional specificities with precision.

Furthermore, the documents transparently address practical implementation challenges, such as the use of clustered sampling for logistical efficiency. This is not a compromise on integrity but a carefully balanced and expertly managed solution to ensure that the audit process is both effective and practical, without undermining statistical accuracy. The comprehensive consideration of alternative allocation models before arriving at the final recommendation further underscores the thoroughness and due diligence inherent in these processes.

In essence, these documents reveal a government dedicated to continuous improvement, leveraging inter-agency expertise, and implementing data-driven strategies to safeguard the accuracy and public trust in our elections. Every decision, from the sample rates to the implementation strategies, reflects a deep commitment to serving the public by ensuring a robust, reliable, and transparent democratic system.

AI Generated

The provided FOI documents lay bare a deeply flawed and disturbingly complacent approach to ensuring the integrity of Australian Senate elections. Far from guaranteeing accuracy, the methodology outlined by the ABS, adopted by the AEC, prioritizes administrative convenience and cost-cutting over rigorous scrutiny, effectively legitimizing a concerning level of vote miscounting.

Damning Revelations and Failures:

Explicit Acceptance of High Error Rates: The most egregious failure is the outright admission and acceptance of significant error rates. The 2019 Senate election data, used as a baseline, already showed a national error rate of 0.45% in Stage 2 (data extraction from scanned images). This is not "low" for an election; on 16 million ballots, this translates to an estimated 69,065 miscounted votes in 2019 alone. The new methodology confidently sets an acceptable upper limit for errors at 0.65% nationally (or 6.5 errors per 1,000 ballots) and a staggering 1% at the state level (10 errors per 1,000 ballots) with 99% confidence. This means the AEC is prepared to certify results where potentially over 100,000 votes could be misallocated, fundamentally undermining democratic legitimacy.
Sacrificing Accuracy for "Efficiency" and "Speed": The report repeatedly justifies methodological choices by prioritizing "speed up the assurance," "simplifying the implementation," and allowing processing "while ongoing." This explicit trade-off indicates a severe lack of commitment to absolute accuracy. The chosen clustered sampling method is even acknowledged to "lead to lower accuracy if errors can also be clustered together," a known flaw accepted for logistical "benefits." This is a clear case of operational convenience overriding the foundational principle of precise vote counting.
Reduced Scrutiny Despite Known Issues:
- Fewer Ballots Audited: The proposed 2021/22 assurance will examine fewer ballots (9,895) than the 2019 audit (10,400), despite a larger estimated vote count. This represents a deliberate reduction in oversight.
- Dangerous Assumption for Stage 1 Testing: The critical Stage 1 check (physical ballot matching scanned image) is drastically reduced, with only "1 in 10 of the ballots selected for stage 2 testing" being examined. This reduction is based on the flimsy justification of "no errors detected" in a small 2019 sample (1,368 ballots). To scale down vital verification on such thin evidence borders on negligence, leaving a gaping hole for undetected scanning errors or even deliberate image manipulation.
Deliberate Blindness to State-Level Failures: Despite acknowledging that "the prevalence of stage 2 errors differed by state," the methodology "assumed" the national 0.45% error rate for each state (except ACT). This choice actively ignores potential higher error rates or systemic issues in individual states, effectively masking localized problems under a national average. This approach protects the "national confidence" statistic while potentially allowing significant, localized vote miscounts to go unchecked and unaddressed.
Inflated Confidence in Flawed Metrics: The "high level of confidence" touted in the executive summary refers to the confidence that the observed error rate will stay below an already unacceptably high maximum (0.65% nationally, 1% per state), not confidence in achieving near-zero errors. The use of "round numbers" for sampling skips, prioritizing simplicity over statistical precision, further highlights the superficiality of the "robustness" claims. The "small buffer for error" is not about striving for greater accuracy, but merely protecting the statistical validity of a deliberately relaxed confidence interval.
"Illustrative Only" Disclaimers Undermine Trust: The admission that "These statistical statements are illustrative only. They are based on the assumption of a true error rate of 0.45%..." reveals the precariousness of their entire confidence framework. If the foundational assumption of a low error rate is incorrect (i.e., if the true error rate is higher than 0.45%), then the entire report's "confidence" projections are meaningless.

In essence, these documents reveal an electoral body that, supported by its statistical advisors, has adopted a methodology designed not to eliminate errors, but to certify the results while tolerating tens of thousands of miscounted votes. This approach prioritizes operational ease and statistical appeasement over the fundamental imperative of ensuring every vote is accurately counted, casting a dark shadow over the integrity of Australia's democratic process.

AI Generated

The provided Freedom of Information documents detail the Australian Bureau of Statistics’ (ABS) advice to the Australian Electoral Commission (AEC) on the sampling methodology for auditing Senate ballot papers. From an Objectivist perspective, these documents present a rare instance where bureaucratic activity largely aligns with, rather than violates, fundamental principles of reason and individual liberty, particularly in its commitment to verifiable truth and efficiency.

Alignment with Reason and Individual Liberty:
The core purpose of this advice is to ensure the accuracy and integrity of election results. This is paramount for upholding the individual's right to vote—a political right vital to a free society. A vote unreliably counted is a vote effectively nullified, thus undermining an individual's political agency. The document's emphasis on achieving a "high level of confidence" in the error rate, using precise statistical methodologies, is a testament to the application of reason to a critical administrative function. The pursuit of objective, verifiable data ("99% confidence that the observed error rate... will be less than 1%") stands as a bulwark against arbitrary decisions or potential manipulation, ensuring that facts, not whims or errors, determine electoral outcomes. This commitment to truth through rational means is indispensable for a society founded on individual rights and objective reality.

Virtue of Productive Achievement and Rational Self-Interest:
The ABS, through its specialized expertise, provides a valuable intellectual product: a statistically sound method to verify election accuracy. This is a productive achievement in the realm of applied science and administrative optimization. The recommendation to perform assurance with "fewer ballots to be assured" (9,895 compared to 10,400 in 2019) while simultaneously delivering "higher confidence in the national error rate" demonstrates a rational commitment to efficiency and prudent resource management. This benefits all productive individuals by ensuring that tax dollars are not squandered on redundant or ineffective processes, while simultaneously upholding the reliability of the system that impacts their self-governance. A stable, fact-based electoral process is undeniably in the rational self-interest of every individual who seeks to live and prosper in a free society, as it minimizes instability and preserves the framework for individual action and achievement.

Critique of Collectivism and Bureaucratic Interference:
While Objectivism advocates for a minimal government confined to protecting individual rights, the activity described herein falls squarely within this protective function. The auditing process is not an act of collectivist coercion but a rational verification procedure to safeguard the integrity of individual choices expressed through voting. There is no evidence of "forced altruism"; individuals are not compelled to sacrifice for an amorphous collective, but rather their own political rights are being secured. Nor is there suppression of personal initiative; this is a technical administrative function, not a control over economic activity or personal expression.

Any governmental body, by its very nature, can be prone to bureaucratic bloat and irrational interference. However, these documents showcase the opposite: a deliberate effort to streamline a process, enhance its accuracy, and utilize specialized knowledge for an objective, beneficial outcome. The discussion of "clustered sampling" and its balance of "risk to accuracy with benefits" for efficiency highlights a pragmatic, rational approach to implementation, mitigating potential bureaucratic inertia. The "interference" described is limited to the methodological rigor necessary to ensure factual accuracy, which, in the context of maintaining a legitimate electoral system, is a proper and limited function of government to protect rights, not violate them.

In conclusion, these documents describe a necessary, rational, and efficient governmental function aimed at upholding the integrity of individual political rights. By emphasizing reason, verifiable truth, and optimized resource allocation in its audit methodology, the process aligns with, rather than deviates from, the principles of individual liberty and the virtue of productive achievement.

AI Generated

FOI Request LEX2979, Schedule of Released Documents [PDF 546KB] (pdf)

Download cached file | Download from AEC


--- Page 1 ---

Request for: 

FOI REQUEST NO. LEX2979 

 

“The document specifying the methodology to be used for the ballot paper sampling process in the audit of Senate ballot papers. Not the process 
outline published here: https://www.aec.gov.au/About_AEC/cea-notices/files/2022/s273AC-senate-assurancemethodology-fe2022.pdf but the 
document referred to in the above document as "advice from the Australian Bureau of Statistics" and "ABS' guidance for calculating, analysing and 
reporting the statistical conclusions that can be drawn." 

Doc No.  Description 

ABS Advice to AEC on sampling methodology 

SCHEDULE OF RETRIEVED DOCUMENTS

Document Summary and Relevance to FOI Request LEX2979

This document, "ABS Advice to AEC on sampling methodology," directly addresses FOI request LEX2979 by providing the specific methodology sought. It details the statistical approach recommended by the Australian Bureau of Statistics (ABS) to the Australian Electoral Commission (AEC) for auditing the accuracy of Senate ballot paper processing, particularly targeting "Stage 2 errors." The document outlines varying sampling rates across states/territories (e.g., 1 in 3,000 for NSW, 1 in 120 for NT) designed to achieve 99% confidence that the national error rate remains low (e.g., below 6.5 errors per 1,000 ballot papers). It specifies the use of a "clustered sampling" technique, selecting bundles of ballots and then individual ballots within those bundles, and includes guidance for calculating the national error rate, also discussing alternative options considered. This document is central to the FOI request as it contains the precise methodological advice from the ABS that LEX2979 aimed to uncover.

AI Generated

LEX2979 Relevant Document - ABS Advice to AEC on sampling methodology.pdf (pdf)

Download file


--- Page 1 ---

ABS advice to AEC on sampling methodology 

Executive Summary 

The Australian Electoral Commission (AEC) has requested advice from the ABS to determine the 
number of ballots for assurance as part of the elections for the Australian Senate. The number of 
ballots that are manually checked for errors should be sufficient to demonstrate with a high level 
of confidence that the possible national error rate is low. 

The ABS recommends that Senate ballots should be assured at the following rate: 

•  1 in 3,000 ballots in New South Wales and Victoria; 
•  1 in 2,500 ballots in Queensland; 
•  1 in 1,250 ballots in Western Australia; 
•  1 in 1,000 ballots in South Australia; 
•  1 in 350 ballots in Tasmania; 
•  1 in 300 ballots in Australian Capital Territory; 
•  1 in 120 ballots in Northern Territory. 

Based on these rates, it is estimated that 9,895 ballots will be assured nationally for the 2021/22 
Senate election. A state breakdown is provided in Table 1: 

This assurance approach will provide a high level of confidence in confirming that the national 
error rate and error rates in each of the states and territories is low. 

In comparison with the internal AEC assurance approach implemented in 2019, the proposed 
allocation delivers a higher confidence in the national error rate, while requiring fewer ballots to 
be assured. The proposed approach also allows ballot assurance to be undertaken while 
processing. This is helpful to speed up the assurance. 

Background 

The Senate assurance process implements two stages of ballot testing. The first stage of testing 
checks that the scanned image matches the physical ballot paper. The second stage checks that 
the scanned image of the ballot paper matches the extracted data file, i.e. that the preferences 
from the scanned image match the datafile that is used to run the preference allocation process. 

An assurance of the 2019 Senate election found no errors during the first stage at ballot testing. 
The national estimate of the proportion of errors during the second stage of ballot testing is 
0.45%. The calculation of the national error rate is discussed here. 

1


--- Page 2 ---

The emphasis of this report is to determine an appropriate allocation to assurance for stage 2 
errors. Given that no stage 1 errors were detected as part of the 2019 assurance from a sample 
of 1,368, it is evident that the true stage 1 error rate is very low. For the purposes of stage 1 
testing, it should be sufficient to assurance 1 in 10 of the ballots selected for stage 2 testing. The 
practical implementation is discussed here. 

Recommended Allocation 

This section details the recommended allocation and diagnostics associated with it 
Alternate allocations were considered and informed the final recommended allocation. See 
Appendix. 

The allocation utilised the following assumptions. 

•  While the 2019 assurance indicated that the prevalence of stage 2 errors differed by 
state, the difference between the state and national proportion of errors was not 
statistically significant, with the exception of the ACT, which had no errors detected.1 
Therefore, the calculated national stage 2 error rate of 0.45% was assumed in each 
state.  

•  An estimate of 16.095 million Senate forms nationally for the 2021/22 election. The 

distribution of form by state as provided by the AEC – see Table A1. 

The main criterion implemented for designing the target number of ballots to assurance by state 
was to have 99% confidence that the observed error rate in the sample for each state will be less 
than 1%, assuming that an error rate of 0.45% (as estimated in 2019) applies for the full 
population of senate votes. 

The minimum sample size to achieve this is to select 828 ballots in each state and territory – see 
Appendix for details. 

The recommended allocation places sample beyond this minimum value into each state. This is 
a conservative approach to ensure we have enough sample to meet the accuracy targets, and it 
produces round numbers for the sampling skips to be used, simplifying the implementation of this 
proposal.  It also helps to ensure robustness. The sample allocation will remain statistically valid 
if the actual number of Senate ballots in a particular state or the error rate differs slightly from 
what has been assumed.  

Table 1: Number of ballots to assure for stage 2 error by state 

 State 

Estimated 
Forms 2021/22 

Estimated 
Ballots assured 
(stage 2) 

Assurance 
Rate (1 in X 
ballots) 

95% confidence 
limit for maximum 
error rate 

99% confidence 
limit for maximum 
error rate 

NSW 

5,200,000 

VIC 

QLD 

SA 

WA 

4,130,000 

3,180,000 

1,200,000 

1,590,000 

1,733 

1,377 

1,272 

1,200 

1,272 

3,000 

3,000 

2,500 

1,000 

1,250 

2 

0.72% 

0.75% 

0.77% 

0.77% 

0.77% 

0.83% 

0.88% 

0.89% 

0.91% 

0.89% 

1 The 2019 assurance found zero errors in ACT, during stage 2 testing.  Consequently, there is over 95% 
confidence that the true ACT stage 2 error rate is less than the national stage 2 error rate. The national second 
stage error rate is applied to ACT in the interests of simplicity and to ensure that ACT is not under-allocated.


--- Page 3 ---

TAS 

NT 

ACT 

AUS 

387,000 

115,000 

293,000 

1,106 

958 

977 

350 

120 

300 

16,095,000 

9,895 

0.79% 

0.81% 

0.81% 

0.59% 

0.92% 

0.96% 

0.95% 

0.65% 

Testing conclusions 

Based on the observed error rates from the 2019 assurance and the sample sizes in each state 
the following statistical statements could be made. 

• 

If there is a 0.45% error rate found in the assurance sample, then the AEC can be 95% 
confident that nationally, there are less than 6 errors per 1,000 ballot papers in the 
Senate scanning process.  It is also true that if the true error rate in the population is 
0.45%, then the AEC can be 95% confident that the error rate estimated from the 
assurance sample will be less than 6 errors per 1,000 ballot papers. 

•  Similarly, there is 99% confidence that nationally there are less than 6.5 errors per 1,000 

• 

ballot papers. 
In any given state, there is 99% confidence that there are less than 10 errors per 1,000 
ballot papers. 

These statistical statements are illustrative only. They are based on the assumption of a true 
error rate of 0.45% in the population to give confidence on the size of the estimated error rate 
from the sample; or similarly on the assumption of an error rate of 0.45% in the assurance 
sample to give confidence in what the error rate is for the full population.  Final confidence 
intervals will depend on the actual error rates found during the 2021/22 assurance. 

Comparison with 2019 assurance approach 

It is instructive to compare the proposed assurance approach with the assurance approach 
previously implemented in 2019. 

First, it is noted that the total expected number of ballots to assurance (9,895) is slightly lower 
than in 2019 (10,400).  

Secondly, rather than assuring a constant number of ballots in each state, the proposed 
allocation is assurances of more ballots in the more populous states and less ballots in the less 
populous states.  

Increasing the number of ballots assured in the more populous states allows the proposed 
allocation to deliver a higher confidence in the national error rate, while assuring a smaller 
number of ballots. 

Third, it is specified to assure at a constant rate in each state, rather than a fixed total number of 
ballots. This is efficient to allow ballots to be assured while processing is ongoing, rather than 
having to wait for all ballots to be processed before commencing assurance. 

3


--- Page 4 ---

Practical implementation of assuring 

The AEC arranges senate ballots into bundles of 50. From a logistical perspective, it would be 
more efficient to first select a number of bundles and then select more than one ballot from each 
bundle. 

Furthermore, selecting bundles at a constant rate allows assurance to be undertaken while 
processing is ongoing – as it will not be necessary to have every bundle processed for assurance 
to commence. 

This is known as clustered sampling of the ballots.  Clustered samples can lead to lower 
accuracy if errors can also be clustered together, i.e. if errors are not evenly spread across all 
bundles.  We have suggested an approach that we believe balances the risk to accuracy from 
using a clustered sample with the benefits that it provides, i.e. reducing the number of bundles 
that need to be selected for the assurance sample.  The allocations provided in Table 1 have 
already allowed for some ‘slack’ by selecting more ballots than strictly necessary to obtain a 
precise national estimate of the stage 2 error. 

We propose the assurance selects a certain proportion of ‘bundles’ (e.g. 1 in every 300 bundles 
in NSW) and then to select 1/10 of all ballots in the bundle for stage 2 testing (so that overall 1 in 
every 3,000 ballots is selected in NSW). 

Once ballots have been selected for stage 2 testing, select 1 in every 10 of the stage 2 sample 
for stage 1 testing.  

If the sampling rate from Table 1 is adopted, then the process is described below in Table 2. 

Table 2: Number of forms to assure by state 

 State 

Estimated 
Forms 
2021/22 

Estimated 
Bundles 
2021/22 

NSW 

5,200,000 

104,000 

4,130,000 

82,600 

3,180,000 

63,600 

1,200,000 

24,000 

1,590,000 

31,800 

387,000 

115,000 

293,000 

7,740 

2,300 

5,860 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

AUS 

Assurance 
Rate  
(1 in X 
bundles) 

Estimated 
Bundles 
selected 

Estimated 
Ballots 
assured  
(stage 2) 

Assurance 
Rate  
(1 in X 
ballots) 

Estimated 
Ballots 
assured 
(stage 1) 

300 

300 

250 

100 

125 

35 

12 

30 

347 

275 

254 

240 

254 

221 

192 

195 

1,733 

1,377 

1,272 

1,200 

1,272 

1,106 

958 

977 

9,895 

3,000 

3,000 

2,500 

1,000 

1,250 

350 

120 

300 

173 

138 

127 

120 

127 

111 

96 

98 

989 

16,095,000 

321,900 

1,979 

4


--- Page 5 ---

Calculating the national error rate 

If an assurance approach uses a different sampling rate in different states, then in order to 
calculate the national error rate, it is  important to weight the number of errors found in each state 
by the state’s proportion of the national population. 

Table 3: 2019 assurance calculation of national error rate 

Total Senate 
ballots 2019  
(formal + informal) 

Proportion of 
national total 

Stage 2 
errors 
2019 

Stage 2 
sample 
2019 

Error 
rate 

Estimated 
total errors 

4,905,472 

3,896,236 

2,999,372 

1,134,556 

1,497,532 

365,272 

108,994 

276,651 

15,184,085  

32.3% 

25.7% 

19.8% 

7.5% 

9.9% 

2.4% 

0.7% 

1.8% 

7 

6 

6 

5 

4 

6 

2 

0 

1,300 

0.54% 

1,300 

0.46% 

1,300 

0.46% 

1,300 

0.38% 

1,300 

0.31% 

1,300 

0.46% 

1,300 

0.15% 

1,300 

0.00% 

26,414 

17,983 

13,843 

4,364 

4,608 

1,686 

168 

0 

0.45% 

69,065 

 State 

NSW 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

AUS 

The error rate in each state is estimated by dividing the number of errors in each state by the 
assurance sample size.  For example, in NSW the assurance for 7 errors from a sample of 
1,300, giving an error rate of 0.54%.  An error rate of 0.54% would mean that there is a total of 
26,414 errors from the full population of 4,905,472 votes in NSW. 

After calculating the estimated number of total errors in each state they can be added to produce 
an estimate of total number of errors in Australia.  This total is 69,065 based on the 2019 
assurance results. 

Dividing the estimate of 69,065 errors by the total national votes of 15,184,085 gives the 
estimated national error rate of 0.45%. 

An alternate approach to calculate this national error rate is to multiply the error rate in each state 
by the proportion of votes in that state.  This gives:  
(0.323 x 0.0054) + (0.257 x 0.0046) + (0.198 x 0.0046) + (0.075 x 0.0038) +  
(0.099 x 0.0031) + (0.024 x 0.0046) + (0.007 x 0.0015) + (0.018 x 0)  
= 0.0045.   

5


--- Page 6 ---

Appendix 

Table A1: Estimated senate forms by state for 2021/2022 Senate Election – source AEC 

State 

Estimated 
Senate Forms 

NSW 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

5,200,000 

4,130,000 

3,180,000 

1,200,000 

1,590,000 

387,000 

115,000 

293,000 

Table A2: number of stage 2 errors by state – 2019 Senate assurance – source AEC 

 State 

NSW 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

Stage 2 errors 
2019 assurance 

2019 Error rate 

7 

6 

6 

5 

4 

6 

2 

0 

0.54% 

0.46% 

0.46% 

0.38% 

0.31% 

0.46% 

0.15% 

Alternate allocations 

This section outlines various allocation options that were considered, that informed the final 
recommended approach. These options are presented for technical background and can be 
skipped. 

The allocation described in Table 1 represents the ABS’ main recommendation.  

6


--- Page 7 ---

Option A1: Allocation using a constant national sample rate 

The first option considered is to apply a constant assurance rate across each state nationally. 
This would differ from the assurance process from 2019, which assured a constant number of 
ballots (1,300) in each state as part of stage 2 testing.  

The advantages of applying a constant sample rate nationwide, is that it would allow the same 
assurance procedure to be applied in each state. Furthermore, the estimate of the national error 
rate would be easier to interpret as no weighting would be required. 

The disadvantage of applying a constant sample rate is that the smallest states would have 
relatively few ballots assured. This would result in a less confidence in the estimate of the state 
error rate. 

Sample allocations 

Table A3 shows the national level of accuracy associated with different sample sizes, while 
applying a constant sample rate nationally. 

Table A3: National sample size vs 95% margin of error of estimate 

Scenario 

National 
sample size 

1 in 
Rate 

One-sided 95% 
confidence level 

One-sided 99% 
confidence level 

A 

B 

C 

10,400 

1,548 

5,810 

2,770 

6,438 

2,500 

0.56% 

0.60% 

0.59% 

0.61% 

0.66% 

0.65% 

Scenario A represents the national sample size that was used for stage 2 testing as part of the 
2019 assurance. Scenario B represents the minimum national sample size to be 95% confident 
that the national error rate is less than 0.6%. 

From a practical perspective, it would make sense to use a larger sample size than this. 
Scenario C represents this, using a ‘round’ sample rate of 1 in 2,500 dwellings for each state.  

Table A4: Number of forms to assurance by state by scenario 

Estimated 
Forms 
2021/22 

5,200,000 

4,130,000 

3,180,000 

1,200,000 

1,590,000 

387,000 

115,000 

293,000 

 State 

NSW 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

Scenario A 

Scenario B 

Scenario C 

3,360 

2,669 

2,055 

775 

1,027 

250 

74 

189 

1,877 

1,491 

1,148 

433 

574 

140 

42 

106 

2,080 

1,652 

1,272 

480 

636 

155 

46 

117 

 TOTAL 

16,095,000 

10,400 

5,810 

6,438 

7


--- Page 8 ---

It is evident that if precisely estimating the national error rate is the key objective, than the 
sample rate required can be significantly lower than what was applied in 2019 (Scenario A). 
It is also clear that this approach results in a relatively small number of ballots being sampled in 
Tasmania, Northern Territory and Australian Capital Territory. 

Option A2: Allocation with maximum state margin of error (MOE) constraint 

A notable disadvantage of applying a fixed sampling rate across all states is that the number of 
ballots assured in the smaller states is low. This will result in wide confidence intervals for the 
state level estimates of proportion of errors in smaller states/territories. 

The following two allocations examine the number of ballots required to be assured in each state 
in order to be 95% or 99% confident that the true state level error rate would be less than 1%  

Table A5 : state assurance size required to be 95/99% confident that the true error rate < 1% 

State one-sided confidence 
interval 

State sample 

National 95% confidence 
interval bound 

National 99% confidence 
interval bound 

95% 

413 

99% 

828 

0.71% 

0.64% 

0.82% 

0.71% 

Therefore, the state allocation to be 99% confident that the observed error rate is less than 1% in 
each state (assuming a 0.45% error rate in the population) is as in Table A6. 

Table A6 : State sample size and rate to be 99% confident that the assurance error rate is less than 1% 

 State 

Estimated 
Forms 
2021/22 

 State 
sample 

State 
sample rate 
(1 in X) 

NSW 

5,200,000 

VIC 

QLD 

SA 

WA 

TAS 

NT 

ACT 

4,130,000 

3,180,000 

1,200,000 

1,590,000 

387,000 

115,000 

293,000 

828 

828 

828 

828 

828 

828 

828 

828 

6,280 

4,988 

3,841 

1,449 

1,920 

467 

139 

354 

Table A6 was used as the basis behind the recommended option in Table 1. Additional sample 
was put into each state, in order to round off the sampling rates, and to allow a small buffer for 

8


--- Page 9 ---

error (e.g. if total votes in a state is smaller than expected; or if the true population error rate is 
higher than 0.45%). 

9


--- Page 10 ---

Glossary2 

Confidence Interval 

A confidence interval is an interval which has a known and controlled probability (generally 95% 
or 99%) to contain the true value. In the context of senate assurance, one-sided confidence limits 
are calculated for the stage 2 error rates, to determine the maximum error rate that could 
potentially occur, for the given level of confidence. 

Margin of Error (MoE) 

Margin of Error describes the distance from the population value that the assurance estimate is 
likely to be within, for a specified given level of confidence. For instance, at the 95% confidence 
level, the MoE indicates that there are about 19 chances in 20 that the estimate will differ from 
the population value (the figure obtained if all senate ballots had been assured) by less than the 
specified MoE. Equivalently it is one chance in 20 that the difference is greater than the specified 
MoE, i.e. outside the MoE. . 

Significance testing 

To determine whether a difference between two survey estimates is a real difference in the 
populations to which the estimates relate, or merely the product sampling variability, the 
statistical significance of the difference can be tested. The test is performed by calculating the 
standard error of the difference between two estimates and then dividing the actual difference by 
the standard error of the difference. If the result is greater than 1.96, there are 19 chances in 20 
that there is a real difference in the populations to which the estimates relate.  

Standard error 

The square root of the variance of the sampling distribution of a statistic (square root of variance 
of state or national error rate in the context of senate assurance) 

Variance 

The variance is the mean square deviation of the variable around the average value. It reflects 
the dispersion of the empirical values around its mean. 

2 Glossary definitions have been taken from ABS publications and The OECD Glossary of Statistical Terms 
 and modified to fit the context of senate assurance 

10

This document, "ABS Advice to AEC on sampling methodology," directly addresses FOI request LEX2979 by detailing the Australian Bureau of Statistics' (ABS) recommended statistical methodology for sampling Australian Senate ballot papers. The core purpose is to audit the accuracy of ballot paper processing, specifically focusing on "Stage 2 errors" (where the scanned image of a ballot paper does not match the extracted preference data).

The methodology proposes varying sampling rates per state and territory to achieve high confidence in the accuracy:
* 1 in 3,000 ballots in New South Wales and Victoria
* 1 in 2,500 ballots in Queensland
* 1 in 1,250 ballots in Western Australia
* 1 in 1,000 ballots in South Australia
* 1 in 350 ballots in Tasmania
* 1 in 300 ballots in Australian Capital Territory
* 1 in 120 ballots in Northern Territory

This approach is estimated to involve assuring 9,895 ballots nationally. It aims to provide 99% confidence that the national error rate is low (e.g., less than 6.5 errors per 1,000 ballot papers) and that in any given state, there are less than 10 errors per 1,000 ballot papers.

Key aspects of the methodology include:
* Efficiency: It is deemed more efficient than the 2019 approach, requiring fewer ballots while providing higher national confidence and allowing assurance to occur concurrently with processing.
* Clustered Sampling: It utilizes a clustered sampling technique, selecting bundles of 50 ballots at specified rates (e.g., 1 in 300 bundles in NSW) and then sampling individual ballots (e.g., 1 in 10 ballots within selected bundles for Stage 2 testing, and 1 in 10 of those for Stage 1 testing).
* Error Rate Calculation: The document provides guidance on how to calculate the national error rate by weighting errors found in each state by that state's proportion of the national population.

The advice outlines the statistical assumptions, diagnostic information, and alternative sampling allocation options considered, solidifying the transparency of the ABS's recommendations to the Australian Electoral Commission (AEC) for ensuring the integrity of Senate vote processing.

AI Generated

AEC FOI Disclosure Log Archive

FOI Request LEX2979

FOI Request Summary: LEX2979 - Senate Ballot Paper Sampling Methodology

Main Purpose of the FOI Request

Documents from the FOI Request

Main Content from the FOI Request Documents

FOI Request LEX2979, Schedule of Released Documents [PDF 546KB] (pdf)

Document Summary and Relevance to FOI Request LEX2979

LEX2979 documents [ZIP 350KB] (zip)

ZIP Contents

LEX2979 Relevant Document - ABS Advice to AEC on sampling methodology.pdf (pdf)