Internet-Draft | Prio L1 Bound Sum | November 2024 |
Thomson & Cook | Expires 31 May 2025 | [Page] |
A Prio Verifiable Distributed Aggregation Function is defined that supports vector or histogram addition, where the sum of the values in the contribution is less than a chosen value.¶
This note is to be removed before publishing as an RFC.¶
The latest revision of this draft can be found at https://martinthomson.github.io/prio-l1-bound-sum/draft-thomson-ppm-l1-bound-sum.html. Status information for this document may be found at https://datatracker.ietf.org/doc/draft-thomson-ppm-l1-bound-sum/.¶
Discussion of this document takes place on the Privacy Preserving Measurement Working Group mailing list (mailto:ppm@ietf.org), which is archived at https://mailarchive.ietf.org/arch/browse/ppm/. Subscribe at https://www.ietf.org/mailman/listinfo/ppm/.¶
Source for this draft and an issue tracker can be found at https://github.com/martinthomson/prio-l1-bound-sum.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 31 May 2025.¶
Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
Existing Prio instantiations of a Verifiable Distributed Aggregation Function (VDAF) [VDAF] all support a simple summation of measurements. From Prio3Count (Section 7.4.1 of [VDAF]), which adds measurements containing a single one or a zero value, to Prio3SumVec (Section 7.4.3 of [VDAF]), which adds measurements containing an vector where each dimension is a limited number of bits, all instantations take the same basic form.¶
One case that is presently not included in the suite of instantiations is the addition of vectors or histogram contributions, where each measurement has an L1 bound. The L1 norm of a vector is defined as the sum of its components. An L1 bound limits that sum to some maximum.¶
This document defines the Prio3L1BoundSum instantiation. This instantiation limits the L1 norm of a vector or histogram to a value that is one less than a chosen power of 2, or 2n-1. This choice significantly reduces the size of the encoding relative to a more flexible limit.¶
This instantiation has similarities with other instantiations. Unlike Prio3Histogram (Section 7.4.4 of [VDAF]), in which measurements need to have an L1 norm of exactly 1, a valid measurement for Prio3L1BoundSum can have an L1 norm equal to any value between 0 and the chosen limit. Unlike Prio3MultiHotCountVec (Section 7.4.5 of [VDAF]), in which each component can only be zero or one, components in Prio3L1BoundSum can take any value up to the L1 bound as long as their sum is within that bound.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
This document uses the terminology and functions defined in Section 2 of [VDAF].¶
The Prio3L1BoundSum instantiation of Prio [PRIO] supports the addition of a vector of integers. The instantiation is summarized in Table 1.¶
Parameter | Value |
---|---|
field | Field128 (Section 6.1.2 of [VDAF]) |
Valid | L1BoundSum(field, length, bits, chunk_length) |
PROOFS | 1 |
XOF | XofTurboShake128 (Section 6.2.1 of [VDAF]) |
The function takes three parameters:
length
, bits
, and chunk_length
.
The vector contains "length
" components,
each of which is a non-negative integer less than 2bits
.¶
The chunk_length
parameter can be chosen
in approximately the same way as for Prio3SumVec,
as detailed in Section 7.4.3.1 of [VDAF].
The difference is that Prio3L1BoundSum involves validation of
bits * (length + 1)
values,
which might increase the most efficient value for chunk_length
.¶
The encoded form of each measurement appends a bitwise decomposition of the L1 norm (the sum of the vector components) to the encoding:¶
def encode(self, measurement: list[int]) -> list[F]: encoded = [] weight = self.field(0) for v in measurement: weight += v encoded += self.field.encode_into_bit_vector(v, self.bits) w_bits = self.field.encode_into_bit_vector(weight, self.bits) return encoded + w_bits¶
The encoded measurement has a total length of (length + 1) * bits
.¶
This extra information is not included in the measurement
that is submitted for aggregation.
That is, the truncate()
function emits only the core measurements.¶
def truncate(self, meas: list[F]) -> list[F]: return [ self.field.decode_from_bit_vector(m) for m in chunks(meas, self.bits) ]¶
This uses a chunks(v, c)
function that takes a list of values, v
,
and a chunk length, c
,
to split v
into multiple lists from v
,
where each chunk has a length c
.¶
The decode()
function is therefore identical to that in Prio3SumVec.¶
def decode(self, output: list[F], _count) -> list[int]: return [x.as_unsigned() for x in output]¶
The validity circuit for Prio3L1BoundSum uses an extended version of the validity circuit used by Prio3SumVec, see Section 7.4.3 of [VDAF].¶
The encoded measurement is checked
to ensure that every component of the vector –
plus the added L1 norm –
is encoded in the specified number of bits.
That is, the circuit checks that each component has a value between
0 (inclusive) and 2bits
(exclusive)
by checking that each of the first "bits
" bits of the value
are either zero or one.
This process is identical to the Prio3SumVec check,
except that one additional value is checked.¶
The validity circuit then checks whether the added L1 norm value is consistent with the encoded vector elements. The L1 norm is checked by decoding the measurement values, including the encoded L1 norm, recomputing the L1 norm as the sum of the individual components, and subtracting the reported and computed values to confirm that they are identical.¶
The complete circuit is specified in Figure 1.¶
The Prio3L1BoundSum VDAF is subject to the same considerations as other Prio-based VDAFs. These considerations are detailed in Section 9 of [VDAF].¶
In particular, this instantiation uses Field128 to ensure robustness despite the use of joint randomness in proofs. Joint randomness increases the risk of an attacker finding a combination of invalid inputs that passes validation. A larger field increases the computational cost of finding such a combination.¶
This document registers a codepoint for Prio3L1BoundSum in the "Verifiable Distributed Aggregation Functions (VDAF)" registry as defined by Section 10 of [VDAF]. This entry contains the following fields:¶
Chris Patton provided extensive input into the construction of this VDAF.¶