Skip to content

Commit 737e562

Browse files
committed
Use "deduplication" instead of "de-duplication"
Apparently "deduplication" is a word, so let's use it.
1 parent cc67d29 commit 737e562

File tree

3 files changed

+19
-19
lines changed

3 files changed

+19
-19
lines changed

src/docs/asciidoc/api.adoc

Lines changed: 15 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -346,8 +346,8 @@ The following table sums up the main settings to create a `Producer`:
346346

347347
|`name`
348348
|The logical name of the producer. Specify a name to enable
349-
<<outbound-message-de-duplication,message de-duplication>>.
350-
|`null` (no de-duplication)
349+
<<outbound-message-deduplication,message deduplication>>.
350+
|`null` (no deduplication)
351351

352352
|`batchSize`
353353
|The maximum number of messages to accumulate before sending them to the broker.
@@ -357,7 +357,7 @@ The following table sums up the main settings to create a `Producer`:
357357
|[[producer-sub-entry-size-configuration-entry]]The number of messages to put in a sub-entry. A sub-entry is one "slot" in a publishing
358358
frame, meaning outbound messages are not only batched in publishing frames, but in sub-entries
359359
as well. Use this feature to increase throughput at the cost of increased latency and
360-
potential duplicated messages even when de-duplication is enabled.
360+
potential duplicated messages even when deduplication is enabled.
361361
|1 (meaning no use of sub-entry batching)
362362

363363
|`maxUnconfirmedMessages`
@@ -475,8 +475,8 @@ type system. It provides good interoperability, which allows streams
475475
to be accessed as AMQP 0-9-1 queues, without data loss.
476476
====
477477

478-
[[outbound-message-de-duplication]]
479-
===== Message De-duplication
478+
[[outbound-message-deduplication]]
479+
===== Message Deduplication
480480

481481
RabbitMQ Stream provides publisher confirms to avoid losing messages: once
482482
the broker has persisted a message it sends a confirmation for this message.
@@ -490,9 +490,9 @@ Luckily RabbitMQ Stream can detect and filter out duplicated messages, based
490490
on 2 client-side elements: the _producer name_ and the _message publishing ID_.
491491

492492
[WARNING]
493-
.De-duplication is not guaranteed when using sub-entries batching
493+
.Deduplication is not guaranteed when using sub-entries batching
494494
====
495-
It is not possible to guarantee de-duplication when
495+
It is not possible to guarantee deduplication when
496496
<<producer-sub-entry-size-configuration-entry, sub-entry batching>> is in use.
497497
Sub-entry batching is disabled by default and it does not prevent from
498498
batching messages in a single publish frame, which can already provide
@@ -502,9 +502,9 @@ very high throughput.
502502
====== Setting the Name of a Producer
503503

504504
The producer name is set when creating the producer instance, which automatically
505-
enables de-duplication:
505+
enables deduplication:
506506

507-
.Naming a producer to enable message de-duplication
507+
.Naming a producer to enable message deduplication
508508
[source,java,indent=0]
509509
--------
510510
include::{test-examples}/ProducerUsage.java[tag=producer-with-name]
@@ -518,13 +518,13 @@ will automatically recover and retry outstanding messages. The broker will then
518518
filter out messages it has already received and persisted. No more duplicates!
519519

520520
[IMPORTANT]
521-
.Why setting `confirmTimeout` to 0 when using de-duplication?
521+
.Why setting `confirmTimeout` to 0 when using deduplication?
522522
====
523-
The point of de-duplication is to avoid duplicates when retrying unconfirmed messages.
523+
The point of deduplication is to avoid duplicates when retrying unconfirmed messages.
524524
But why retrying in the first place? To avoid _losing_ messages, that is enforcing
525525
_at-least-once_ semantics. If the client does not stubbornly retry messages and gives
526526
up at some point, messages can be lost, which maps to _at-most-once_ semantics. This
527-
is why the de-duplication examples set the
527+
is why the deduplication examples set the
528528
<<producer-confirm-timeout-configuration-entry,`confirmTimeout` setting>> to `Duration.ZERO`:
529529
to disable the background task that calls the confirmation callback for outstanding
530530
messages that time out. This way the client will do its best to retry messages
@@ -539,11 +539,11 @@ stream at the same time.
539539

540540
====== Understanding Publishing ID
541541

542-
The producer name is only one part of the de-duplication mechanism, the other part
542+
The producer name is only one part of the deduplication mechanism, the other part
543543
is the _message publishing ID_. If the producer has a name, the client automatically
544544
assigns a publishing ID to each outbound message for the producer. The publishing ID
545545
is a strictly increasing sequence, starting at 0 and incremented for each message. The default
546-
publishing sequence is good enough for de-duplication, but it is possible to
546+
publishing sequence is good enough for deduplication, but it is possible to
547547
assign a publishing ID to each message:
548548

549549
.Using an explicit publishing ID
@@ -570,7 +570,7 @@ properties (e.g. `messageId`).
570570
[IMPORTANT]
571571
.Do not mix client-assigned and custom publishing ID
572572
====
573-
As soon as a producer name is set, message de-duplication is enabled.
573+
As soon as a producer name is set, message deduplication is enabled.
574574
It is then possible to let the producer assign a publishing ID to each
575575
message or assign custom publishing IDs. *Do one or the other, not both!*
576576
====

src/docs/asciidoc/overview.adoc

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ you need to use an AMQP 0-9-1 client library.
4949
RabbitMQ stream provides at-least-once guarantees thanks to the
5050
publisher confirm mechanism, which is supported by the stream Java client.
5151

52-
Message <<api.adoc#outbound-message-de-duplication,de-duplication>>
52+
Message <<api.adoc#outbound-message-deduplication,deduplication>>
5353
is also supported on the publisher side.
5454

5555
[[stream-client-overview]]
@@ -63,7 +63,7 @@ to build fast, efficient, and robust client applications.
6363
* _administrate streams (creation/deletion) directly from applications._ This
6464
can also be useful for development and testing.
6565
* _adapt publishing throughput_ thanks to the configurable batch size and flow control.
66-
* _avoid publishing duplicate messages_ thanks to message de-duplication.
66+
* _avoid publishing duplicate messages_ thanks to message deduplication.
6767
* _consume asynchronously from streams and resume where left off_ thanks to
6868
automatic or manual offset tracking.
6969
* _enforce https://blog.rabbitmq.com/posts/2021/07/connecting-to-streams/[best practices] to create client connections_ – to stream leaders for publishers to minimize inter-node traffic and to stream replicas for consumers to offload leaders.

src/docs/asciidoc/performance-tool.adoc

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -451,10 +451,10 @@ tracking consumers would ignore the specified offset and would start where they
451451

452452
===== Producer Names
453453
You can use the `--producer-names` option to set the producer names pattern and therefore
454-
enable <<api.adoc#outbound-message-de-duplication, message de-duplication>> (using the default
454+
enable <<api.adoc#outbound-message-deduplication, message deduplication>> (using the default
455455
publishing sequence starting at 0 and incremented for each message).
456456
The same naming options apply as above in <<api.adoc#consumer-names, consumer names>> with the only
457-
difference that the default pattern is empty (i.e. no de-duplication).
457+
difference that the default pattern is empty (i.e. no deduplication).
458458

459459
=== Building the Performance Tool
460460

0 commit comments

Comments
 (0)