Deduplication looks like tidying. It is not. A HubSpot merge is permanent and cannot be cleanly undone, so a wrong merge is not untidy, it is a data-loss event: two people's histories fused into one record, with no clean way back. That is why the interesting question in deduplication is not "how do I merge duplicates?" but "how do I avoid merging the wrong ones?"
The most common way teams get this wrong is matching on a single strong-looking signal. Here is why that is dangerous, and the method we use to merge safely.
Why matching on phone number alone is dangerous
A phone number feels like a unique identifier. It is not a person. Two genuinely different people regularly share one number:
- a couple or family on a single household mobile,
- a reception or main-line number entered against several individuals,
- a handset reissued to a new employee after someone leaves,
- a shared or hot-desk line.
Match on the number alone and you will confidently merge two different people. In a regulated database, where consent and history attach to the individual, that is exactly the merge you most need not to make.
Why matching on email alone is also risky
- Shared mailboxes and role addresses (info@, accounts@, sales@) belong to several people.
- Typo domains look like a match but are not, and a naive matcher both over-merges and under-merges around them.
The typo-domain trap, and correcting it
A misspelled email domain quietly defeats deduplication: the misspelled record looks different, so the duplicate is missed, and a careless matcher can also merge unrelated records that share the typo. The fix is to correct against a library of known domain misspellings before matching, not after, so the corrected address is what you match and the correctly spelled record is the one you keep.
The name-gate: the rule that makes merging safe
The single most important safeguard is simple: only auto-merge when the name agrees as well. If two records share a phone number or an email but the names do not match, they are not merged. They are held for a person to review.
That one rule is what stops two different people who share a handset being fused automatically. It turns the dangerous case (strong-signal match, different person) into a safe one (held for review), while still clearing the genuine duplicates without manual effort.
Confidence, not a binary merge
Better still is to stop treating a merge as on-or-off and score it. A confidence model (for example High, Medium, Low, Restricted) decides what happens to each candidate: high-confidence pairs merge automatically, anything uncertain is quarantined for human review, and certain cases are held entirely. The team works a defined review queue instead of trusting a bulk merge blindly or checking everything by hand.
Guardrails that should be non-negotiable
- a short-name guard so initials and very short surnames cannot drive a fuzzy match,
- automatic safety checks that abort the run if anything looks inconsistent,
- a full backup of every affected record before any merge,
- a dry run that is reviewed before anything is changed,
- a complete audit log of what was merged and why.
HubSpot's native tool does not apply an identity check like the name-gate, and its merges are permanent, so these safeguards have to come from how you run the job.
Safe merging is a decision library, not a button
The lesson underneath all of this: safe deduplication is a set of decisions (how you normalise, how you match, when you refuse to merge, who survives, what you hold for review), not a single button. The same decision library powers both a one-time backlog clean-up and an ongoing guardrail that keeps the database clean.
If you would rather have this done safely for you, see our HubSpot data engineering work, or read how we clear a large duplicate backlog at scale. Tell us what you are running and we will scope it.
FAQ
Can two different people share one phone number?
Yes, often: shared household mobiles, reception numbers, reissued handsets and hot-desk lines. That is why matching on phone number alone merges the wrong records.
Are HubSpot merges permanent?
Yes. A HubSpot merge cannot be cleanly undone, so safe deduplication relies on a dry run, a full pre-merge backup, and holding uncertain matches for review.
How do you avoid merging the wrong contacts?
With a name-gate: only auto-merge when the name agrees too, otherwise hold the pair for human review, plus a confidence score and a short-name guard.
Is matching on email alone safe?
No. Shared mailboxes, role addresses and typo domains all cause email-only matching to merge the wrong records.
Stay Updated with Our Latest Insights
Get expert HubSpot tips and integration strategies delivered to your inbox.

