top of page

Your certificate is now private

CertificateBackground.png

Certificate of Completion

ErdosHorizontal.png

THIS ACKNOWLEDGES THAT

HAS COMPLETED THE SPRING 2026 DEEP LEARNING BOOT CAMP

Dharineesh Somisetty

Roman Holowinsky, PhD

MARCH 25, 2026

DIRECTOR

DATE

clear.png

TEAM

Fragmented ID Resolution

Noimot Bakare Ayoub, Dharineesh Somisetty, Arpith Shanbhag, Pedro Fontanarrosa

clear.png

Scope: Detect duplicate identities across noisy, fragmented datasets (fraud, patient mismatch, citizen records)
Architecture: CNN Embeddings + Siamese Network

Problem: Real-world identity data is messy, small inconsistencies cause one person to appear as multiple records, creating operational risk and inefficiency.

Approach: We learn record similarity using CNN embeddings and a Siamese network. LinkID detects, ranks, and resolves duplicate identities auto-linking high-confidence matches and routing borderline cases for review.

Data: HPI snapshot of North Carolina voter records with labeled duplicate and non duplicate pairs.

Results: Strong performance overall, with ~25-point improvement on hard cases where traditional models struggle.

Conclusion: Learned similarity models significantly outperform traditional approaches in complex identity resolution tasks.

Screen Shot 2022-06-03 at 11.31.35 AM.png
github URL

©2017-2026 by The Erdős Institute.

bottom of page