Extract unique party mentions from HTML content. Uses Unicode-aware regex boundaries for proper word detection across scripts.
HTML content to search for party references
Set of canonical party codes found in the content
Extract unique party mentions from HTML content. Uses Unicode-aware regex boundaries for proper word detection across scripts.