Write a function that reads a very large text file and outputs the frequency of each word. Define your tokenization and normalization rules (case folding, punctuation, Unicode handling), and explain how to process inputs larger than RAM using streaming, chunking, or external sorting. Discuss producing the top‑K most frequent words efficiently and analyze time and space complexity.