Computation of term dominance in text documents

Patent Number: 8,166,051
Issued: 4/24/2012
Official Filing: View the Complete Patent
Abstract: An improved entropy-based term dominance metric useful for characterizing a corpus of text documents, and is useful for comparing the term dominance metrics of a first corpus of documents to a second corpus having a different number of documents.
Filed: 2/3/2009
Application Number: 12/364,753
Government Interests: STATEMENT OF GOVERNMENT INTEREST This invention was made with Government support under Contract No. DE-NA0003525 awarded by the United States Department of Energy/National Nuclear Security Administration. The Government has certain rights in the invention.