Recent U.S. patents related to Text Categorization:
6,363,174: Method and apparatus for content identification and categorization of textual data
6,353,825: Method and device for classification using iterative information retrieval techniques
6,327,581: Methods and apparatus for building a support vector machine classifier
6,308,176: Associating files of data
6,308,172: Method and apparatus for partitioning a database upon a timestamp, support values for phrases and generating a history of frequently occurring phrases
6,304,864: System for retrieving multimedia information from the internet using multiple evolving intelligent agents
6,269,368: Information retrieval using dynamic evidence combination
6,263,335: Information extraction system and method using concept-relation-concept (CRC) triples
6,253,169: Method for improvement accuracy of decision tree based text categorization
6,247,004: Universal computer assisted diagnosis
6,233,575: Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
6,212,532: Text categorization toolkit
6,192,360: Methods and apparatus for classifying text and for building a text classifier
6,167,369: Automatic language identification using both N-gram and word information
6,161,130: Technique which utilizes a probabilistic classifier to detect "junk" e-mail by automatically updating a training and re-training the classifier based on the updated training set
6,137,911: Test classification system and method
6,078,918: Online predictive memory
6,076,088: Information extraction system and method using concept relation concept (CRC) triples
6,047,277: Self-organizing neural network for plain text categorization
6,038,561: Management and analysis of document information text
6,038,560: Concept knowledge base search and retrieval system
6,038,527: Method for generating descriptors for the classification of texts
6,026,388: User interface and other enhancements for natural language information retrieval system and method
6,021,404: Universal computer assisted diagnosis
6,006,223: Mapping words, phrases using sequential-pattern to find user specific trends in a text database
6,006,221: Multilingual document retrieval system and method using semantic vector matching
5,983,214: System and method employing individual user content-based data and user collaborative feedback data to evaluate the content of an information entity in a large information communication network
5,963,940: Natural language information retrieval system and method
5,960,422: System and method for optimized source selection in an information retrieval system
5,907,839: Algorithm for context sensitive spelling correction
5,905,863: Finding an e-mail message to which another e-mail message is a response
5,867,799: Information system and method for filtering a massive flow of information entities to meet user information classification needs
5,850,561: Glossary construction tool
5,748,973: Advanced integrated requirements engineering system for CE-based requirements assessment
5,687,364: Method for learning to infer the topical content of documents based upon their lexical content
5,675,710: Method and apparatus for training a text classifier
5,659,766: Method and apparatus for inferring the topical content of a document based upon its lexical content without supervision
5,652,829: Feature merit generator
5,526,443: Method and apparatus for highlighting and categorizing documents using coded word tokens
6,233,414: Methods and systems for providing capability and status indication of an imaging system
6,227,113: Printing machine and method of operating a printing machine
6,224,617: Methods and apparatus for defibrillating a heart refractory to electrical stimuli
6,222,539: Extended help feature for image forming devices
6,219,049: Mate inferencing