![]() |
||
![]() ![]() |
||
|
|
||
| Home Papers Policy Agendas Project Other Projects Books Position Papers |
PapersAutomated Classifcation of Congressional LegislationAbstract: For social science researchers, content analysis and classification of United States Congressional legislative activities has been time consuming and costly. The Lirbary of Congress THOMAS system provides detailed information about bills and laws, but its classification system, the Legislative Indexing Vocabulary (LIV), is geared toward information retrieval instead of the pattern or historical trend recognition that social scientists value. The same event (a bill) may be coded as about 1, 2, 3, or 12 subjects, with little indication of its primary emphasis. In addition, because the LIV system has not been applied to other activities, it cannot be used to compare (for example) legislative issue attention to executive media or public issue attention. This paper presents the Congressional Bills Project's automated classification system. This system applies a topic spotting classification algorithm to the task of coding legislative activities into one of 226 subtopic areas. The algorithm uses a traditional bag-of-words document representation, an extensive set of human coded examples, and an exhaustive topic coding system developed for use by the Congressional Bills Project and the Policy Agendas Project. Experimental results demonstrate that the automated system is about as effective as human assessors, but with significant time and cost savings. The paper concludes by discussing challenges to moving the system into operational use. |
|
![]() |
||
| 2008 University of Texas at Austin. All Right Reserved. Credits and Acknowledgements | Disclaimer | Citation Guidelines | ||
| Home | Codebooks | Datasets | Data Tools | Publications | News & Events | Resources | About Us | Site Help | Contact Us | Site Map | ||