Datasets
Below are links to download the datasets. Each dataset can be downloaded as an Excel file or as a tab-delimited text file. Codebooks for each dataset are also available in the table below. Some of the codebooks are PDF files; reading them requires Acrobat Reader, which is available free of charge.
To download a tab-delimited text file, right-click its link and select "Save Link As..." from the menu that appears. Left-clicking a text file link opens the file in the browser window; if this happens, go to the "File" menu in your browser and select "Save Page As." In both cases, make sure that the "Save As Type" box in the save dialog says "Text Document."
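Once a tab-delimited file has been saved, it can be read with any tool that splits fields on tab characters. The following is a minimal sketch using Python's standard `csv` module. The sample text, the column names (`id`, `year`, `majortopic`), and the helper `read_tab_delimited` are invented for illustration and are not taken from the actual files; consult each codebook for the real variable names.

```python
import csv
import io

# Hypothetical stand-in for a downloaded tab-delimited file. The column
# names and values are invented for illustration only.
SAMPLE = "id\tyear\tmajortopic\n1\t1947\t1\n2\t1947\t16\n3\t1948\t1\n"


def read_tab_delimited(handle):
    """Return the rows of a tab-delimited file as a list of dicts."""
    return list(csv.DictReader(handle, delimiter="\t"))


rows = read_tab_delimited(io.StringIO(SAMPLE))
print(len(rows), rows[0]["year"])
```

For a file on disk, the same function can be given `open(path, newline="")` in place of the `io.StringIO` object. The text encoding of the downloads is not stated here, so an explicit `encoding=` argument may be needed.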
Budget: OMB FY
This dataset provides annual data, adjusted for inflation, of U.S. Budget Authority from FY1947 through FY2008. Using Office of Management and Budget Functions and Subfunctions, we have revised the data to be consistent over time.
Codebook [ms word - 344 kb] [pdf - 354 kb]
Current Year Dollars & FY 2008 Dollars | [Excel - 846 kb] | N/A
Current Year Dollars | N/A | [Text - 54 kb]
FY 2008 Dollars | N/A | [Text - 56 kb]
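The relationship between the current-dollar and FY 2008-dollar files can be illustrated with a generic price-index conversion. This is only a sketch of the usual constant-dollar arithmetic: the page does not say which deflator was used, and the index values and the helper `to_constant_dollars` below are made up for the example.

```python
def to_constant_dollars(amount, deflator_year, deflator_base):
    """Convert a current-dollar amount into base-year dollars.

    Scales the amount by the ratio of the base-year price index to the
    price index of the year in which the amount was recorded.
    """
    return amount * deflator_base / deflator_year


# Made-up index values for illustration; these are NOT real OMB deflators.
DEFLATORS = {1980: 50.0, 2008: 100.0}

# 200 (current dollars, FY1980) expressed in FY 2008 dollars.
real = to_constant_dollars(200.0, DEFLATORS[1980], DEFLATORS[2008])
print(real)
```

Applying the function with the base year's own index leaves the amount unchanged, which is a quick sanity check on any deflator series.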
Congressional Quarterly Almanac: CQ Almanac
This dataset contains information from all articles in the main chapters of the CQ Almanac from 1948 to 2007 (~13,000 records). CQ Almanac articles typically cover one legislative initiative. When a CQ article contains information about several different public laws or bills, it is divided so that each record in our dataset contains information about one legislative initiative. We code each record by our system of policy content codes. Several other variables concerning each legislative initiative (e.g., bill numbers, the Public Law number if applicable, committees involved, and primary sponsors) are also available. Identification variables link our records to the original CQ source material as well as to our Public Laws dataset. A note of caution: article length has varied over the span of this dataset, so users should weight accordingly.
Codebook [web page]
CQ Data | [Tab Delimited Text]
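The caution about varying article length can be handled in more than one way, and the page does not prescribe a method. One possible reading, sketched below, is to measure each topic's share of a year's total article length rather than counting articles. The records, the field layout, and the function `length_weighted_shares` are hypothetical.

```python
from collections import defaultdict

# Invented records: (year, topic code, article length in lines).
RECORDS = [(1950, 1, 100), (1950, 2, 300), (2000, 1, 50), (2000, 2, 50)]


def length_weighted_shares(records):
    """Return each topic's share of its year's total article length."""
    totals = defaultdict(int)    # total lines per year
    by_topic = defaultdict(int)  # total lines per (year, topic)
    for year, topic, length in records:
        totals[year] += length
        by_topic[(year, topic)] += length
    return {key: lines / totals[key[0]] for key, lines in by_topic.items()}


shares = length_weighted_shares(RECORDS)
print(shares[(1950, 2)])
```

Because each share is normalized within its own year, years with longer or shorter articles overall remain comparable.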
Congressional Hearings: CIS Abstracts
This dataset contains information summarizing each U.S. Congressional hearing from 1947 to 2006 (~84,000 hearings). Using the Congressional Information Service (CIS) Abstracts, we code each hearing by our system of policy content codes. The system includes 19 major topics and over 200 subtopics. Other variables, including committee and subcommittee, are also available. Identification variables link our records to the original CIS source material. Note: Research making use of the congressional hearings dataset should bear in mind that the hearings for the last year available on our website are incomplete. This is due to the CIS archival system.
Codebook [web page]
All Hearings Data | [Tab Delimited Text - 14.5mb]
House Hearings Data | [Tab Delimited Text - 8.3mb]
Senate Hearings Data | [Tab Delimited Text - 5.9mb]
Joint Hearings Data | [Tab Delimited Text - 368kb]
Executive Orders
The Executive Orders dataset contains information about each executive order issued from 1945 to 2003 (~3,800 records). Each record is coded by our system of policy content categories and other variables. Other variables of interest include the party of the president, whether the order was issued during a time of divided government, and whether the order was issued at the beginning or end of a presidential term.
Codebook [ms word - 26 kb]
Executive Orders Data | [Tab Delimited Text - 752kb]
New York Times Index
This dataset is a systematic random sample of the New York Times Index from 1946 to 2005 (~35,000 records). The sample is the first entry on every odd-numbered page of the Index. Each entry is coded by the major topics from our system of policy content codes. Other variables include the length, date, and location of the story, whether it addressed government actions, etc.
Codebook [web page]
New York Times Index Data | [Tab Delimited Text - 5.8mb]
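The sampling rule described above (the first entry on every odd-numbered page) can be sketched as a simple filter. The input format, a list of `(page_number, entry_text)` pairs in index order, and the function name are assumptions made for this example.

```python
def first_entry_on_odd_pages(entries):
    """Pick the first entry on every odd-numbered page.

    `entries` is an iterable of (page_number, entry_text) pairs given in
    the order they appear in the index.
    """
    sample = []
    seen_pages = set()
    for page, text in entries:
        # Keep an entry only if its page is odd and not yet sampled.
        if page % 2 == 1 and page not in seen_pages:
            seen_pages.add(page)
            sample.append((page, text))
    return sample


# Invented index entries for illustration.
ENTRIES = [(1, "a"), (1, "b"), (2, "c"), (3, "d"), (3, "e"), (4, "f"), (5, "g")]
sample = first_entry_on_odd_pages(ENTRIES)
print(sample)
```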
Gallup's Most Important Problem: Gallup Public Opinion Polls, Most Important Problem Question
This dataset contains the responses to Gallup's "Most Important Problem facing the nation" question from 1947 to 2007 (~1,200 records). For each poll taken, the responses are coded by issue area. This worksheet captures the aggregated proportions for each major category, on an annual basis, for all of the polls contained in the working dataset. The annual relative rankings of each issue area are also provided.
Codebook [ms word - 3 mb]
MIP Annual Data | [Tab Delimited Text - 64kb]
MIP Quarterly Data | [Tab Delimited Text - 21kb]
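To make the ideas of annual proportions and relative rankings concrete, the sketch below averages each issue's share across the polls in a year and then ranks issues within that year. The page does not state the exact aggregation method, so the simple unweighted average is an assumption, as are the data and the function names.

```python
from collections import defaultdict

# Invented poll results: (year, issue area, share of responses in one poll).
POLLS = [
    (1990, "economy", 0.40), (1990, "defense", 0.20),
    (1990, "economy", 0.60), (1990, "defense", 0.10),
]


def annual_proportions(polls):
    """Average each issue's share across all polls taken in a year."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for year, issue, share in polls:
        sums[(year, issue)] += share
        counts[(year, issue)] += 1
    return {key: sums[key] / counts[key] for key in sums}


def annual_rankings(proportions):
    """Rank issues within each year; rank 1 has the largest proportion."""
    by_year = defaultdict(list)
    for (year, issue), share in proportions.items():
        by_year[year].append((share, issue))
    ranks = {}
    for year, pairs in by_year.items():
        for rank, (_, issue) in enumerate(sorted(pairs, reverse=True), start=1):
            ranks[(year, issue)] = rank
    return ranks


proportions = annual_proportions(POLLS)
ranks = annual_rankings(proportions)
print(ranks)
```

Ties are broken arbitrarily by issue name in this sketch; a real ranking would need an explicit tie rule.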
Public Laws: U.S. Public Laws
This dataset contains information about each public law passed from 1948 to 2007 (~19,000 records). Each record is coded by our system of policy content categories and other variables. Identification variables allow linkage to the CQ Almanac dataset. NEW: Dataset now dynamically links users to the full text (104th to 108th) and bill summary (93rd to 108th) information on THOMAS and other public domain websites.
Most Important Laws 1948-1998
This dataset identifies 576 of the most important laws over the period 1948 through 1998. The identification of the most important laws is based on CQ lines of coverage (with adjustments for CQ undercoverage between 1948 and 1961).
Codebook [web page]
Public Laws Data | [Tab Delimited Text - 3.3mb] | N/A
Most Important Laws 1948-1998 | [Excel - 145 kb] | [Text - 72 kb]
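The linkage between the Public Laws and CQ Almanac datasets amounts to a join on a shared identification variable. The sketch below shows one way to do that in plain Python. The field names (`pl_id`, `cq_id`, `topic`) and the records are hypothetical; the codebooks give the actual identification variables.

```python
# Invented records; "pl_id" is a hypothetical name for the linking variable.
PUBLIC_LAWS = [{"pl_id": "95-1", "topic": 8}, {"pl_id": "95-2", "topic": 3}]
CQ_ARTICLES = [
    {"cq_id": 10, "pl_id": "95-1"},
    {"cq_id": 11, "pl_id": "95-1"},
    {"cq_id": 12, "pl_id": ""},  # initiative that did not become law
]


def link_articles_to_laws(articles, laws):
    """Attach the matching law's topic to each article that has one."""
    laws_by_id = {law["pl_id"]: law for law in laws}
    linked = []
    for article in articles:
        law = laws_by_id.get(article["pl_id"])
        if law is not None:
            # Copy the article and add the topic of the matched law.
            linked.append({**article, "law_topic": law["topic"]})
    return linked


linked = link_articles_to_laws(CQ_ARTICLES, PUBLIC_LAWS)
print(linked)
```

Articles with no matching law are dropped here; keeping them with an empty `law_topic` would be the equivalent of a left join.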
State of the Union Speeches
This dataset (~19,000 records) contains information on each quasi-statement in the Presidential State of the Union Speeches from 1946 to 2005. Each record is coded by our system of policy content categories and other variables. Users can dynamically link to full text versions of the speech for further analysis.
Codebook [ms word 27.5 kb]
State of the Union Data | [Excel - 6mb]
Supreme Court Cases
The Supreme Court dataset (~7,000 records) spans 1953 to 1998 and is one of the most comprehensive looks at the Court from a policy perspective. The accompanying codebook covers basic data issues and also gives users an introduction to the Court's terminology and procedures.
Codebook [webpage]
Supreme Court Data | [Excel - 18mb]
Congressional Roll Call Votes
The Congressional Roll Call Voting dataset codes every congressional roll call vote from 1946 to 2000 using the Policy Agendas Project common topic coding system. The dataset currently contains ~37,000 records, with ~19,000 votes in the House and ~18,000 votes in the Senate.
Codebook [MS Word 42 kb]
House Records | [Excel - 8.4mb]
Senate Records | [Excel - 7.4mb]